Engineers | ~60–90 pages

Token Economics

Cutting Your LLM Bill Without Cutting Quality

Free Ebook | EPUB + Markdown | By David Kelly Price

About This Ebook

For engineering managers, senior engineers, and platform engineers who are responsible for LLM-powered systems and who make decisions about inference spend.

What you'll learn:

  • Where the Money Actually Goes
  • The Token Tax: Input vs. Output Costs
  • Context Compression Strategies
  • Prompt Caching and When to Use It
  • Model Selection and Cost-Quality Tradeoffs
  • Batching, Routing, and Request Shaping
  • Measuring What You're Spending
  • Building a Token Budget System

Get instant access to the EPUB and Markdown versions — read offline, share freely, and explore at your own pace.

Free Semantic Code Search

Try Pyckle in your codebase

Pyckle is the tool this book explores: semantic search, context routing, and code intelligence for Claude Code.
