Engineers ~60–90 pages

Token Economics

Cutting Your LLM Bill Without Cutting Quality

Free Ebook EPUB + Markdown By David Kelly Price

About This Ebook

Engineering managers, senior engineers, and platform engineers responsible for LLM-powered systems and their costs — making decisions about inference spend

What you'll learn:

Where the Money Actually Goes
The Token Tax: Input vs. Output Costs
Context Compression Strategies
Prompt Caching and When to Use It
Model Selection and Cost-Quality Tradeoffs
Batching, Routing, and Request Shaping
Measuring What You're Spending
Building a Token Budget System

Get instant access to the EPUB and Markdown versions — read offline, share freely, and explore at your own pace.

Free Semantic Code Search

Try Pyckle in your codebase

The tool this book explores — semantic search, context routing, and code intelligence for Claude Code.

Get Started Free