DEV Community

# costoptimization

Practical strategies and stories about reducing cloud infrastructure costs.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Part 8 — Token-by-Token: Why AI Generates Text One Word at a Time (And Why It Costs 4x More)

Part 8 — Token-by-Token: Why AI Generates Text One Word at a Time (And Why It Costs 4x More)

Comments
9 min read
The Shadow Cloud Spend: $50k a Month FinOps Audit of Forgotten Dev Accounts

The Shadow Cloud Spend: $50k a Month FinOps Audit of Forgotten Dev Accounts

Comments
7 min read
They Don't Have the Money (And Neither Do You): The Coming Era of Small Models

They Don't Have the Money (And Neither Do You): The Coming Era of Small Models

Comments
6 min read
The History of Expanso (Part 4): The Mismatch

The History of Expanso (Part 4): The Mismatch

Comments
3 min read
How a SaaS platform cut infrastructure costs by 40% while improving response times

How a SaaS platform cut infrastructure costs by 40% while improving response times

Comments
3 min read
How to Add Old Models to Claude Code /model Picker: 3 Methods Tested

How to Add Old Models to Claude Code /model Picker: 3 Methods Tested

Comments
5 min read
I Audited 3 Months of Claude Code Billing — Most Community Cost-Saving Tips Don''t Work

I Audited 3 Months of Claude Code Billing — Most Community Cost-Saving Tips Don''t Work

Comments
7 min read
Slash LLM Costs: open source LLM API gateway for 14+ Providers

Slash LLM Costs: open source LLM API gateway for 14+ Providers

1
Comments
8 min read
Cutting our Claude API bill by 78% with prompt caching

Cutting our Claude API bill by 78% with prompt caching

Comments
3 min read
Cloud Cost Optimization: Strategies That Actually Work

Cloud Cost Optimization: Strategies That Actually Work

3
Comments
7 min read
How to Measure and Reduce Your LLM Tokenizer Costs

How to Measure and Reduce Your LLM Tokenizer Costs

Comments
5 min read
Cloud Cost FinOps: Cut Your AWS Bill by 40% Without Sacrificing Performance

Cloud Cost FinOps: Cut Your AWS Bill by 40% Without Sacrificing Performance

Comments
6 min read
When cloud becomes more expensive than bare metal

When cloud becomes more expensive than bare metal

Comments
3 min read
Does a Long Claude Code Session Waste Tokens? A Cost Model Most People Get Wrong

Does a Long Claude Code Session Waste Tokens? A Cost Model Most People Get Wrong

Comments
7 min read
How to Scale Video Processing to 1000+ Videos Per Day (Without Breaking the Bank)

How to Scale Video Processing to 1000+ Videos Per Day (Without Breaking the Bank)

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.