DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
From 66% to 96%: How I Fixed a Drive-Thru Voice Agent Before It Took a Single Real Call

From 66% to 96%: How I Fixed a Drive-Thru Voice Agent Before It Took a Single Real Call

1
Comments
4 min read
Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks

Sonnet 4.6 vs Haiku 4.5 vs Opus 4.6: I Tested 3 Claude Models on 10 Real Tasks

Comments
3 min read
Model Showdown: Benchmarking Local vs Cloud LLMs on a Real Coding Task

Model Showdown: Benchmarking Local vs Cloud LLMs on a Real Coding Task

1
Comments
14 min read
Why I Built TokenBar: AI Spend Should Not Be a Monthly Surprise

Why I Built TokenBar: AI Spend Should Not Be a Monthly Surprise

Comments
1 min read
How Top Companies Are Shipping AI Agents Today (Apr 15)

How Top Companies Are Shipping AI Agents Today (Apr 15)

Comments
3 min read
Build an MCP Server for Agentic Web Scraping and Real-Time LLM Grounding

Build an MCP Server for Agentic Web Scraping and Real-Time LLM Grounding

1
Comments
6 min read
🩊 GoClaw Deep Dive đŸ€– — A Builder's Guide to a Multi-Tenant AI Agent Platform 📘

🩊 GoClaw Deep Dive đŸ€– — A Builder's Guide to a Multi-Tenant AI Agent Platform 📘

5
Comments
23 min read
The day I realized AI costs need a warning light

The day I realized AI costs need a warning light

Comments
2 min read
đŸ€– nanobot: A Comprehensive Build-Your-Own Guide 📚

đŸ€– nanobot: A Comprehensive Build-Your-Own Guide 📚

7
Comments
18 min read
I was tired of losing track of my AI conversations, so I built a Chrome extension

I was tired of losing track of my AI conversations, so I built a Chrome extension

5
Comments
4 min read
SafePaths: How We Reduced Token Consumption by 85% — The Benchmark Story

SafePaths: How We Reduced Token Consumption by 85% — The Benchmark Story

Comments
2 min read
Boosting llama.cpp with Auto-Tuning, Qwen Quantization Benchmarks, & Mobile Ollama AI Servers

Boosting llama.cpp with Auto-Tuning, Qwen Quantization Benchmarks, & Mobile Ollama AI Servers

Comments
3 min read
I Built an AI Agent to Do My Pre-Refinement. It Turned Into a Mirror of How We Wrote Tickets.

I Built an AI Agent to Do My Pre-Refinement. It Turned Into a Mirror of How We Wrote Tickets.

1
Comments
10 min read
340% and Climbing: What the CIS Prompt Injection Report Means for Enterprise AI Agents

340% and Climbing: What the CIS Prompt Injection Report Means for Enterprise AI Agents

Comments
10 min read
Voice-Controlled Local AI Agent

Voice-Controlled Local AI Agent

Comments
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.