DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Why RAG is Like Playing Space Invaders. The Higher the Level the More Difficult it Becomes to Win.

Why RAG is Like Playing Space Invaders. The Higher the Level the More Difficult it Becomes to Win.

Comments
15 min read
GPT-5.5 Just Raised the Bar for Everyone — And It's Not About Benchmarks

GPT-5.5 Just Raised the Bar for Everyone — And It's Not About Benchmarks

Comments
3 min read
What is Model Context Protocol (MCP)? A Developer’s Guide

What is Model Context Protocol (MCP)? A Developer’s Guide

1
Comments
3 min read
AI Weekly — 2026-05-08 | MS-OpenAI loosens, and the race moves to control

AI Weekly — 2026-05-08 | MS-OpenAI loosens, and the race moves to control

Comments
9 min read
AI 週報 — 2026-05-08 MS-OpenAI 合作鬆動,AI 競賽轉向控制面

AI 週報 — 2026-05-08 MS-OpenAI 合作鬆動,AI 競賽轉向控制面

Comments
5 min read
Quando a IA mente e te pede CALMA: um papo reto sobre alucinações

Quando a IA mente e te pede CALMA: um papo reto sobre alucinações

Comments
3 min read
What PocketOS Teaches Us About Agentic Architecture

What PocketOS Teaches Us About Agentic Architecture

Comments
8 min read
The Context Window Is Not Your Memory

The Context Window Is Not Your Memory

Comments
4 min read
Building a Production LLM Evaluation Harness in Pytest: Cost-Bounded, Flake-Aware, CI-Gated (Runnable Python)

Building a Production LLM Evaluation Harness in Pytest: Cost-Bounded, Flake-Aware, CI-Gated (Runnable Python)

Comments
9 min read
# What LoRA Actually Adapts and Why Higher Rank Doesn't Always Buy What It Looks Like It Should Explainer by: Eyoel Nebiyu

# What LoRA Actually Adapts and Why Higher Rank Doesn't Always Buy What It Looks Like It Should Explainer by: Eyoel Nebiyu

Comments
5 min read
llama.cpp supports Sparse MoE, new Qwen3.6 GGUF, & WebWorld for local agents

llama.cpp supports Sparse MoE, new Qwen3.6 GGUF, & WebWorld for local agents

Comments
3 min read
I tracked 332 AI releases this week. 85% were noise.

I tracked 332 AI releases this week. 85% were noise.

Comments
2 min read
Meta's AI agent rewrote its own harness 100 times -- the loop that makes self-improving agents work

Meta's AI agent rewrote its own harness 100 times -- the loop that makes self-improving agents work

Comments
4 min read
AI API Cost Caps and Multi-Key Failover: The Boring Layer That Matters

AI API Cost Caps and Multi-Key Failover: The Boring Layer That Matters

1
Comments
1 min read
Documents are records waiting to exist

Documents are records waiting to exist

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.