DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Lost in the Middle: Why LLMs Quietly Ignore the Centre of Their Own Context Window

Lost in the Middle: Why LLMs Quietly Ignore the Centre of Their Own Context Window

Comments
3 min read
I shipped 14 MCP servers this week. Gemma 4 changes which ones matter.

Gemma 4 Challenge: Write about Gemma 4 Submission

I shipped 14 MCP servers this week. Gemma 4 changes which ones matter.

3
Comments
6 min read
The 10 Best AI Memory Layers for Agents in 2026

The 10 Best AI Memory Layers for Agents in 2026

Comments
7 min read
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Comments
5 min read
AI boyfriends are 10x bigger than AI girlfriends

AI boyfriends are 10x bigger than AI girlfriends

Comments
15 min read
LangChain vs LangGraph: Why AI Agents Need Stateful Orchestration

LangChain vs LangGraph: Why AI Agents Need Stateful Orchestration

Comments 1
4 min read
Your RAG works on Claude. Does it work on Gemma 4? Drift detection across model families.

Your RAG works on Claude. Does it work on Gemma 4? Drift detection across model families.

Comments 1
6 min read
Why “Local Document AI” Is Really an OCR + RAG + Local Inference Problem

Why “Local Document AI” Is Really an OCR + RAG + Local Inference Problem

5
Comments
4 min read
How I Cut My AI API Costs by 60%: A Data-Driven Approach to LLM Model Selection

How I Cut My AI API Costs by 60%: A Data-Driven Approach to LLM Model Selection

Comments
2 min read
OWASP Top 10 for LLMs: A Practitioner’s Implementation Guide

OWASP Top 10 for LLMs: A Practitioner’s Implementation Guide

Comments
9 min read
I Benchmarked the Voice AI Stack in May 2026: What Actually Holds Up in Production

I Benchmarked the Voice AI Stack in May 2026: What Actually Holds Up in Production

Comments
12 min read
DeepClaude Merges Two AI Models Into One Agent Loop

DeepClaude Merges Two AI Models Into One Agent Loop

Comments
6 min read
RAG - Chunking

RAG - Chunking

Comments
3 min read
XML Tags Don't Help Short Prompts — Here's When They Actually Matter (2026)

XML Tags Don't Help Short Prompts — Here's When They Actually Matter (2026)

Comments
4 min read
Building an MCP server — lessons from thunderbit-mcp

Building an MCP server — lessons from thunderbit-mcp

Comments 1
8 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.