DEV Community

# llm

Posts

Forget Your RAG: Build Your Own LLM Wiki in C# with Ollama + Kimi (Step‑by‑Step Guide)

10 min read
Agentic RAG: What It Is, Why Teams Use It, and Where It Gets Complicated

3 min read
Coding in the Age of AI Is Not What You Think

6 min read
DeepClaude: I Combined Claude Code with DeepSeek V4 Pro in My Agent Loop and the Numbers Threw Me Off

8 min read
Gemma 4 MTP, vibevoice.cpp for Multimodal AI, & Ollama Desktop Layer for Local Deployment

3 min read
What I learned tuning a Reddit DM agent through 8 versions in 24 hours

15 min read
PII Protection for AI Agents: Why Detection Isn't Enough and What Prevents Actual Exposure

8 min read
I rebuilt my open-source AI coding agent that routes each pipeline stage to a different LLM

5 min read
Why `$0.0029` and `$0.0047` Can Both Be Right: Prefix Caching for API-Served LLM Judges *By Eyoel Nebiyu*

3 min read
From OOM to 262K Context: Running Qwen3-Coder 30B Locally on 8GB VRAM

8 min read
The Writing Is the Moat, Not the Model

5 min read
27 days to the DeepSeek V4-Pro cliff: what a 4x price jump looks like in production

5 min read
RAG patterns that work for structured data vs ones that fail

5 min read
Why comparing average scores is the wrong way to evaluate LLM prompts (and what to do instead)

6 min read
4 Types of Hallucinations: One Detection Pattern Per Type

7 min read