DEV Community

# rag

Retrieval-augmented generation (RAG) is an architectural approach that improves large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

Why Most RAG Systems Fail in Production (And How to Design One That Actually Works)

Comments 1
4 min read
Building a Scalable RAG Backend with Cloud Run Jobs and AlloyDB

39
Comments 4
6 min read
Bringing The Receipts - 95% AI LLM Token Savings

1
Comments
10 min read
Your AI Memory System Can't Tell a River Bank from a Savings Account

25
Comments 4
5 min read
Building a Perplexity Clone for Local LLMs in 50 Lines of Python

1
Comments 1
6 min read
Scaling LLMs at the Edge: A journey through distillation, routers, and embeddings

1
Comments
20 min read
RAG + FastAPI in Action: Creating a Smart Business Analytics Dashboard in Python

Comments
9 min read
I Built a Fully Local Paper RAG on an RTX 4060 8GB — BGE-M3 + Qwen2.5-32B + ChromaDB

Comments
10 min read
RAG in Practice — Part 6: RAG, Fine-Tuning, or Long Context?

2
Comments
9 min read
What MCP Actually Is (And Why It Exists)

2
Comments 3
4 min read
How CodiLay Reads a Codebase the Way a Detective Reads a Crime Scene

2
Comments
9 min read
Anatomy of a RAG System Architecture

Comments
5 min read
What Is OpenViking? (French-language post)

Comments
15 min read
What Is OpenViking? (Thai-language post)

Comments
3 min read
What Is OpenViking? (Spanish-language post)

Comments
11 min read