DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Build a RAG Pipeline from Scratch in Python: A Step-by-Step Guide

Build a RAG Pipeline from Scratch in Python: A Step-by-Step Guide

Comments
9 min read
Are We Using AI at the Wrong Scale?

Small models rivaling giants in code tasks

Are We Using AI at the Wrong Scale?

73
Comments 25
5 min read
🤖 Building a Private, Local WhatsApp AI Assistant with Node.js & Ollama

🤖 Building a Private, Local WhatsApp AI Assistant with Node.js & Ollama

Comments
2 min read
Why AI Teams Need a Unified Gateway Instead of More API Chaos

Why AI Teams Need a Unified Gateway Instead of More API Chaos

Comments
1 min read
Day 4: ReAct - Reasoning + Acting upon(Prompting Technique)

Day 4: ReAct - Reasoning + Acting upon(Prompting Technique)

1
Comments
1 min read
Gemma4 Tool Calling Fixes in llama.cpp, RTX cuBLAS MatMul Bug, & Local Ollama + Whisper UI

Gemma4 Tool Calling Fixes in llama.cpp, RTX cuBLAS MatMul Bug, & Local Ollama + Whisper UI

Comments
3 min read
The gay jailbreak: I ran the viral technique against my own production prompts and here's what I found

The gay jailbreak: I ran the viral technique against my own production prompts and here's what I found

2
Comments
8 min read
Five Atomic Skills, Two Approaches: Claude Code and a Paper

Five Atomic Skills, Two Approaches: Claude Code and a Paper

Comments
22 min read
Software Engineers Are Building Agents Wrong: Treat Agentic AI Like Distributed Systems, Not Prompt Chains

Software Engineers Are Building Agents Wrong: Treat Agentic AI Like Distributed Systems, Not Prompt Chains

Comments
4 min read
RAG Architecture — Prototype to Production in Three Stages

RAG Architecture — Prototype to Production in Three Stages

1
Comments
8 min read
Claude Code install and config for Ollama, llama.cpp, pricing

Claude Code install and config for Ollama, llama.cpp, pricing

Comments
9 min read
Building Your Own "Google Maps for Codebases": A Guide to Codebase Q&A with LLMs

Building Your Own "Google Maps for Codebases": A Guide to Codebase Q&A with LLMs

Comments
6 min read
Large Language Models, Explained Like You're a Curious Human

Large Language Models, Explained Like You're a Curious Human

Comments
6 min read
AI Agent Monitoring: How to Observe Autonomous AI Agents in Production

AI Agent Monitoring: How to Observe Autonomous AI Agents in Production

Comments
8 min read
Why I dropped stuffed prompts for Hindsight reflections

Why I dropped stuffed prompts for Hindsight reflections

Comments
8 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.