Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
llm
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
KVQuant: Run 70B LLMs on 8GB RAM with Real-Time KV Cache Compression
Aman Sachan
Aman Sachan
Aman Sachan
Follow
Apr 30
KVQuant: Run 70B LLMs on 8GB RAM with Real-Time KV Cache Compression
#
python
#
llm
#
ai
#
opensource
1
 reaction
Comments
Add Comment
1 min read
I Built a Knowledge Base That Thinks — Inspired by Karpathy’s LLM Wiki
Charles Wu
Charles Wu
Charles Wu
Follow
for
seekdb
Apr 30
I Built a Knowledge Base That Thinks — Inspired by Karpathy’s LLM Wiki
#
ai
#
productivity
#
llm
#
bigdata
5
 reactions
Comments
Add Comment
6 min read
Cencori: A Serverless Infrastructure Layer for Secure and Scalable AI Applications
Ladipo Samuel
Ladipo Samuel
Ladipo Samuel
Follow
Apr 30
Cencori: A Serverless Infrastructure Layer for Secure and Scalable AI Applications
#
ai
#
llm
#
security
#
serverless
2
 reactions
Comments
Add Comment
5 min read
KVQuant: Run 70B LLMs on 8GB RAM with 4-bit KV Cache Quantization
Aman Sachan
Aman Sachan
Aman Sachan
Follow
Apr 30
KVQuant: Run 70B LLMs on 8GB RAM with 4-bit KV Cache Quantization
#
python
#
llm
#
quantization
#
optimization
Comments
Add Comment
1 min read
Securing Agentic Workflows: A Deterministic 'Human-in-the-Loop' Pattern for LLMs
Badri C
Badri C
Badri C
Follow
Apr 30
Securing Agentic Workflows: A Deterministic 'Human-in-the-Loop' Pattern for LLMs
#
agents
#
architecture
#
llm
#
security
Comments
Add Comment
5 min read
software engineers are becoming reliability engineers for generated output
Paulo Victor Leite Lima Gomes
Paulo Victor Leite Lima Gomes
Paulo Victor Leite Lima Gomes
Follow
Apr 30
software engineers are becoming reliability engineers for generated output
#
ai
#
softwareengineering
#
reliability
#
llm
Comments
Add Comment
5 min read
I just wanted to chat with my Raspberry Pi.
Grega Snoj
Grega Snoj
Grega Snoj
Follow
May 4
I just wanted to chat with my Raspberry Pi.
#
ai
#
python
#
raspberrypi
#
llm
Comments
Add Comment
9 min read
Fix Your Prompt Structure Before You Touch Your Infrastructure
Parag Darade
Parag Darade
Parag Darade
Follow
Apr 30
Fix Your Prompt Structure Before You Touch Your Infrastructure
#
ai
#
llm
#
rag
#
machinelearning
Comments
Add Comment
4 min read
Why File-to-Markdown Conversion Is Becoming an AI Input Layer
dengkui yang
dengkui yang
dengkui yang
Follow
Apr 30
Why File-to-Markdown Conversion Is Becoming an AI Input Layer
#
markitdown
#
llm
#
ai
Comments
1
 comment
7 min read
I Compressed GPT-2 to Run on an Arduino
Aman Sachan
Aman Sachan
Aman Sachan
Follow
Apr 30
I Compressed GPT-2 to Run on an Arduino
#
llm
#
embedded
#
tinyml
#
python
Comments
Add Comment
1 min read
How LLMs Memorize Phone Numbers (and How Labs Stop It)
Gabriel Anhaia
Gabriel Anhaia
Gabriel Anhaia
Follow
Apr 29
How LLMs Memorize Phone Numbers (and How Labs Stop It)
#
ai
#
llm
#
security
#
privacy
Comments
Add Comment
7 min read
I Let My AI Agent Run Overnight. It Cost $437.
Magicrails
Magicrails
Magicrails
Follow
Apr 29
I Let My AI Agent Run Overnight. It Cost $437.
#
agents
#
ai
#
devjournal
#
llm
1
 reaction
Comments
Add Comment
5 min read
TurboQuant on a MacBook Pro, part 2: perplexity, KL divergence, and asymmetric K/V on M5 Max
Christopher Maher
Christopher Maher
Christopher Maher
Follow
Apr 29
TurboQuant on a MacBook Pro, part 2: perplexity, KL divergence, and asymmetric K/V on M5 Max
#
ai
#
llm
#
kubernetes
#
opensource
Comments
Add Comment
8 min read
Why I'm Building a Local-First AI Coding Workspace (And How Behavioral Routing Makes It Work)
Eli Hadam Zucker
Eli Hadam Zucker
Eli Hadam Zucker
Follow
Apr 29
Why I'm Building a Local-First AI Coding Workspace (And How Behavioral Routing Makes It Work)
#
ai
#
rust
#
llm
#
webdev
Comments
Add Comment
6 min read
Prompt Caching Works. Your Prompt Assembly Code Does Not.
Parag Darade
Parag Darade
Parag Darade
Follow
Apr 29
Prompt Caching Works. Your Prompt Assembly Code Does Not.
#
ai
#
llm
#
rag
#
machinelearning
Comments
Add Comment
4 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account