Forem

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Building an Enterprise RAG System for Non-English Documents: A Turkish Case Study

Building an Enterprise RAG System for Non-English Documents: A Turkish Case Study

Comments
4 min read
5 Critical Failures We Hit Shipping a Multi-Tenant RAG Chatbot to 500+ Enterprises

5 Critical Failures We Hit Shipping a Multi-Tenant RAG Chatbot to 500+ Enterprises

5
Comments
5 min read
Why Naive Similarity Search Will Destroy Your RAG Agent (And What To Do Instead)

Why Naive Similarity Search Will Destroy Your RAG Agent (And What To Do Instead)

Comments
4 min read
Building an Enterprise RAG System for Non-English Documents

Building an Enterprise RAG System for Non-English Documents

1
Comments
5 min read
Why RAG Is Failing at Complex Questions (And How Knowledge Graphs Fix It)
Cover image for Why RAG Is Failing at Complex Questions (And How Knowledge Graphs Fix It)

Why RAG Is Failing at Complex Questions (And How Knowledge Graphs Fix It)

Comments
6 min read
RAG Is Not Dead: Advanced Retrieval Patterns That Actually Work in 2026

RAG Is Not Dead: Advanced Retrieval Patterns That Actually Work in 2026

Comments
6 min read
Our Group project was chaos until this agent

Our Group project was chaos until this agent

Comments 1
6 min read
Why Most RAG Systems Fail in Production (And How to Design One That Actually Works)
Cover image for Why Most RAG Systems Fail in Production (And How to Design One That Actually Works)

Why Most RAG Systems Fail in Production (And How to Design One That Actually Works)

Comments 1
4 min read
Bringing The Receipts - 95% AI LLM Token Savings
Cover image for Bringing The Receipts - 95% AI LLM Token Savings

Bringing The Receipts - 95% AI LLM Token Savings

1
Comments
10 min read
Building a Perplexity Clone for Local LLMs in 50 Lines of Python

Building a Perplexity Clone for Local LLMs in 50 Lines of Python

1
Comments 1
6 min read
Scaling LLMs at the Edge: A journey through distillation, routers, and embeddings
Cover image for Scaling LLMs at the Edge: A journey through distillation, routers, and embeddings

Scaling LLMs at the Edge: A journey through distillation, routers, and embeddings

1
Comments
20 min read
Self-Improving RAG: Teaching Claude Code to Learn From Errors
Cover image for Self-Improving RAG: Teaching Claude Code to Learn From Errors

Self-Improving RAG: Teaching Claude Code to Learn From Errors

Comments
6 min read
RAG + FastAPI in Action: Creating a Smart Business Analytics Dashboard in Python

RAG + FastAPI in Action: Creating a Smart Business Analytics Dashboard in Python

Comments
9 min read
I Built a Fully Local Paper RAG on an RTX 4060 8GB — BGE-M3 + Qwen2.5-32B + ChromaDB

I Built a Fully Local Paper RAG on an RTX 4060 8GB — BGE-M3 + Qwen2.5-32B + ChromaDB

Comments
10 min read
What MCP Actually Is (And Why It Exists)
Cover image for What MCP Actually Is (And Why It Exists)

What MCP Actually Is (And Why It Exists)

2
Comments 3
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.