Forem

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Build a RAG System from Scratch: Create an AI That Answers Questions About Your Codebase

Build a RAG System from Scratch: Create an AI That Answers Questions About Your Codebase

Comments
5 min read
Building a Production-Ready AI Customer Service Agent with HazelJS
Cover image for Building a Production-Ready AI Customer Service Agent with HazelJS

Building a Production-Ready AI Customer Service Agent with HazelJS

1
Comments
7 min read
Docling Speaks LaTeX: Unlocking Academic and Scientific Documents

Docling Speaks LaTeX: Unlocking Academic and Scientific Documents

Comments
23 min read
Show DEV: PardusDB – The "SQLite of Vector DBs" written in Rust

Show DEV: PardusDB – The "SQLite of Vector DBs" written in Rust

5
Comments
1 min read
Chatting with 3 Billion Base Pairs: Building a RAG Index for Your Personal Genome (WGS)

Chatting with 3 Billion Base Pairs: Building a RAG Index for Your Personal Genome (WGS)

Comments
4 min read
Scaling RAG : Demo to Production Ready

Scaling RAG : Demo to Production Ready

Comments
2 min read
From Chatbot to Medical AI: How I Used RAG, FAISS & Mistral to Ground AI in Reality
Cover image for From Chatbot to Medical AI: How I Used RAG, FAISS & Mistral to Ground AI in Reality

From Chatbot to Medical AI: How I Used RAG, FAISS & Mistral to Ground AI in Reality

4
Comments
4 min read
Are We Over-Engineering LLM Stacks Too Early?
Cover image for Are We Over-Engineering LLM Stacks Too Early?

Are We Over-Engineering LLM Stacks Too Early?

Comments 1
2 min read
What It Actually Takes to Run a RAG System in Production
Cover image for What It Actually Takes to Run a RAG System in Production

What It Actually Takes to Run a RAG System in Production

Comments
2 min read
Build a multi-assistant workflow with Pinecone Assistant in n8n
Cover image for Build a multi-assistant workflow with Pinecone Assistant in n8n

Build a multi-assistant workflow with Pinecone Assistant in n8n

Comments
2 min read
Building a Production RAG Server with Ollama, Open WebUI and Chroma DB
Cover image for Building a Production RAG Server with Ollama, Open WebUI and Chroma DB

Building a Production RAG Server with Ollama, Open WebUI and Chroma DB

Comments
1 min read
Medicine Encyclopedia 2.0: Stop Guessing and Start Scanning with Multimodal RAG

Medicine Encyclopedia 2.0: Stop Guessing and Start Scanning with Multimodal RAG

1
Comments
4 min read
From Paper Trails to Health Insights: Building a Personal EHR Semantic Search Engine with Hybrid Search

From Paper Trails to Health Insights: Building a Personal EHR Semantic Search Engine with Hybrid Search

1
Comments
4 min read
Stop Paying for APIs: Build a 100% Local AI Auditor with Python & Llama 3

Stop Paying for APIs: Build a 100% Local AI Auditor with Python & Llama 3

Comments
4 min read
Reducing OCR Cost in RAG Pipelines with Page-Level Detection

Reducing OCR Cost in RAG Pipelines with Page-Level Detection

Comments
1 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.