Forem

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
You Don't Know RAG. You Know Simple RAG.

You Don't Know RAG. You Know Simple RAG.

Comments
4 min read
Building a Browser-Based RAG System with WebGPU
Cover image for Building a Browser-Based RAG System with WebGPU

Building a Browser-Based RAG System with WebGPU

3
Comments
3 min read
# Medical RAG Architecture Overview #llmszoomcamp

# Medical RAG Architecture Overview #llmszoomcamp

Comments
5 min read
# Medical RAG Architecture Overview #llmszoomcamp

# Medical RAG Architecture Overview #llmszoomcamp

Comments
5 min read
Building Custom Evaluators for AI Applications: A Technical Guide to AI Quality Assessment

Building Custom Evaluators for AI Applications: A Technical Guide to AI Quality Assessment

Comments
19 min read
RAG LLM: Why Your AI Costs 10x More Than It Should (And How to Fix It)
Cover image for RAG LLM: Why Your AI Costs 10x More Than It Should (And How to Fix It)

RAG LLM: Why Your AI Costs 10x More Than It Should (And How to Fix It)

Comments
5 min read
# Data Ingestion & Vector Store #llmszoomcamp

# Data Ingestion & Vector Store #llmszoomcamp

Comments
2 min read
TOON Benchmarks: A Critical Analysis of Different Results
Cover image for TOON Benchmarks: A Critical Analysis of Different Results

TOON Benchmarks: A Critical Analysis of Different Results

2
Comments 1
7 min read
Prompt Caching Slashed My AI Bills by 90%. Here's What Nobody Tells You.
Cover image for Prompt Caching Slashed My AI Bills by 90%. Here's What Nobody Tells You.

Prompt Caching Slashed My AI Bills by 90%. Here's What Nobody Tells You.

Comments
5 min read
Train it or feed it? Teaching LLMs your data the smart way
Cover image for Train it or feed it? Teaching LLMs your data the smart way

Train it or feed it? Teaching LLMs your data the smart way

Comments
4 min read
Understanding RAG: How AI Models Learn to Search Before They Speak

Understanding RAG: How AI Models Learn to Search Before They Speak

1
Comments
3 min read
About context and LLM
Cover image for About context and LLM

About context and LLM

2
Comments
7 min read
The RAG Debugging Playbook: A Step-by-Step Guide to Trace-Level Failures and Fixes

The RAG Debugging Playbook: A Step-by-Step Guide to Trace-Level Failures and Fixes

Comments
10 min read
Synthetic Data for RAG: Safe Generation, Deduplication, and Drift-Aware Curation in 2025

Synthetic Data for RAG: Safe Generation, Deduplication, and Drift-Aware Curation in 2025

2
Comments
10 min read
Why Your AI Agents Keep Dropping the Ball—and How LangChain Plus PyTorch Can Salvage Your Solo Gig
Cover image for Why Your AI Agents Keep Dropping the Ball—and How LangChain Plus PyTorch Can Salvage Your Solo Gig

Why Your AI Agents Keep Dropping the Ball—and How LangChain Plus PyTorch Can Salvage Your Solo Gig

Comments
6 min read
Building a Simple Modern RAG Application with Asyncio and Chainlit

Building a Simple Modern RAG Application with Asyncio and Chainlit

2
Comments
4 min read
Tired of AI Hallucinations? I Built a RAG App to Keep My Research Grounded.

Tired of AI Hallucinations? I Built a RAG App to Keep My Research Grounded.

5
Comments 1
4 min read
Why Most RAG Pipelines Fail in Production (and How to Fix Them)
Cover image for Why Most RAG Pipelines Fail in Production (and How to Fix Them)

Why Most RAG Pipelines Fail in Production (and How to Fix Them)

3
Comments
2 min read
Set up RAG with Genkit and Firebase in 15 minutes

Set up RAG with Genkit and Firebase in 15 minutes

2
Comments
6 min read
# Comprehensive Monitoring & Observability #llmszoomcamp

# Comprehensive Monitoring & Observability #llmszoomcamp

Comments
8 min read
Official Native Java Support for Docling: Building Better Apps Just Got Easier

Official Native Java Support for Docling: Building Better Apps Just Got Easier

Comments
4 min read
Building Semantica: An AI-Powered Academic Search Platform with MindsDB
Cover image for Building Semantica: An AI-Powered Academic Search Platform with MindsDB

Building Semantica: An AI-Powered Academic Search Platform with MindsDB

10
Comments
9 min read
Amazon S3 Vectors: When Storage Learns to Think
Cover image for Amazon S3 Vectors: When Storage Learns to Think

Amazon S3 Vectors: When Storage Learns to Think

Comments
8 min read
From 70K to 2K Tokens: Optimizing SQL Generation with RAG Architecture

From 70K to 2K Tokens: Optimizing SQL Generation with RAG Architecture

15
Comments 1
4 min read
🚀 Deploying Cognee AI Starter App on AWS ECS Using Terraform
Cover image for 🚀 Deploying Cognee AI Starter App on AWS ECS Using Terraform

🚀 Deploying Cognee AI Starter App on AWS ECS Using Terraform

13
Comments 2
8 min read
loading...