Forem

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

[Python] Build a Smart Document Assistant LINE Bot with Python + Gemini File Search: Let AI Help You Read Documents

7
Comments
9 min read
LLMs Hallucinate. RAG Fixes That — Here’s How We Built a Reliable Healthcare AI

Comments
3 min read
Multi-Tenant Design for Bedrock Knowledge Base: Solving the Account Limit with Metadata Filtering

2
Comments
3 min read
I Built a TUI to Visualize RAG Chunking because chunk_size=1000 is a Lie 📉

Comments
3 min read
Introducing Agentic Chart Extraction

141
Comments 3
6 min read
The Client Who Wanted AI to "Remember Everything" (And Why That Was a Terrible Idea)

Comments
5 min read
Building an Intelligent Legal Contract Auditor with Python

Comments
5 min read
CLaRa: Fixing RAG’s Broken Retrieval–Generation Pipeline With Shared-Space Learning

Comments
3 min read
A RAG-Free Technique That Makes LLM Outputs Stable, Predictable, and Auditable

Comments
2 min read
Course: Large Language Models and Generative AI for NLP — 2025

10
Comments 1
1 min read
Inside Memcortex: A Lightweight Semantic Memory Layer for LLMs

1
Comments 1
4 min read
Vector Database (OpenAI and Supabase )-Part 2 (Setup)

11
Comments 1
6 min read
Building a Hybrid-Private RAG Platform on AWS: From Prototype to Production with Python

Comments
7 min read
JVector — An Introduction-What is JVector? (Part 1)

Comments 1
3 min read
Local RAG vs Cloud RAG: What Changes When You Leave the Demo

Comments
3 min read