Forem

# rag

Retrieval-augmented generation (RAG) is an architectural approach that improves large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

🧠OrKA-Reasoning: How Workflow Execution Really Works

4
Comments 2
4 min read
Snowflake AI_EMBED Function - Your Gateway to Unified Multimodal Vector Search

1
Comments
5 min read
Code Generation with ‘Graph RAG’, AstraDB and gpt-oss

2
Comments 1
18 min read
Benchmarking LLM Search APIs: Tavily vs Web Search Plus vs OpenAI Web Search

Comments
4 min read
Let's Build a Voice RAG System That Actually Works 🎉

3
Comments
20 min read
Sub-millisecond similarity search on IVF indexes with PDX

Comments
6 min read
Building Production RAG in 2025: Lessons from 50+ Deployments

1
Comments
2 min read
Beyond Search: How to Chat with Your Documents Using AstraDB Vector Database, Docling and Granite

Comments
15 min read
RAG Chatbot - MoviesGPT

Comments
2 min read
How Retrieval Algorithms Shape Better LLM Responses?

2
Comments
3 min read
Can you take your AI's memory with you? 🚫

1
Comments
1 min read
✨ A Small Update: RAG Pitfalls, Unexpected Endorsement, and That Feeling of Fighting Ghosts with FAISS

Comments
2 min read
🚀 Amazon S3 Adds Native Vector Search — A Game-Changer for GenAI Builders (Especially Students)

1
Comments
4 min read
🔍 What is Retrieval-Augmented Generation (RAG)?

Comments
2 min read
Teach Your Free AI Chatbot with Reports & Web Data (RAG Basics)

9
Comments
10 min read
Accelerate Advanced RAG with Tensorlake

7
Comments 1
20 min read
Binary Quantization: the 1-bit trick that turns terabytes of vectors into pocket-sized fingerprints

Comments
7 min read
🚀 RAG in Go: From Zero to Answers in One Evening

1
Comments
1 min read
Let's build a Free Chatbot with Streamlit and Gemini AI (Step-by-Step for Beginners)

6
Comments
6 min read
halfvec: Half the Bits, Twice the speed?

Comments
7 min read
🛡️ Paladin-mini: Open-Source Grounding Model That Actually Works in Production

Comments
5 min read
Probably Secure: A Look At The Security Concerns Of Deterministic Vs Probabilistic Systems

1
Comments
6 min read
RAG with LLMs: The Complete Guide to Retrieval-Augmented Generation

2
Comments
5 min read
Enhancing Domain-Specific Knowledge Graph Reasoning via Metapath-Based Large Model Prompt Learning

Comments
1 min read
Use Cases for HNSW-SQLite Library

1
Comments
2 min read