Forem

# rag

Retrieval-augmented generation (RAG) is an architectural approach that improves the output of large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

My Local/Remote LLM Studio — watsonx.ai and Ollama (part 1)
18 min read
Fitera: AI-Powered Nutrition and Fitness Tracking Application
3 min read
Building NeuroStash - III
5 min read
Embeddings & Cosine Similarity Explained Simply
10 min read
Retrieval Technique Series-6.A Discourse on Design in High-Performance Retrieval Systems
4 min read
The Hidden Failures in RAG Systems — And How WFGY Fixes Them
3 min read
🧠 Build Your Own Document Q&A Assistant with GPT, Redis & Docker
3 min read
Building Neurostash - I
5 min read
Automation & Optimization of Grocery Shopping
1 min read
RAG to Riches: Transforming AI with Smarter Context
5 min read
What Is Vertex AI Agent Memory Bank ?
4 min read
🚀 Building an AI Resume Screener with GPT-4 + LangChain + FAISS
2 min read
Snowflake AI_EMBED Function - Your Gateway to Unified Multimodal Vector Search
5 min read
Benchmarking LLM Search APIs: Tavily vs Web Search Plus vs OpenAI Web Search
4 min read
Sub-millisecond similarity search on IVF indexes with PDX
6 min read
Beyond Search: How to Chat with Your Documents Using AstraDB Vector Database, Docling and Granite
15 min read
RAG Chatbot - MoviesGPT
2 min read
✨ A Small Update: RAG Pitfalls, Unexpected Endorsement, and That Feeling of Fighting Ghosts with FAISS
2 min read
🚀 Amazon S3 Adds Native Vector Search — A Game-Changer for GenAI Builders (Especially Students)
4 min read
🔍 What is Retrieval-Augmented Generation (RAG)?
2 min read
Binary Quantization: the 1-bit trick that turns terabytes of vectors into pocket-sized fingerprints
7 min read
halfvec: Half the Bits, Twice the speed?
7 min read
🛡️ Paladin-mini: Open-Source Grounding Model That Actually Works in Production
5 min read
Graphs over Chains - My LangGraph Journey (part-1)
3 min read
Use Cases for HNSW-SQLite Library
2 min read