Forem

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Claude's Model Context Protocol (MCP): The Standard for AI Interaction

Claude's Model Context Protocol (MCP): The Standard for AI Interaction

12
Comments
6 min read
How to Build a Retrieval Augmented Generation (RAG) Chatbot with LangChain, Milvus, Anthropic Claude 3 Sonnet, and mistral-embed

How to Build a Retrieval Augmented Generation (RAG) Chatbot with LangChain, Milvus, Anthropic Claude 3 Sonnet, and mistral-embed

1
Comments
8 min read
AI Agent Memory: A Comparative Analysis of LangGraph, CrewAI, and AutoGen

AI Agent Memory: A Comparative Analysis of LangGraph, CrewAI, and AutoGen

12
Comments 3
6 min read
Why Most AI Teams Are Stuck 🤔

Why Most AI Teams Are Stuck 🤔

39
Comments 4
2 min read
Building a simplified Retrieval Augmented Generation System with Supabase Storage and OpenAI Embeddings in Next.js

Building a simplified Retrieval Augmented Generation System with Supabase Storage and OpenAI Embeddings in Next.js

2
Comments
11 min read
The Rise of LLMs: Are We Dumbing Ourselves Down?

The Rise of LLMs: Are We Dumbing Ourselves Down?

3
Comments 1
2 min read
The Hidden Magic Behind Search: Dense, Sparse, and Metadata Filtering
Cover image for The Hidden Magic Behind Search: Dense, Sparse, and Metadata Filtering

The Hidden Magic Behind Search: Dense, Sparse, and Metadata Filtering

Comments
3 min read
How to Create RAG using DeepSeek R1, Ollama & Semantic Kernel .NET
Cover image for How to Create RAG using DeepSeek R1, Ollama & Semantic Kernel .NET

How to Create RAG using DeepSeek R1, Ollama & Semantic Kernel .NET

7
Comments 2
4 min read
Are LLMs Really Doomed?
Cover image for Are LLMs Really Doomed?

Are LLMs Really Doomed?

26
Comments
3 min read
Weather App With State Management for Long Running Conversations Using AI Agents
Cover image for Weather App With State Management for Long Running Conversations Using AI Agents

Weather App With State Management for Long Running Conversations Using AI Agents

2
Comments
2 min read
Best Practices for Production-Scale RAG Systems — An Implementation Guide
Cover image for Best Practices for Production-Scale RAG Systems — An Implementation Guide

Best Practices for Production-Scale RAG Systems — An Implementation Guide

1
Comments
12 min read
RAG Chatbot Tutorial: LangChain, Milvus, GPT-4o mini, and text-embedding-3-large

RAG Chatbot Tutorial: LangChain, Milvus, GPT-4o mini, and text-embedding-3-large

7
Comments
4 min read
NoLiMA: GPT-4o achieve 99.3% accuracy in short contexts (<1K tokens), performance degrades to 69.7% at 32K tokens.
Cover image for NoLiMA: GPT-4o achieve 99.3% accuracy in short contexts (<1K tokens), performance degrades to 69.7% at 32K tokens.

NoLiMA: GPT-4o achieve 99.3% accuracy in short contexts (<1K tokens), performance degrades to 69.7% at 32K tokens.

6
Comments 1
1 min read
Building Local AI Agents: A Practical Guide to Frameworks and Deployment
Cover image for Building Local AI Agents: A Practical Guide to Frameworks and Deployment

Building Local AI Agents: A Practical Guide to Frameworks and Deployment

3
Comments
6 min read
Building a RAG-Powered Support Chatbot in 24 Hours of Hackathon
Cover image for Building a RAG-Powered Support Chatbot in 24 Hours of Hackathon

Building a RAG-Powered Support Chatbot in 24 Hours of Hackathon

28
Comments 12
9 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.