Forem

# rag

Retrieval-augmented generation (RAG) is an architectural approach that can improve the efficacy of large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.

Posts

Binary embedding: shrink vector storage by 95%
4 min read
🚀 Exploring the Power of Visualization: From Dependency Graphs to Molecular Structures 🧬
1 min read
Build RAG 10X Faster
3 min read
The Rise of AI Coding Assistants: How They’re Changing the Developer’s Workflow
5 min read
How we used gpt-4o for image detection with 350 very similar, single image classes.
9 min read
pg_auto_embeddings — text embeddings directly in Postgres, without extensions
4 min read
NVIDIA CES 2025 Keynote: AI Revolution and the $3000 Personal Supercomputer
3 min read
Relational Databases Holding You Back?
1 min read
Inference with Fine-Tuned Models: Delivering the Message
2 min read
Git clone - that repo is too big : HELP!
2 min read
Building a Friends-Themed Chatbot: Exploring Amazon Bedrock for Dialogue Refinement
10 min read
Building an AI Workflow to Generate Reddit Comments with KaibanJS
2 min read
How to run Ollama on Windows using WSL
3 min read
Rust and Generative AI: Creating High-Performance Applications
4 min read
RAG - Designing the CLI interface
7 min read