Forem

# llm

Posts

Release my PR for the project Bifrost

Comments
2 min read
LiteLLM Broke at 300 RPS in Production. Here's How We Fixed It

5
Comments
4 min read
RAG is more than Vector Search

1
Comments
4 min read
Code Generation for Ablation Technique — Documentation

Comments
3 min read
How Bifrost Integrates With Your Existing LLM Stack (No Refactoring Required)

Comments
4 min read
Semantic Caching Cut Our LLM Costs by 40%

Comments
3 min read
Uncounted Tokens: The Game of Attack and Defense in AI Gateway Rate Limiting

Comments
3 min read
The Observability Tax: What You're Actually Paying for AI Agents (2026 Cost Reality)

Comments
2 min read
Building Your First Agentic AI: Complete Guide to MCP + Ollama Tool Calling

1
Comments
14 min read
The 6 Best AI Code Review Tools for Pull Requests in 2025

Comments
11 min read
Why Your API's Error Messages Fail When Called by an LLM (And How to Fix Them)

Comments
9 min read
Create Your First MCP App

2
Comments
6 min read
Stop Evaluating AI Agents Like ML Models: A Paradigm Shift for Developers

Comments
3 min read
Progressing in Bifrost project

Comments
2 min read
Base LLMs vs Instruction-Tuned LLMs: Understanding the Architecture Behind ChatGPT and Claude

Comments
3 min read
Reranking and Two-Stage Retrieval: Precision When It Matters Most

Comments
2 min read
Build Better RAG Pipelines: Scraping Technical Docs to Clean Markdown

Comments
2 min read
Bifrost: The Fastest Open Source LLM Gateway

Comments
4 min read
Dec 12, 2025 | The Tongyi Weekly: Your weekly dose of cutting-edge AI from Tongyi Lab

Comments
5 min read
TOON: Token-Oriented Object Notation – A Complete Guide for LLM Data Efficiency

Comments
3 min read
Dense vs Sparse Retrieval: Mastering FAISS, BM25, and Hybrid Search

Comments
15 min read
Deploying NVIDIA Dynamo & LMCache for LLMs: Installation, Containers, and Integration

3
Comments 2
2 min read
Prompt Length vs. Context Window: The Real Limits Behind LLM Performance

Comments
4 min read
Beyond the Black Box: Neuro‑Symbolic AI, Metacognition, and the Next Leap in Machine Intelligence

Comments
12 min read
Prompt‑Powered User Personas: From Messy Logs to Living Profiles

Comments
14 min read