Forem

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
vLLM vs SGLang: Enterprise LLM Inference Comparison

vLLM vs SGLang: Enterprise LLM Inference Comparison

1
Comments
5 min read
MCP Made Me Rethink Who My Software Serves

MCP Made Me Rethink Who My Software Serves

Comments
7 min read
First Principles of AI Context
Cover image for First Principles of AI Context

First Principles of AI Context

2
Comments
7 min read
How We Achieved 30% Conversion Lift by Moving from GPT-4 to LoRA Adapters
Cover image for How We Achieved 30% Conversion Lift by Moving from GPT-4 to LoRA Adapters

How We Achieved 30% Conversion Lift by Moving from GPT-4 to LoRA Adapters

Comments
11 min read
I Spent 48 Hours Red-Teaming the "Magic AI Assistant" Everyone's Hyping. Here's What I Found.
Cover image for I Spent 48 Hours Red-Teaming the "Magic AI Assistant" Everyone's Hyping. Here's What I Found.

I Spent 48 Hours Red-Teaming the "Magic AI Assistant" Everyone's Hyping. Here's What I Found.

Comments
9 min read
Diseñando memoria narrativa trazable para agentes conversacionales

Diseñando memoria narrativa trazable para agentes conversacionales

2
Comments
15 min read
I Built a Multi-Agent LLM Orchestrator That Runs Claude, GPT, and Gemini in Parallel

I Built a Multi-Agent LLM Orchestrator That Runs Claude, GPT, and Gemini in Parallel

Comments 1
5 min read
The Monolith Is Dead: Why Multi-Agent Architecture Is the Most Critical AI Engineering Decision of 2026
Cover image for The Monolith Is Dead: Why Multi-Agent Architecture Is the Most Critical AI Engineering Decision of 2026

The Monolith Is Dead: Why Multi-Agent Architecture Is the Most Critical AI Engineering Decision of 2026

Comments
7 min read
I gave an LLM 248 tools and accuracy dropped to 12%. Here's what fixed it.
Cover image for I gave an LLM 248 tools and accuracy dropped to 12%. Here's what fixed it.

I gave an LLM 248 tools and accuracy dropped to 12%. Here's what fixed it.

4
Comments
3 min read
Local LLM Inference on Windows 11 and AMD GPU using WSL and llama.cpp

Local LLM Inference on Windows 11 and AMD GPU using WSL and llama.cpp

1
Comments
3 min read
Best Ways to Monitor Claude Code Token Usage and Costs in 2026

Best Ways to Monitor Claude Code Token Usage and Costs in 2026

Comments 1
6 min read
Why I Replaced LangChain with 15KB of httpx

Why I Replaced LangChain with 15KB of httpx

Comments
6 min read
I built a “deterministic” LLM text rephraser with a validation pipeline - looking for architectural feedback
Cover image for I built a “deterministic” LLM text rephraser with a validation pipeline - looking for architectural feedback

I built a “deterministic” LLM text rephraser with a validation pipeline - looking for architectural feedback

Comments
3 min read
One command to add structured markup to your AI agent
Cover image for One command to add structured markup to your AI agent

One command to add structured markup to your AI agent

23
Comments 2
4 min read
I built memory decay for AI agents using the Ebbinghaus forgetting curve

I built memory decay for AI agents using the Ebbinghaus forgetting curve

24
Comments 2
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.