Forem

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
I Tested 28 Query Pairs to See if Semantic Caches Actually Lie to Users. The Result Surprised Me
Cover image for I Tested 28 Query Pairs to See if Semantic Caches Actually Lie to Users. The Result Surprised Me

I Tested 28 Query Pairs to See if Semantic Caches Actually Lie to Users. The Result Surprised Me

7
Comments
11 min read
When Generic Benchmarks Fail: Building a Sales-Domain Evaluation Bench from Scratch

When Generic Benchmarks Fail: Building a Sales-Domain Evaluation Bench from Scratch

1
Comments
7 min read
Best AI Agent Frameworks for Building Production-Ready Agents

Best AI Agent Frameworks for Building Production-Ready Agents

Comments
14 min read
Turbocharging LLM Inference with Optimized Caching

Turbocharging LLM Inference with Optimized Caching

Comments
2 min read
From Prompt Engineering to Context Engineering: What Actually Changed (And What Didn't)
Cover image for From Prompt Engineering to Context Engineering: What Actually Changed (And What Didn't)

From Prompt Engineering to Context Engineering: What Actually Changed (And What Didn't)

Comments
5 min read
What AI Assistants Don't Know About Your .NET Stack

What AI Assistants Don't Know About Your .NET Stack

Comments
5 min read
80% of LLM 'Thinking' Is a Lie — What CoT Faithfulness Research Actually Shows

80% of LLM 'Thinking' Is a Lie — What CoT Faithfulness Research Actually Shows

Comments
7 min read
AI agent context still misses the product layer
Cover image for AI agent context still misses the product layer

AI agent context still misses the product layer

Comments
5 min read
DAG vs Langraph Nodes

DAG vs Langraph Nodes

Comments
2 min read
The gay jailbreak: probé la técnica viral sobre mis propios prompts de producción y esto encontré
Cover image for The gay jailbreak: probé la técnica viral sobre mis propios prompts de producción y esto encontré

The gay jailbreak: probé la técnica viral sobre mis propios prompts de producción y esto encontré

Comments
9 min read
Your AI Agent Has No Runtime Policy. That's the Actual Security Problem.
Cover image for Your AI Agent Has No Runtime Policy. That's the Actual Security Problem.

Your AI Agent Has No Runtime Policy. That's the Actual Security Problem.

2
Comments
4 min read
I spent months trying to stop LLM hallucinations. Prompt engineering wasn't enough. So I wrote a graph engine in Rust.
Cover image for I spent months trying to stop LLM hallucinations. Prompt engineering wasn't enough. So I wrote a graph engine in Rust.

I spent months trying to stop LLM hallucinations. Prompt engineering wasn't enough. So I wrote a graph engine in Rust.

2
Comments
5 min read
The Trust Layer Nobody Built: Why AI Agents Need Verification Before They Can Spend

The Trust Layer Nobody Built: Why AI Agents Need Verification Before They Can Spend

1
Comments
4 min read
I cut my LLM API costs by 71% — here's the open-source SDK I built

I cut my LLM API costs by 71% — here's the open-source SDK I built

Comments
1 min read
Why Qwen Won't Run on Your MacBook Air (and How to Fix It)
Cover image for Why Qwen Won't Run on Your MacBook Air (and How to Fix It)

Why Qwen Won't Run on Your MacBook Air (and How to Fix It)

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.