Forem

# llm

Posts

Prompt Injection Is an Agent Problem, Not a Model Problem

1
Comments
9 min read
Experience Working with OpenClaw (Clawbot)

1
Comments
3 min read
How Typed Conflict Resolution Beats Mem0 and MemGPT on the Hardest Memory Benchmark

2
Comments
6 min read
NER: Gemini vs Spacy vs Compromise

1
Comments
4 min read
How Developers Can Use AI for Smarter Google Search

Comments
3 min read
The 600x LLM Price Gap Is Your Biggest Optimization Opportunity

1
Comments
2 min read
I Built a Fully Local Paper RAG on an RTX 4060 8GB — BGE-M3 + Qwen2.5-32B + ChromaDB

Comments
10 min read
Running Qwen2.5-32B on RTX 4060 8GB — Beating M4 at 10.8 t/s with llama.cpp

1
Comments
7 min read
I built LLM Council: frontier models debating in an immersive 3D chamber

1
Comments
3 min read
Show HN: I built a private AI inference API in Australia — data sovereignty, Gemma3, live now

Comments 1
1 min read
AI Gateway Caching Explained — Why L1 + L2 Cache Layers Cut 90% of Your LLM Bill

5
Comments 1
6 min read
We built an AI that audits other AI agents (here's how A2A works in production)

Comments
4 min read
What MCP Actually Is (And Why It Exists)

2
Comments 3
4 min read
How LLMs Can Control Your Computer - Voice-Driven, Local, No API Keys

Comments
3 min read
Local LLMs vs Cloud APIs — A Real Cost Comparison (2026)

1
Comments
2 min read