Forem

# llm

Posts

Stop Caching the Whole LLM Response. Cache the Embedding.
8 min read
GEO / AI Search Thread
5 min read
Why Apple Is Pivoting Hard Toward On-Device AI in 2026
7 min read
Vibe Coding Just Failed Its First Real Audit
8 min read
Hugging Face Pulled Dozens of Backdoored Models. Here's the Pattern.
7 min read
The 100-Line LLM Cache That Pays For Itself in a Week
8 min read
Anthropic Skills Is Quietly Killing the Prompt Management Category
7 min read
The Single Unit Test Every LLM Prompt Should Have
7 min read
The 6-Line Postgres Migration That Halved a Team's LLM Bill
7 min read
OpenAI Outage Postmortem: What Status Pages Don't Tell You
7 min read
The 2-Line Defense That Stops 90% of Real-World Prompt Injection
7 min read
I built an MCP server for a knowledge graph. It doesn't call any LLM.
1
Comments 2
2 min read
AI Agent Persona Gone Rogue: 140 Direct Edits and the Foreman Pattern
3 min read
Why I Stopped Counting Tokens: Building a Zero‑Token Cognition Engine
1 min read
Prompt Injection in AI Coding Agents: 3 Attack Vectors, 4 Defenses
12 min read