Forem

# llm

Posts

I added a local eval loop to my personal AI assistant — here's what 800 scored interactions taught me

Comments
1 min read
Indeed Data API: Extract Structured JSON in 2026

Comments
8 min read
To Teach AI How to Remember, First Teach It How to Forget 2/2

Comments
9 min read
38% of AI Answers Are Wrong — And It's Your Prompt's Fault

Comments
3 min read
Anthropic Closes Claude Loophole for Agent Tools

Comments
4 min read
Spec-Driven Development Based on DSPI: Design-Specify-Plan-Implement

1
Comments 1
9 min read
EVAL #010: The AI Coding Agent Wars — 10 Agents, 4 Architectures, 1 Winner (For Now)

Comments
12 min read
Claude Haiku 4.5 Outperformed Sonnet 4.6 on PR Writing - Context Was the Difference

Comments
5 min read
CARE Loop: A Human-Centered Framework for Local LLM Development

2
Comments
5 min read
Your AI agent does not need a bigger context window

Comments
5 min read
Building a Voice-Controlled AI Agent

Comments
1 min read
Compare harnesses not models: Blitzy vs GPT-5.4 on SWE-Bench Pro

Comments
7 min read
Building a Voice-Controlled AI Agent using AssemblyAI and Groq

Comments
1 min read
I Ran 163 Benchmarks Across 10 LLMs So You Don't Have To. Here's What I Found

Comments
6 min read
What Actually Happens When Claude Says "Compacting Our Conversation"

2
Comments
5 min read