Forem

# llm

Apr 27

I built an LLM eval rig in a weekend. Most of it was wrong.

#ai #llm #testing #webdev

4 min read

Apr 27

How to Detect Prompt Injection in Your LLM Agent — Python, 5 Minutes

#agents #llm #python #security

5 min read

NEE

Apr 27

Deep Dive into Open Agent SDK (Part 6): Multi-LLM Providers and Runtime Controls

#ai #swift #llm #opensource

13 min read

Kento IKEDA for AWS Community Builders

Apr 26

Harness Engineering with Nothing but Markdown

#ai #productivity #llm #automation

10 min read

Apr 27

GPT-5 vs Claude Sonnet 4: real per-task cost and benchmark comparison for production workloads

#openai #claude #llm #ai

7 min read

Vivek Raja

Apr 27

Skills for eval-driven agent optimization

#agents #ai #llm #testing

1 min read

Christopher Maher

Apr 27

62.2% on Aider Polyglot from a MacBook Pro. Then the other model we tried scored 4%. Here's what actually happened, with a working cost loop attached.

#kubernetes #ai #llm #opensource

16 min read

Apr 28

DeepSeek-V4 Changes the Context Game for Agents — And Your Memory Architecture Should Adapt

#ai #agents #llm #deepseek

3 min read

Mei Hammer

Apr 27

What If You Compressed Your Prompts Into Chinese Emoji? (A Token-Saving Thought Experiment)

#ai #llm #productivity #machinelearning

3 min read

Apr 26

Your RAG Eval Set Is Probably Wrong. The Test That Catches It.

#ai #rag #llm #observability

7 min read

cited

Apr 27

GEO / AI Search Thread

#discuss #ai #llm #marketing

5 min read

Apr 26

Hybrid Search Is the Phrase You'll Hear at Every RAG Talk in 2026

#ai #rag #llm #database

7 min read

Apr 26

The JSON-Mode Prompt Pattern That Survives Claude Version Bumps

#ai #llm #prompt #tutorial

7 min read

Apr 26

The 3 Alerts Every LLM Team Should Have Set Up by Tomorrow

#llm #observability #devops #tutorial

7 min read

Apr 26

Stop Caching the Whole LLM Response. Cache the Embedding.

#ai #rag #llm #observability

8 min read
