Forem

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
EVAL #006: LLM Evaluation Tools — RAGAS vs DeepEval vs Braintrust vs LangSmith vs Arize Phoenix

EVAL #006: LLM Evaluation Tools — RAGAS vs DeepEval vs Braintrust vs LangSmith vs Arize Phoenix

Comments
10 min read
LLM Cost Management: From Monitoring Dashboards to Real-Time Enforcement

LLM Cost Management: From Monitoring Dashboards to Real-Time Enforcement

Comments
6 min read
Claude system prompt diff: lo que cambió entre Opus 4.6 y 4.7 (y yo lo estaba viendo sin saberlo)
Cover image for Claude system prompt diff: lo que cambió entre Opus 4.6 y 4.7 (y yo lo estaba viendo sin saberlo)

Claude system prompt diff: lo que cambió entre Opus 4.6 y 4.7 (y yo lo estaba viendo sin saberlo)

Comments
8 min read
LLM Evals on Real Traffic — Not Just Test Suites
Cover image for LLM Evals on Real Traffic — Not Just Test Suites

LLM Evals on Real Traffic — Not Just Test Suites

1
Comments 1
4 min read
I Gave AI Agents a Telecom Job Interview. Most Failed Without a Cheat Sheet
Cover image for I Gave AI Agents a Telecom Job Interview. Most Failed Without a Cheat Sheet

I Gave AI Agents a Telecom Job Interview. Most Failed Without a Cheat Sheet

Comments
8 min read
Fine-Tuning LLMs on Apple Silicon: New Tools Enable Local Prototyping, Reducing Cloud GPU Dependency

Fine-Tuning LLMs on Apple Silicon: New Tools Enable Local Prototyping, Reducing Cloud GPU Dependency

Comments
19 min read
Agents, Smart Contracts, and a Unified ML Engine

Agents, Smart Contracts, and a Unified ML Engine

Comments
2 min read
An unexplainable thing I saw: the agent didn't just comply with rules — it endorsed them
Cover image for An unexplainable thing I saw: the agent didn't just comply with rules — it endorsed them

An unexplainable thing I saw: the agent didn't just comply with rules — it endorsed them

Comments
26 min read
Stop Your AI Agent From Hallucinating Features

Stop Your AI Agent From Hallucinating Features

Comments
2 min read
How to Give Your AI Agent Memory Between Conversations

How to Give Your AI Agent Memory Between Conversations

Comments
2 min read
The Complete Guide to AI Agent Observability in Production

The Complete Guide to AI Agent Observability in Production

Comments
9 min read
What we found when an AI audited an AI (real findings, no sanitising)

What we found when an AI audited an AI (real findings, no sanitising)

Comments
4 min read
I Was Paying Anthropic to Read CSS Class Names
Cover image for I Was Paying Anthropic to Read CSS Class Names

I Was Paying Anthropic to Read CSS Class Names

13
Comments 2
8 min read
When "Slow Thinking" Is Just "Slow Talking"

When "Slow Thinking" Is Just "Slow Talking"

Comments
3 min read
CPF: Compact Prompt Format — 30-50% Fewer Tokens, Zero Loss
Cover image for CPF: Compact Prompt Format — 30-50% Fewer Tokens, Zero Loss

CPF: Compact Prompt Format — 30-50% Fewer Tokens, Zero Loss

Comments
7 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.