Llm Page 138

Ultra Dune

Mar 17

EVAL #006: LLM Evaluation Tools — RAGAS vs DeepEval vs Braintrust vs LangSmith vs Arize Phoenix

#llm #evaluation #ai #machinelearning

10 min read

matt-dean-git

Mar 17

LLM Cost Management: From Monitoring Dashboards to Real-Time Enforcement

#ai #api #security #llm

6 min read

Juan Torchia

Apr 20

Claude system prompt diff: what changed between Opus 4.6 and 4.7 (and I was looking at it without knowing it)

#produccion #anthropic #llm #agentesia

8 min read

grepture

Mar 21

LLM Evals on Real Traffic — Not Just Test Suites

#ai #llm #observability #devops

4 min read

Ivo Brett

Mar 17

I Gave AI Agents a Telecom Job Interview. Most Failed Without a Cheat Sheet

#agents #ai #automation #llm

8 min read

Valeria Solovyova

Mar 17

Fine-Tuning LLMs on Apple Silicon: New Tools Enable Local Prototyping, Reducing Cloud GPU Dependency

#llm #applesilicon #finetuning #mlx

19 min read

Anikalp Jaiswal

Mar 17

Agents, Smart Contracts, and a Unified ML Engine

#ai #technology #machinelearning #llm

2 min read

joinwell52

Apr 20

An unexplainable thing I saw: the agent didn't just comply with rules — it endorsed them

#ai #agents #llm #aialignment

26 min read

Jay Guthrie

Mar 17

Stop Your AI Agent From Hallucinating Features

#agents #ai #llm #machinelearning

2 min read

Jay Guthrie

Mar 17

How to Give Your AI Agent Memory Between Conversations

#agents #ai #architecture #llm

2 min read

Jay Guthrie

Mar 17

The Complete Guide to AI Agent Observability in Production

#agents #ai #llm #monitoring

9 min read

gary-botlington

Mar 17

What we found when an AI audited an AI (real findings, no sanitising)

#ai #agents #llm #productivity

4 min read

Aral Roca

Apr 17

I Was Paying Anthropic to Read CSS Class Names

#markdown #webdev #llm #ai

8 min read

Cophy Origin

Apr 20

When "Slow Thinking" Is Just "Slow Talking"

#ai #machinelearning #llm #evaluation

3 min read

victorstackAI

Mar 17

CPF: Compact Prompt Format — 30-50% Fewer Tokens, Zero Loss

#devlog #agents #ai #llm

7 min read
