Forem

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Tokenizer Quirks: Claude, GPT, and Gemini Don't Count the Same Text the Same Way
Cover image for Tokenizer Quirks: Claude, GPT, and Gemini Don't Count the Same Text the Same Way

Tokenizer Quirks: Claude, GPT, and Gemini Don't Count the Same Text the Same Way

Comments
6 min read
Cosine Similarity Lies. Here's What to Use When Your Embeddings All Cluster at 0.85
Cover image for Cosine Similarity Lies. Here's What to Use When Your Embeddings All Cluster at 0.85

Cosine Similarity Lies. Here's What to Use When Your Embeddings All Cluster at 0.85

Comments
7 min read
The Hidden Tax of Structured Output: How Much Extra You Pay for JSON Mode
Cover image for The Hidden Tax of Structured Output: How Much Extra You Pay for JSON Mode

The Hidden Tax of Structured Output: How Much Extra You Pay for JSON Mode

Comments
7 min read
When 'Take a Deep Breath' Stopped Working: Prompt Tricks With an Expiry Date
Cover image for When 'Take a Deep Breath' Stopped Working: Prompt Tricks With an Expiry Date

When 'Take a Deep Breath' Stopped Working: Prompt Tricks With an Expiry Date

Comments
7 min read
Build vs. Buy vs. Prompt Is the Wrong AI Question
Cover image for Build vs. Buy vs. Prompt Is the Wrong AI Question

Build vs. Buy vs. Prompt Is the Wrong AI Question

Comments
8 min read
Alibaba's Qwen3.6-Max-Preview Challenges GPT-5.4 on Agentic Coding

Alibaba's Qwen3.6-Max-Preview Challenges GPT-5.4 on Agentic Coding

Comments
7 min read
Cut Claude Code Token Usage by Delegating to Cheaper Models with Boss Mode

Cut Claude Code Token Usage by Delegating to Cheaper Models with Boss Mode

Comments
4 min read
Why ChatGPT will silently lie about your bank statement (and how to catch it)

Why ChatGPT will silently lie about your bank statement (and how to catch it)

Comments
4 min read
MCP in Production Reality vs the Spec

MCP in Production Reality vs the Spec

Comments
3 min read
Who Owns the Code Claude Wrote? The Legal Mess No One's Talking About
Cover image for Who Owns the Code Claude Wrote? The Legal Mess No One's Talking About

Who Owns the Code Claude Wrote? The Legal Mess No One's Talking About

Comments
3 min read
Building a Multi-Agent Travel Planner: From a One-Sentence Prompt to a Validated, Budget-Aware Itinerary
Cover image for Building a Multi-Agent Travel Planner: From a One-Sentence Prompt to a Validated, Budget-Aware Itinerary

Building a Multi-Agent Travel Planner: From a One-Sentence Prompt to a Validated, Budget-Aware Itinerary

1
Comments
8 min read
How Much VRAM Do You *Actually* Need for Local LLMs?
Cover image for How Much VRAM Do You *Actually* Need for Local LLMs?

How Much VRAM Do You *Actually* Need for Local LLMs?

Comments
2 min read
Building Reliable AI Systems: Why Prompting Isn’t Enough
Cover image for Building Reliable AI Systems: Why Prompting Isn’t Enough

Building Reliable AI Systems: Why Prompting Isn’t Enough

Comments
3 min read
GPT-5.5: OpenAI’s Smartest Model Yet — But Is the Hype Bigger Than the Model?
Cover image for GPT-5.5: OpenAI’s Smartest Model Yet — But Is the Hype Bigger Than the Model?

GPT-5.5: OpenAI’s Smartest Model Yet — But Is the Hype Bigger Than the Model?

Comments
5 min read
AI Testing Is Not Magic — You Already Know How to Do It
Cover image for AI Testing Is Not Magic — You Already Know How to Do It

AI Testing Is Not Magic — You Already Know How to Do It

2
Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.