Forem

# llm

Posts

I Prompted 5 Frontier LLMs to “Report Uncertainty” Here’s What Happened to Their Statistical Validity Scores

Comments
2 min read
How to Run a 35B Parameter Model on Your Laptop Without Melting It

Comments
5 min read
RAG: How AI Models Use Your Data Without Forgetting

4
Comments 2
14 min read
We built traceAI, an open-source tool for tracing LLM calls in production

Comments
1 min read
How to Audit Your Site's AI Search Visibility in 30 Minutes (with a Free CLI)

Comments
6 min read
Claude's default teaching shape has no return: the 5-node loop that fixes it

1
Comments
6 min read
How to choose the right AIOps platform

Comments
4 min read
Qwen3.6 GGUF Benchmarks, Ternary Bonsai 1.58-bit Models, & Ollama Code Explainer Tool

Comments
3 min read
How to Run LLMs Locally When Cloud AI Gets Too Invasive

Comments
5 min read
Most document AI questions aren't retrieval problems

4
Comments
4 min read
How I got 80% code retrieval accuracy without vectors, embeddings, or any ML

Comments
2 min read
Agentic AI's Infrastructure Boom Meets Its Reliability Problem

Comments
3 min read
Why I built ragwise: pip-installable RAG with hybrid search, streaming, and agent tools by default

Comments
4 min read
Stop Paying for the Same Answer Twice: A Deep Dive into llm-cache

3
Comments
9 min read
Running LLM Classification After the Response: Next.js after() + OpenRouter at $0.0002 per Call

5
Comments
8 min read