Forem

# llm

Posts

Ustaad: Building a Wiki That Thinks

4 min read
Why your LLM agent fails at 3 AM (and how state machines fix it)

2 min read
Chapter 7. Context Management and Token Optimization

10 min read
RAG vs Fine-Tuning — I've Used Both in Production, Here's What Actually Matters

3 min read
Kaelux: Engineering the Future of Intelligent Infrastructure

3 min read
All Data and AI Weekly #236-06-April-2026

13 min read
I Ran Google's New Gemma 4 Models Locally (26B and 31B) — Here's What I Found

4 min read
Your Phone Now Has Its Own Agent Skills. Google Just Showed Us What That Means.

2 min read
I built an open-source memory layer for LLMs — here's how it works

4 min read
I needed to know if the cheaper model was good enough. So I built an LLM-as-a-Judge pipeline

2 min read
Why your LLM product hallucinates the one thing it shouldn't, and the architectural pattern that fixes it

4 min read
OpenAI’s $1M API Credits, Holos’ Agentic Web, and Xpertbench’s Expert Tasks

2 min read
I tested speculative decoding on my home GPU cluster. Here's why it didn't help.

5 min read
Letting AI Control RAG Search Improved Accuracy by 79%

6 min read
Why Some AI Feels “Process-Obsessed” While Others Just Ship Code

1 min read