Forem

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Your Phone Now Has Its Own Agent Skills. Google Just Showed Us What That Means.

Your Phone Now Has Its Own Agent Skills. Google Just Showed Us What That Means.

Comments
2 min read
I Ran Google's New Gemma 4 Models Locally (26B and 31B) — Here's What I Found

I Ran Google's New Gemma 4 Models Locally (26B and 31B) — Here's What I Found

Comments
4 min read
I built an open-source memory layer for LLMs — here's how it works

I built an open-source memory layer for LLMs — here's how it works

Comments
4 min read
I needed to know if the cheaper model was good enough. So I built an LLM-as-a-Judge pipeline

I needed to know if the cheaper model was good enough. So I built an LLM-as-a-Judge pipeline

Comments
2 min read
Why your LLM product hallucinates the one thing it shouldn't, and the architectural pattern that fixes it
Cover image for Why your LLM product hallucinates the one thing it shouldn't, and the architectural pattern that fixes it

Why your LLM product hallucinates the one thing it shouldn't, and the architectural pattern that fixes it

Comments
4 min read
OpenAI’s $1M API Credits, Holos’ Agentic Web, and Xpertbench’s Expert Tasks

OpenAI’s $1M API Credits, Holos’ Agentic Web, and Xpertbench’s Expert Tasks

Comments
2 min read
I tested speculative decoding on my home GPU cluster. Here's why it didn't help.
Cover image for I tested speculative decoding on my home GPU cluster. Here's why it didn't help.

I tested speculative decoding on my home GPU cluster. Here's why it didn't help.

Comments
5 min read
Letting AI Control RAG Search Improved Accuracy by 79%

Letting AI Control RAG Search Improved Accuracy by 79%

Comments
6 min read
Why Some AI Feels “Process-Obsessed” While Others Just Ship Code
Cover image for Why Some AI Feels “Process-Obsessed” While Others Just Ship Code

Why Some AI Feels “Process-Obsessed” While Others Just Ship Code

Comments
1 min read
Cut AI Costs: Flutter On-Device LLM Integration Works

Cut AI Costs: Flutter On-Device LLM Integration Works

Comments
10 min read
Strategic LLM Adoption: A Director's Guide to Fine-Tuning Models for Domain-Specific Applications

Strategic LLM Adoption: A Director's Guide to Fine-Tuning Models for Domain-Specific Applications

1
Comments
3 min read
Jetson Containers Quickstart on NVIDIA Jetson AGX Orin 64GB
Cover image for Jetson Containers Quickstart on NVIDIA Jetson AGX Orin 64GB

Jetson Containers Quickstart on NVIDIA Jetson AGX Orin 64GB

Comments
8 min read
Two Kinds of AI Agents (And Why You Need Both)

Two Kinds of AI Agents (And Why You Need Both)

Comments
10 min read
How I Built Persistent Memory for AI Agents in Python

How I Built Persistent Memory for AI Agents in Python

Comments
2 min read
Why Your LLM App Fails in Production (and How to Debug It)
Cover image for Why Your LLM App Fails in Production (and How to Debug It)

Why Your LLM App Fails in Production (and How to Debug It)

4
Comments 1
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.