Forem

# llm

Posts

LLM Foundry on a tiny model: the stack still does the heavy lifting

Comments
1 min read
You Vibe-Coded Your SaaS Landing Page — Google Can't See It

Comments
2 min read
llama.cpp MTP Beta, Gemma GGUF Fixes, & Sentinel Local-First AI Coding App

Comments
3 min read
Vision Models for OCR: When They Beat Tesseract and When They Don't

Comments
7 min read
AI Agents Have Two Souls. You Only Control One

1
Comments
10 min read
The LLM-shaped hole in your XGBoost pipeline

Comments
1 min read
AI Agent Context Window Cost: The Compounding Math Your Architecture Is Hiding

Comments
7 min read
How I cut my multi-turn LLM API costs by 90% (O(N²) → O(N))

Comments
2 min read
Why Pairing Your Bootstrap Is Necessary — And When It Stops Helping

1
Comments 1
5 min read
What MCP Really Is — A Demo You Can Run on Your Laptop in 5 Minutes

Comments
10 min read
Building AI-Powered Apps for Free in 2026 — The Complete Guide

Comments
2 min read
What we shipped -- 2026-05-05

Comments
1 min read
Local LLM vs Gemini API — Cost, Quality, Privacy Compared (2026)

Comments
2 min read
About Sharing Local Inference: A Marketplace for Renting Idle GPUs with an OpenAI-Compatible Backend

Comments
8 min read
Cut Your AI Agent Token Costs by 75% With One Skill Plugin

Comments
2 min read