Forem

# llm

Posts

Why I Replaced My AI Assistant With an Orchestra

Comments
6 min read
llama.swap Model Switcher Quickstart for OpenAI-Compatible Local LLMs

1
Comments
11 min read
Your LLM prompts are probably wasting 90% of tokens. Here’s how I fixed mine.

1
Comments 1
4 min read
Drift Artifact: A Method for Writing That Performs Its Own Argument

1
Comments
3 min read
Local AI in 2026: Ollama Benchmarks, $0 Inference, and the End of Per-Token Pricing

Comments 2
6 min read
SGLang QuickStart: Install, Configure, and Serve LLMs via OpenAI API

1
Comments
6 min read
Deploying vLLM on your Linux Server

Comments
2 min read
A11 — A Deterministic Reasoning Architecture for Autonomous Systems and LLM-Based Agents

Comments
4 min read
Why Asking an LLM for JSON Isn’t Enough

24
Comments 38
4 min read
I Benchmarked 5 AI Agent Frameworks — Here's What Actually Matters

Comments
7 min read
AnythingLLM: The All-in-One AI App for RAG, Agents, and Document Chat

Comments
2 min read
Why Connecting AI to Real Systems Is Still Hard

Comments
6 min read
Production-Grade GraphRAG Data Pipeline: End-to-End Construction from PDF Parsing to Knowledge Graph

Comments 1
9 min read
Running Local LLMs with NeuroLink and Ollama: Complete Guide

Comments
7 min read
Wiring Claude Into Real Systems With Tool Use

Comments
6 min read