Forem

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Why Your AI Character Keeps Breaking Under Pressure (And What I Built Instead of Yet Another System Prompt)

Why Your AI Character Keeps Breaking Under Pressure (And What I Built Instead of Yet Another System Prompt)

5
Comments
8 min read
LLM on EKS: Serving with vLLM
Cover image for LLM on EKS: Serving with vLLM

LLM on EKS: Serving with vLLM

5
Comments
10 min read
Local LLM Acceleration, Framework Comparisons, & Ollama Observability

Local LLM Acceleration, Framework Comparisons, & Ollama Observability

1
Comments
4 min read
I Built a Spatial Audio Radar ft. Vibe Code Arena
Cover image for I Built a Spatial Audio Radar ft. Vibe Code Arena

I Built a Spatial Audio Radar ft. Vibe Code Arena

Comments
4 min read
Spud Was the Rumored GPT-6. It Shipped as GPT-5.5, Two Tiers Inside.
Cover image for Spud Was the Rumored GPT-6. It Shipped as GPT-5.5, Two Tiers Inside.

Spud Was the Rumored GPT-6. It Shipped as GPT-5.5, Two Tiers Inside.

Comments
7 min read
The Apple-Gemini Deal Anthropic Wasn't In: Google Cloud Next 2026
Cover image for The Apple-Gemini Deal Anthropic Wasn't In: Google Cloud Next 2026

The Apple-Gemini Deal Anthropic Wasn't In: Google Cloud Next 2026

Comments
7 min read
Prompting Without the Menu
Cover image for Prompting Without the Menu

Prompting Without the Menu

Comments
5 min read
One Open Source Project a Day (No.49): free-claude-code - Run Claude Code for Free with One Environment Variable
Cover image for One Open Source Project a Day (No.49): free-claude-code - Run Claude Code for Free with One Environment Variable

One Open Source Project a Day (No.49): free-claude-code - Run Claude Code for Free with One Environment Variable

Comments
8 min read
Thoughts on GPT-5.5 and What It Means for Learning to Code

Thoughts on GPT-5.5 and What It Means for Learning to Code

Comments
1 min read
Reducing AI Latency Through Smarter Model Routing and Token Optimization

Reducing AI Latency Through Smarter Model Routing and Token Optimization

Comments
3 min read
Agentic Tools, Rust LangFlow, and AI Pharma Breakthroughs

Agentic Tools, Rust LangFlow, and AI Pharma Breakthroughs

Comments
2 min read
Llama-Server Router Mode - Dynamic Model Switching Without Restarts

Llama-Server Router Mode - Dynamic Model Switching Without Restarts

Comments
9 min read
I Built a GPU Dataset for LLM Inference — Here’s What I Learned
Cover image for I Built a GPU Dataset for LLM Inference — Here’s What I Learned

I Built a GPU Dataset for LLM Inference — Here’s What I Learned

1
Comments
2 min read
I found 100% prompt injection success rate against AI SOC assistants - here is the detection layer I built

I found 100% prompt injection success rate against AI SOC assistants - here is the detection layer I built

Comments
2 min read
Your LLM Bill Is Too High. Here's How to Fix It (Part 1)
Cover image for Your LLM Bill Is Too High. Here's How to Fix It (Part 1)

Your LLM Bill Is Too High. Here's How to Fix It (Part 1)

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.