Forem

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
On-Device AI Is Changing How We Build — With Cover Image Test
Cover image for On-Device AI Is Changing How We Build — With Cover Image Test

On-Device AI Is Changing How We Build — With Cover Image Test

Comments
1 min read
Building a Secure GPT Gateway (Part 1)

Building a Secure GPT Gateway (Part 1)

1
Comments
3 min read
Your Mac Is a Supercomputer. It's Time We Benchmarked It Like One.
Cover image for Your Mac Is a Supercomputer. It's Time We Benchmarked It Like One.

Your Mac Is a Supercomputer. It's Time We Benchmarked It Like One.

Comments
6 min read
I built an Ollama alternative with TurboQuant, model groups, and multi-GPU support

I built an Ollama alternative with TurboQuant, model groups, and multi-GPU support

1
Comments 1
4 min read
The Vibe Coding Paradox: Why My Weekend Project is Faster Than My Enterprise R&D

The Vibe Coding Paradox: Why My Weekend Project is Faster Than My Enterprise R&D

Comments 2
7 min read
The AI Stack: A Practical Guide to Building Your Own Intelligent Applications

The AI Stack: A Practical Guide to Building Your Own Intelligent Applications

1
Comments
5 min read
Long-Horizon Agents Are Here. Full Autopilot Isn't

Small tasks exposing fragile model loops

Long-Horizon Agents Are Here. Full Autopilot Isn't

34
Comments 18
7 min read
How I Built a PII Tokenization Middleware to Keep Sensitive Data Out of LLM APIs

How I Built a PII Tokenization Middleware to Keep Sensitive Data Out of LLM APIs

8
Comments 4
5 min read
Same Model, Different Environment, Different Results

Interface design shaping model reasoning

Same Model, Different Environment, Different Results

5
Comments 9
9 min read
Utility is all you need
Cover image for Utility is all you need

Utility is all you need

15
Comments 2
8 min read
Exclusive: China's DeepSeek trained AI model on Nvidia's best chip despite US ban, official says - Reuters

Exclusive: China's DeepSeek trained AI model on Nvidia's best chip despite US ban, official says - Reuters

Comments
6 min read
Text-to-SQL Failure Demo

Text-to-SQL Failure Demo

1
Comments
6 min read
Why We Ditched Bedrock Agents for Nova Pro and Built a Custom Orchestrator
Cover image for Why We Ditched Bedrock Agents for Nova Pro and Built a Custom Orchestrator

Why We Ditched Bedrock Agents for Nova Pro and Built a Custom Orchestrator

2
Comments 1
7 min read
What Is Context Engineering and How to Apply It in Real Systems
Cover image for What Is Context Engineering and How to Apply It in Real Systems

What Is Context Engineering and How to Apply It in Real Systems

4
Comments
6 min read
Why the E8 lattice is the perfect quantizer for KV caches
Cover image for Why the E8 lattice is the perfect quantizer for KV caches

Why the E8 lattice is the perfect quantizer for KV caches

1
Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.