Forem

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
AI Inference Cost Calculator: The Hidden Reality of Production AI Costs

AI Inference Cost Calculator: The Hidden Reality of Production AI Costs

Comments
4 min read
I don’t hate SQL. I hate metadata friction.

I don’t hate SQL. I hate metadata friction.

Comments
4 min read
Building Production AI Agents with LangGraph: Beyond the Toy Examples (2026)

Building Production AI Agents with LangGraph: Beyond the Toy Examples (2026)

3
Comments 2
11 min read
You Don’t “Prompt Engineer” Identity — You Architect It (Why CloYou Explores Constrained AI Clones)
Cover image for You Don’t “Prompt Engineer” Identity — You Architect It (Why CloYou Explores Constrained AI Clones)

You Don’t “Prompt Engineer” Identity — You Architect It (Why CloYou Explores Constrained AI Clones)

Comments
4 min read
The Symbol for All of Us is Null
Cover image for The Symbol for All of Us is Null

The Symbol for All of Us is Null

Comments
6 min read
AI coding philosophy

AI coding philosophy

3
Comments
3 min read
Building Sema: A Lisp with LLM Primitives, Built with AI Agents

Building Sema: A Lisp with LLM Primitives, Built with AI Agents

Comments
15 min read
Building AI Agents with Tool Use: Patterns That Work in Production (2026)

Building AI Agents with Tool Use: Patterns That Work in Production (2026)

1
Comments
12 min read
Circuit Breakers for LLM APIs: Applying SRE Patterns to AI Infrastructure

Circuit Breakers for LLM APIs: Applying SRE Patterns to AI Infrastructure

Comments
6 min read
Fazendo um LLM do Zero — Sessão 06: Dando uma Profissão ao Modelo (Fine-Tuning) 🎯👨‍⚕️

Fazendo um LLM do Zero — Sessão 06: Dando uma Profissão ao Modelo (Fine-Tuning) 🎯👨‍⚕️

Comments
4 min read
How to Train a Small Language Model: The Complete Guide for 2026
Cover image for How to Train a Small Language Model: The Complete Guide for 2026

How to Train a Small Language Model: The Complete Guide for 2026

1
Comments
9 min read
How to Implement Prompt Caching on Amazon Bedrock and Cut Inference Costs in Half
Cover image for How to Implement Prompt Caching on Amazon Bedrock and Cut Inference Costs in Half

How to Implement Prompt Caching on Amazon Bedrock and Cut Inference Costs in Half

Comments
12 min read
Securing AI-Powered Applications: A Comprehensive Guide to Protecting Your LLM-Integrated Web App
Cover image for Securing AI-Powered Applications: A Comprehensive Guide to Protecting Your LLM-Integrated Web App

Securing AI-Powered Applications: A Comprehensive Guide to Protecting Your LLM-Integrated Web App

Comments
8 min read
How I ran LLM + RAG fully offline on Android using MNN

How I ran LLM + RAG fully offline on Android using MNN

Comments
3 min read
5 Agent Design Patterns Every Developer Needs to Know in 2026

5 Agent Design Patterns Every Developer Needs to Know in 2026

2
Comments
15 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.