Skip to content

Forem

# llm

👋 Sign in for the ability to sort posts by relevant, latest, or top.

Dady Fredy

Jan 20

The 2am Conversation: What Happens When You Treat AI Like a Colleague

#ai #devjournal #llm #vibecoding

3 min read

Dec 29 '25

The Hidden Switchboard Behind vLLM Attention

#vllm #llm #attention #aiinference

10 min read

Rodolfo Olivieri

Dec 19 '25

Let’s talk about: Goose!

#research #goose #llm #opensource

15 min read

Cover image for Why Your AI Feels Dumb (And How MCP Fixes It)

Jan 23

Why Your AI Feels Dumb (And How MCP Fixes It)

#mcp #ai #llmops #llm

3 min read

Cover image for The Orphan Axiom Problem in Ontology-Based RAG

Dec 18 '25

The Orphan Axiom Problem in Ontology-Based RAG

#rag #llm #architecture #ai

6 min read

Cover image for All Data and AI Weekly #224-12 Jan 2026

Jan 12

All Data and AI Weekly #224-12 Jan 2026

#mcp #llm #snowflake #genai

4 min read

Dec 19 '25

⚙️ One Tool, Many Brains: Building a Multi-Model DevOps Architect

#devops #terraform #k8s #llm

7 min read

Cover image for The LLM Control Stack: From Words to Weights

Jan 1

The LLM Control Stack: From Words to Weights

#deeplearning #llm #promptengineering #ai

4 min read

Cover image for LLMs Can Now Write GPU Kernels That Beat torch.compile

Jan 23

LLMs Can Now Write GPU Kernels That Beat torch.compile

#gpu #cuda #triton #llm

7 min read

Dec 18 '25

The Squeezing Effect: Why Your Aligned AI Model Gets Worse

#ai #llm #machinelearning #aiimplementation

3 min read

Cover image for Developers Love Tools. AI Needs Better Instructions.

Mvtsahil (Sahil Khan)

Jan 23

Developers Love Tools. AI Needs Better Instructions.

#ai #llm #productivity #softwaredevelopment

3 min read

Cover image for Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Dec 18 '25

Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

#algorithms #rag #llm #ai

12 min read

Dec 19 '25

How to Build Multi-Provider Failover Strategies with Bifrost for Ultra‑Reliable AI Applications

#ai #architecture #llm

8 min read

Dec 19 '25

Semantic Caching with Bifrost: Reduce LLM Costs and Latency by Up to 70%

#rag #performance #llm #ai

7 min read

osman uygar köse

Dec 30 '25

The Art of Context Windows: Our AI Had Alzheimer's: Here's How We Taught It To Remember

#ai #llm #architecture #python

9 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.