Forem

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Complete Guide to Reducing LLM Costs Without Sacrificing Quality

The Complete Guide to Reducing LLM Costs Without Sacrificing Quality

2
Comments
11 min read
Tiny Recursive Models: Rethinking AI with Small Neural “Brains” That Think in Loops
Cover image for Tiny Recursive Models: Rethinking AI with Small Neural “Brains” That Think in Loops

Tiny Recursive Models: Rethinking AI with Small Neural “Brains” That Think in Loops

2
Comments
4 min read
LLMs: Decoding the Geometry of Alignment

LLMs: Decoding the Geometry of Alignment

Comments
2 min read
Building Reliable Compound AI Systems: Architecture, Evaluation, and Observability

Building Reliable Compound AI Systems: Architecture, Evaluation, and Observability

Comments
10 min read
I Analyzed 100 Claude MCP Servers and Found Critical Security Flaws in 43% of Them

I Analyzed 100 Claude MCP Servers and Found Critical Security Flaws in 43% of Them

4
Comments 2
5 min read
6 AI Models vs. 3 Advanced Security Vulnerabilities

6 AI Models vs. 3 Advanced Security Vulnerabilities

Comments
9 min read
How to Improve Cross-Lingual Retrieval Accuracy in Bilingual RAG Chatbots

How to Improve Cross-Lingual Retrieval Accuracy in Bilingual RAG Chatbots

2
Comments 1
9 min read
Strands Agents: A Model-First SDK for Building Autonomous AI on AWS and Beyond
Cover image for Strands Agents: A Model-First SDK for Building Autonomous AI on AWS and Beyond

Strands Agents: A Model-First SDK for Building Autonomous AI on AWS and Beyond

Comments
6 min read
Understanding MoE Offloading

Understanding MoE Offloading

1
Comments
5 min read
Amazon Bedrock AgentCore Runtime - Part 8 AgentCore Memory Observability

Amazon Bedrock AgentCore Runtime - Part 8 AgentCore Memory Observability

1
Comments
5 min read
From Idea to AI Launch: How Devs Can Build Projects Like Serial Founder
Cover image for From Idea to AI Launch: How Devs Can Build Projects Like Serial Founder

From Idea to AI Launch: How Devs Can Build Projects Like Serial Founder

3
Comments
3 min read
Building a Chat Interface: From Components to Conversation

Building a Chat Interface: From Components to Conversation

1
Comments
4 min read
[KubeRay로 LLM 서빙 인프라 찍먹] 3부: vLLM과 Ray Serve를 활용한 고성능 추론 엔드포인트 구축기
Cover image for [KubeRay로 LLM 서빙 인프라 찍먹] 3부: vLLM과 Ray Serve를 활용한 고성능 추론 엔드포인트 구축기

[KubeRay로 LLM 서빙 인프라 찍먹] 3부: vLLM과 Ray Serve를 활용한 고성능 추론 엔드포인트 구축기

1
Comments
2 min read
Why JSON Mode Fails (and How to Fix It)

Why JSON Mode Fails (and How to Fix It)

Comments
2 min read
I Used Autogen GraphFlow and Qwen3 Coder to Solve Math Problems — And It Worked
Cover image for I Used Autogen GraphFlow and Qwen3 Coder to Solve Math Problems — And It Worked

I Used Autogen GraphFlow and Qwen3 Coder to Solve Math Problems — And It Worked

Comments
12 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.