Forem

# costoptimization

Practical strategies and stories about reducing cloud infrastructure costs.

Posts

LLM Cost Optimization for Agent Workflows: A Practical Guide

Comments
13 min read
Kubernetes 1.36 Pod-Level Resource Managers: Advanced Resource Optimization in Production

Comments
6 min read
ARES: Cut LLM Agent Reasoning Costs 52% Per Step

Comments
7 min read
AI Agent Cost Explosion: Why Your Automation Is Bleeding Money

Comments
7 min read
The AI Bill Is Coming. Here Is the FinOps Playbook to Tame It.

1
Comments
8 min read
I Cut My LLM API Bill by 73% — Here's the Exact Optimization Playbook

Comments
5 min read
How a fintech startup cut cloud costs 65% with an open-source sovereign stack

Comments
2 min read
Local LLMs vs Cloud APIs: Building Offline-First AI Workflows

Comments
8 min read
Kinesis Data Firehose Is Burning Our Budget: One Setting Changed Everything

2
Comments
4 min read
How to Reduce LLM API Costs by 70% — 5 Strategies That Actually Work

4
Comments
4 min read
Part 8 — Token-by-Token: Why AI Generates Text One Word at a Time (And Why It Costs 4x More)

Comments 1
9 min read
The Shadow Cloud Spend: $50k a Month FinOps Audit of Forgotten Dev Accounts

Comments
7 min read
They Don't Have the Money (And Neither Do You): The Coming Era of Small Models

Comments
6 min read
The History of Expanso (Part 4): The Mismatch

Comments
3 min read