Forem

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
How I Troubleshot a KVM Memory Issue That Led to Swap & High CPU (Runbook + Real Scenario)

How I Troubleshot a KVM Memory Issue That Led to Swap & High CPU (Runbook + Real Scenario)

2
Comments
3 min read
Rotating Residential Proxy Evaluation Mini-Lab You Can Run in 90 Minutes
Cover image for Rotating Residential Proxy Evaluation Mini-Lab You Can Run in 90 Minutes

Rotating Residential Proxy Evaluation Mini-Lab You Can Run in 90 Minutes

Comments
6 min read
Build an AI Code Review Agent in GitHub Actions (That Actually Reduces Incidents
Cover image for Build an AI Code Review Agent in GitHub Actions (That Actually Reduces Incidents

Build an AI Code Review Agent in GitHub Actions (That Actually Reduces Incidents

6
Comments 4
4 min read
Workflow Deep Dive

Workflow Deep Dive

Comments
1 min read
Why a Status Page Should Not Depend on Third-Party CDNs
Cover image for Why a Status Page Should Not Depend on Third-Party CDNs

Why a Status Page Should Not Depend on Third-Party CDNs

1
Comments 2
4 min read
Building a Config Drift Detector for AWS (with Snapshots, Lambdas, and a Next.js Dashboard)
Cover image for Building a Config Drift Detector for AWS (with Snapshots, Lambdas, and a Next.js Dashboard)

Building a Config Drift Detector for AWS (with Snapshots, Lambdas, and a Next.js Dashboard)

Comments
5 min read
Running Cluster on 100% Spot Instances: How K8s Does It Better Than ECS

Running Cluster on 100% Spot Instances: How K8s Does It Better Than ECS

Comments
4 min read
Debugging Kubernetes Nodes in NotReady State

Debugging Kubernetes Nodes in NotReady State

Comments
5 min read
Kubernetes 1.36 apiserver /readyz now waits for watch cache

Kubernetes 1.36 apiserver /readyz now waits for watch cache

Comments
5 min read
Two Terraform Traps That Burned Me: Hidden Defaults & Circular Dependencies

Two Terraform Traps That Burned Me: Hidden Defaults & Circular Dependencies

Comments
4 min read
Why Your Engineering Wiki is a Graveyard (And How to Fix It)

Why Your Engineering Wiki is a Graveyard (And How to Fix It)

Comments
3 min read
Chapter 3: A Better Abstraction — Managing LLM Apps with Terraform + Helm

Chapter 3: A Better Abstraction — Managing LLM Apps with Terraform + Helm

1
Comments
10 min read
How to Make Engineering Knowledge Searchable (A Complete Guide)

How to Make Engineering Knowledge Searchable (A Complete Guide)

1
Comments
3 min read
How we fixed Real Kubernetes Production Incidents

How we fixed Real Kubernetes Production Incidents

4
Comments
3 min read
Os 4 Sinais Dourados da Google

Os 4 Sinais Dourados da Google

4
Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.