Forem

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Kubernetes 1.36 apiserver /readyz now waits for watch cache

Kubernetes 1.36 apiserver /readyz now waits for watch cache

Comments
5 min read
Two Terraform Traps That Burned Me: Hidden Defaults & Circular Dependencies

Two Terraform Traps That Burned Me: Hidden Defaults & Circular Dependencies

Comments
4 min read
Why Your Engineering Wiki is a Graveyard (And How to Fix It)

Why Your Engineering Wiki is a Graveyard (And How to Fix It)

Comments
3 min read
Chapter 3: A Better Abstraction — Managing LLM Apps with Terraform + Helm

Chapter 3: A Better Abstraction — Managing LLM Apps with Terraform + Helm

1
Comments
10 min read
SaaS Uptime Monitoring Explained: How Late Outage Detection Hurts Growth and Trust
Cover image for SaaS Uptime Monitoring Explained: How Late Outage Detection Hurts Growth and Trust

SaaS Uptime Monitoring Explained: How Late Outage Detection Hurts Growth and Trust

5
Comments 1
3 min read
How to Make Engineering Knowledge Searchable (A Complete Guide)

How to Make Engineering Knowledge Searchable (A Complete Guide)

1
Comments
3 min read
How we fixed Real Kubernetes Production Incidents

How we fixed Real Kubernetes Production Incidents

4
Comments
3 min read
Why Platform Engineering Is the Next Big Shift (and How Ops Teams Win)
Cover image for Why Platform Engineering Is the Next Big Shift (and How Ops Teams Win)

Why Platform Engineering Is the Next Big Shift (and How Ops Teams Win)

Comments 3
3 min read
Os 4 Sinais Dourados da Google

Os 4 Sinais Dourados da Google

4
Comments
5 min read
How to Get IP, ASN, and Network Information with curl (No API Key Required)

How to Get IP, ASN, and Network Information with curl (No API Key Required)

1
Comments
2 min read
Backpressure, Buffers, and Logging Sidecars

Backpressure, Buffers, and Logging Sidecars

2
Comments
5 min read
Wild Ride from Raw Syscalls to Figuring Out NSS and libc
Cover image for Wild Ride from Raw Syscalls to Figuring Out NSS and libc

Wild Ride from Raw Syscalls to Figuring Out NSS and libc

2
Comments 1
4 min read
You’re Running EC2 Instances That Do Nothing
Cover image for You’re Running EC2 Instances That Do Nothing

You’re Running EC2 Instances That Do Nothing

1
Comments
2 min read
The Real Reason AI Agents “Work” in Software
Cover image for The Real Reason AI Agents “Work” in Software

The Real Reason AI Agents “Work” in Software

Comments
6 min read
10 Proven Ways to Cut Your AWS Bill
Cover image for 10 Proven Ways to Cut Your AWS Bill

10 Proven Ways to Cut Your AWS Bill

1
Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.