Forem

Site Reliability Engineering

Posts

Retry Pattern: Handling Transient Failures in Distributed Systems

Comments
3 min read
Retry Pattern: Handling Transient Failures in Distributed Systems (in Spanish)

Comments
3 min read
Procedures as a Solid Foundation for Developer Experience Before Automation (in Portuguese)

1
Comments
2 min read
SRE Deployment Engineer Managing Reliable & Automated Deployments

1
Comments
4 min read
Postmortem: The Importance of Structured Incident Analysis in SRE (in Portuguese)

2
Comments
4 min read
K8s Plugins For Solid Security

Comments
2 min read
What are Kata Containers?

Comments
2 min read
Designing a fault-tolerant etcd cluster on AWS

8
Comments 1
5 min read
Zero-Downtime Blue-Green Deployment with a Simple 'git pull & bash run.sh' Command

1
Comments
1 min read
Internal Developer Portals: Autonomy, Governance and the Golden Path

1
Comments
15 min read
DynamoDB: Query vs. Scan! Stop Burning Money by Using Scan in Production (in Portuguese)

38
Comments 6
4 min read
7 Kubernetes Security Best Practices in 2024

6
Comments
3 min read
How to Fix Kubernetes Node Disk Pressure

4
Comments
2 min read
Some of the less-known ping types you should know

6
Comments 1
1 min read
How a Pod is Deleted - Behind the Scenes Breakdown

8
Comments 2
2 min read
How to Set up Disk and Bandwidth Limits in Docker

2
Comments
2 min read
How To Fix OOMKilled

1
Comments
2 min read
Creating an Efficient IT Incident Management Plan: A Guide to Templates and Best Practices

Comments
7 min read
SLOs and Customer Experience: Uniting Engineering Excellence with Customer Satisfaction

Comments
5 min read
SRE and the Enterprise: Building a Culture of Reliability at Scale

Comments
4 min read
SRE vs DevOps: What’s the Difference and Why Does It Matter? 🤓

Comments
1 min read
Best Practices for Choosing a Status Page Provider

Comments
5 min read
DevOps vs. SRE Understanding the Differences and Benefits

Comments
2 min read
How to Define Engineering Standards (with Backstage)

Comments
10 min read
The Pillars of Site Reliability Engineering Building Resilient Systems

Comments
2 min read