Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

What happens when Amazon accidentally sends all of their support traffic your way?

28
Comments 3
3 min read
How Disaster Ready Are Your Backup Systems, Really?

2
Comments
6 min read
DevOps - Deployment strategies

8
Comments
6 min read
#90DaysOfDevOps - Day 3

2
Comments
5 min read
#90DaysOfDevOps - Day 1

32
Comments 4
4 min read
Fylamynt and Squadcast Team Up To Handle Cloud Incident Response, Management, and Remediation

5
Comments
4 min read
How to Create a Custom Role for RBAC

6
Comments
4 min read
Circumvent STDIN when installing packages with apt

4
Comments
2 min read
Some DevOps Terms definitions

8
Comments 1
4 min read
Hosting and Scaling Applications

3
Comments
3 min read
How to Write Meaningful Retrospectives

2
Comments
6 min read
#K8S01: Creating a Kubernetes Cluster for Educational Purposes

17
Comments
9 min read
Starting an SRE Team? Stay Away From Uptime.

8
Comments 2
5 min read
Solving the Diamond Problem with a Spacelift Trigger policy

13
Comments
4 min read
Day 8 of Sysadvent - D&D for SREs

2
Comments 2
6 min read
How to improve your influence as an SRE

1
Comments
8 min read
Post-mortem: Kubernetes pods don't start because of too many services

7
Comments
3 min read
All About Incident Communication: What it Is, How to Do It, and Why It’s Crucial for Business

Comments
7 min read
Implementing Graceful Shutdown in Go

15
Comments 5
14 min read
What You Need to Break into DevOps and SRE

65
Comments
3 min read
Don't panic when using CLI

7
Comments
2 min read
Virtual Webinar on 'Reliability Reimagined: How SREs spearhead competitive CX'

6
Comments
1 min read
DevOps & SRE Words Matter: How Our Language has Evolved

8
Comments 2
6 min read
Understanding DevOps

12
Comments
4 min read
Moving large amounts of data on AWS

7
Comments
5 min read