Forem

Site Reliability Engineering

Site Reliability Engineering principles, practices, and culture.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Desvendando o Mundo do On-call: Desafios e Estratégias para uma Operação Eficiente
Cover image for Desvendando o Mundo do On-call: Desafios e Estratégias para uma Operação Eficiente

Desvendando o Mundo do On-call: Desafios e Estratégias para uma Operação Eficiente

2
Comments
3 min read
Lazy Loading vs Write-Through: A Guide to Performance Optimization
Cover image for Lazy Loading vs Write-Through: A Guide to Performance Optimization

Lazy Loading vs Write-Through: A Guide to Performance Optimization

6
Comments 1
8 min read
Mastering Reliability in High-Velocity Software Development
Cover image for Mastering Reliability in High-Velocity Software Development

Mastering Reliability in High-Velocity Software Development

Comments
9 min read
Alert Fatigue, and How to Fix it

Alert Fatigue, and How to Fix it

5
Comments
4 min read
Platform Engineering 101: Supercharging Dev, Sec, and Ops Harmony with Automation
Cover image for Platform Engineering 101: Supercharging Dev, Sec, and Ops Harmony with Automation

Platform Engineering 101: Supercharging Dev, Sec, and Ops Harmony with Automation

Comments
7 min read
Code to Cloud: DevOps with AWS
Cover image for Code to Cloud: DevOps with AWS

Code to Cloud: DevOps with AWS

2
Comments
5 min read
Navigating On-Call Compensation in the Tech Industry In 2023

Navigating On-Call Compensation in the Tech Industry In 2023

Comments
9 min read
Using Projectsveltos to Manage Kubernetes Add-ons on Civo Cloud Clusters
Cover image for Using Projectsveltos to Manage Kubernetes Add-ons on Civo Cloud Clusters

Using Projectsveltos to Manage Kubernetes Add-ons on Civo Cloud Clusters

1
Comments
4 min read
6 Outstanding Status Page Examples to Inspire You in 2023
Cover image for 6 Outstanding Status Page Examples to Inspire You in 2023

6 Outstanding Status Page Examples to Inspire You in 2023

1
Comments 1
5 min read
MTTx Metrics-Based Incident Response Optimization
Cover image for MTTx Metrics-Based Incident Response Optimization

MTTx Metrics-Based Incident Response Optimization

3
Comments 1
7 min read
Choosing the Right AWS EC2 Instance: Avoiding Common Pitfalls
Cover image for Choosing the Right AWS EC2 Instance: Avoiding Common Pitfalls

Choosing the Right AWS EC2 Instance: Avoiding Common Pitfalls

9
Comments 2
7 min read
Reliability concepts: Availability, Resiliency, Robustness, Fault-Tolerance, and Reliability
Cover image for Reliability concepts: Availability, Resiliency, Robustness, Fault-Tolerance, and Reliability

Reliability concepts: Availability, Resiliency, Robustness, Fault-Tolerance, and Reliability

10
Comments
1 min read
Amazon Grafana demo with EKS
Cover image for Amazon Grafana demo with EKS

Amazon Grafana demo with EKS

8
Comments 4
6 min read
The Ins and Outs of Status Pages
Cover image for The Ins and Outs of Status Pages

The Ins and Outs of Status Pages

1
Comments
6 min read
Grafana on AWS Marketplace
Cover image for Grafana on AWS Marketplace

Grafana on AWS Marketplace

7
Comments
4 min read
Runbook vs. Playbook: Meaning, Differences, and Uses
Cover image for Runbook vs. Playbook: Meaning, Differences, and Uses

Runbook vs. Playbook: Meaning, Differences, and Uses

Comments
6 min read
Chaos Engineering con AWS Fault Injection Simulator

Chaos Engineering con AWS Fault Injection Simulator

2
Comments
5 min read
What Is the Role of an Incident Commander?
Cover image for What Is the Role of an Incident Commander?

What Is the Role of an Incident Commander?

Comments
7 min read
Taints and Tolerations in Kubernetes: A Pocket Guide
Cover image for Taints and Tolerations in Kubernetes: A Pocket Guide

Taints and Tolerations in Kubernetes: A Pocket Guide

4
Comments
3 min read
How To Create an Incident Communication Plan
Cover image for How To Create an Incident Communication Plan

How To Create an Incident Communication Plan

Comments
7 min read
Siglas da Observabilidade SLI, SLO, SLE, MTTA, MTTR, MTBF e MTTF
Cover image for Siglas da Observabilidade SLI, SLO, SLE, MTTA, MTTR, MTBF e MTTF

Siglas da Observabilidade SLI, SLO, SLE, MTTA, MTTR, MTBF e MTTF

3
Comments
3 min read
Unpacking the Power of AWS ECS: A Comparative Look at ECS on EC2 vs. ECS on Fargate
Cover image for Unpacking the Power of AWS ECS: A Comparative Look at ECS on EC2 vs. ECS on Fargate

Unpacking the Power of AWS ECS: A Comparative Look at ECS on EC2 vs. ECS on Fargate

2
Comments
3 min read
Did You Know About AWS Always-Free Services
Cover image for Did You Know About AWS Always-Free Services

Did You Know About AWS Always-Free Services

8
Comments 2
3 min read
Site Reliability Engineering (SRE) Consulting Services
Cover image for Site Reliability Engineering (SRE) Consulting Services

Site Reliability Engineering (SRE) Consulting Services

Comments
2 min read
Extensões do Visual Studio Code para um SRE

Extensões do Visual Studio Code para um SRE

7
Comments
2 min read
loading...