Embrace Risk - The SREs newsletter
Subscribe
Sign in
Home
Archive
About
Latest
Top
Revision 76
Articles and updates What’s the biggest unsolved problem within Site Reliability Engineering? (link) Ingress: Kubernetes Example with ngrok (link) Best…
Apr 22
•
Ricardo Castro
and
Konstantinos Livieratos
1
Share this post
Revision 76
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 75
Articles and updates Linux Crisis Tools (link) Moving fast breaks things: the importance of a staging environment (link) Building Application…
Apr 15
•
Ricardo Castro
and
Konstantinos Livieratos
1
Share this post
Revision 75
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 74
Articles and updates Using the platform engineering maturity model to understand the commitment required for an internal developer platform (link) File…
Apr 8
•
Ricardo Castro
and
Konstantinos Livieratos
1
Share this post
Revision 74
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 73
Articles and updates SLO formulas implementation in PromQL step by step (link) CI/CD observability: Extracting DORA metrics from a CD pipeline (link…
Apr 1
•
Ricardo Castro
and
Konstantinos Livieratos
1
Share this post
Revision 73
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
March 2024
Revision 72
Articles and updates Lessons From Our 8 Years Of Kubernetes In Production — Two Major Cluster Crashes, Ditching Self-Managed, Cutting Cluster Costs…
Mar 25
•
Konstantinos Livieratos
and
Ricardo Castro
1
Share this post
Revision 72
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 71
Articles and updates: Installing Cilium with ArgoCD on GKE (link) Preventing attacker persistence with Falco on AWS (link) How to Fail at Platform…
Mar 18
•
Ricardo Castro
and
Konstantinos Livieratos
1
Share this post
Revision 71
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 70
Articles and updates: What Does 99.999% Uptime Really Mean? (link) Building decoupled monitoring with OpenTelemetry (link) OpenTelemetry Collector…
Mar 11
•
Ricardo Castro
and
Konstantinos Livieratos
1
Share this post
Revision 70
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 69
Articles and updates: Reinvent Kubernetes Logging with Telemetry Controller (link) Backup Kubernetes using Velero and CSI volume snapshot (link) AWS…
Mar 4
•
Konstantinos Livieratos
and
Ricardo Castro
1
Share this post
Revision 69
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
February 2024
Revision 68
Articles and updates Resend Incident report for February 21st, 2024 (link) Writing an Excellent Postmortem (link) Getting Buy-in from Management on…
Feb 26
•
Ricardo Castro
and
Konstantinos Livieratos
1
Share this post
Revision 68
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 67
Articles and updates Automate Kubernetes Network Security with Falco Talon (link) “Why Are We Having More Incidents?” Causal Loops in Reactions to…
Feb 19
•
Ricardo Castro
and
Konstantinos Livieratos
Share this post
Revision 67
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 66
Articles and updates Kubernetes security best practices: definitive guide for security professionals (link) How the data center site selection process…
Feb 11
•
Ricardo Castro
and
Konstantinos Livieratos
Share this post
Revision 66
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Revision 65
Articles and updates The Scary Thing About Automating Deploys (link) What is instrumentation for observability? (link) Fleet Management at Spotify: The…
Feb 5
•
Ricardo Castro
and
Konstantinos Livieratos
Share this post
Revision 65
embracerisk.substack.com
Copy link
Facebook
Email
Note
Other
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts