Member-only story
A Journey To The Site Reliability Engineering
A Comprehensive Guide For Aspiring SREs

Many organizations have started adopting Site Reliability Engineering(SRE) practices to run their operations instead of traditional. The latest LinkedIn Job search show 190,000+ job openings for Site Reliability Engineers worldwide.

If you are still no familiar with SRE, then here is how Google describes it —
SRE is what happens when you ask a software engineer to design an operations team.
SRE is defined by 7 important principles — * Operations is a software problem* Managed by Service Level Objectives * Work to minimize the toil* Automate this year’s job away* Move fast by reducing the cost of failure* Share ownership with the developers* Use the same tooling, regardless of function or job title
Site Reliability Engineering could be a great career move for individuals coming from backgrounds like Operations Support, System Admin, Infra Architecture, DevOps Engineers, etc.
In this article, I plan to talk about various resources you can use to start your journey to be a Site Reliability Engineer.
Mastering the Art of Service Level Objectives(SLOs)
It is essential to start your journey with understanding the concepts of Service Level Indicators(SLIs) and Service Level Objectives(SLOs).
SLI: A quantifiable measure of service reliability
SLO: Set a reliability target for an SLI
There are plenty of resources that talk about SLIs & SLOs but I would recommend using the Art Of SLOs workshop to deeply understand the concept.
If you are part of an organization that is trying to adopt SRE practices then. I will recommend doing this workshop internally for your aspiring SREs.