Member-only story

A Journey To The Site Reliability Engineering

Tanmay Deshpande
7 min readMay 11, 2021

--

Photo by Mukuko Studio on Unsplash

Many organizations have started adopting Site Reliability Engineering(SRE) practices to run their operations instead of traditional. The latest LinkedIn Job search show 190,000+ job openings for Site Reliability Engineers worldwide.

LinkedIn Job Search Screen Shot — Image By Author

If you are still no familiar with SRE, then here is how Google describes it —

SRE is what happens when you ask a software engineer to design an operations team.

SRE is defined by 7 important principles — * Operations is a software problem* Managed by Service Level Objectives * Work to minimize the toil* Automate this year’s job away* Move fast by reducing the cost of failure* Share ownership with the developers* Use the same tooling, regardless of function or job title 

Site Reliability Engineering could be a great career move for individuals coming from backgrounds like Operations Support, System Admin, Infra Architecture, DevOps Engineers, etc.

In this article, I plan to talk about various resources you can use to start your journey to be a Site Reliability Engineer.

Mastering the Art of Service Level Objectives(SLOs)

It is essential to start your journey with understanding the concepts of Service Level Indicators(SLIs) and Service Level Objectives(SLOs).

SLI: A quantifiable measure of service reliability

SLO: Set a reliability target for an SLI

There are plenty of resources that talk about SLIs & SLOs but I would recommend using the Art Of SLOs workshop to deeply understand the concept.

If you are part of an organization that is trying to adopt SRE practices then. I will recommend doing this workshop internally for your aspiring SREs.

--

--

Tanmay Deshpande
Tanmay Deshpande

Written by Tanmay Deshpande

I write about technology in simple words!

No responses yet

Write a response