Getting started with Site Reliability Engineering (SRE)

, par  abeer486@slideshare.net(abeer486) , popularité : 2%

"Getting started with Site Reliability Engineering (SRE) : A guide to improving systems reliability at production" This is an intro guide to share some of the common concepts of SRE to a non-technical audience. We will look at both technical and organizational changes that should be adopted to increase operational efficiency, ultimately benefiting for global optimizations - such as minimize downtime, improve systems architecture & infrastructure : - improving incident response - Defining error budgets - Better monitoring of systems - Getting the best out of systems alerting - Eliminating manual, repetitive actions (toils) by automation - Designing better on-call shifts/rotations How to design the role of the Site Reliability Engineer (who effectively works between application development teams and operations support teams)

Voir en ligne : https://www.slideshare.net/abeer486...

Sites favoris Tous les sites

84 sites référencés dans ce secteur