Back to Browse

Site Reliability Engineering (SRE) Fundamentals

43.0K views
Streamed live on Sep 22, 2022
1:00:49

Join us on September 22nd to learn the Site Reliability Engineering (SRE) principles and practices that you can apply in your organization that enable your systems to be more scalable, reliable, and efficient. Technical Account Manager, Pamella Canova, will lead the session, including:  - The core problems SRE solves and organizational structures to facilitate the practice of SRE - Key principles SREs use to keep systems reliable - Areas of responsibility and expertise amongst SREs - How to adopt SRE best practices in your organization Join, learn, and engage with the Community → https://goo.gle/google-cloud-community 07:06 The SRE approach to operations 09:04 What do SRE teams do? 10:10 SRE and DevOps 11:03 Error budgets: The key principle of SRE 23:57 Practice areas of SRE 24:17 Monitoring and alerting 26:57 Demand forecasting and capacity planning 29:04 Efficiency and performance 30:55 Change management 34:00 Pursuing maximum change velocity 39:55 Provisioning 41:50 Emergency response 44:09 Incident and postmortem thresholds 48:31 Culture of blamelessness 49:55 Toil management / operational work 52:55 Getting started in 4 steps 55:14 Resources and certification information 56:55 Q&A

Download

1 formats

Video Formats

360pmp468.7 MB

Right-click 'Download' and select 'Save Link As' if the file opens in a new tab.

Site Reliability Engineering (SRE) Fundamentals | NatokHD