Course Outline
SRE Anti-patterns
- Identifying counterproductive practices
- Recognizing the impact of anti-patterns on reliability
- Best practices and corrective alternatives
SLO as a Proxy for Customer Satisfaction
- Defining Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
- Managing error budgets and balancing innovation with reliability
- Understanding limits of distributed systems
Building Secure and Reliable Systems
- Designing for fault tolerance and resilience
- Integrating security into reliability engineering
- Scalability and data protection strategies
Full-stack Observability
- Instrumentation and metrics collection
- Distributed tracing and synthetic monitoring
- Observability-driven development
Platform Engineering and AIOps
- Platform-centered engineering approaches
- Automation and orchestration in SRE
- Leveraging DataOps and operational intelligence
Incident Management in SRE
- Roles and responsibilities in incident response
- Applying frameworks such as OODA
- Automated remediation and AI/ML-assisted resolution
Chaos Engineering
- Principles and strategies for resilience testing
- Planning and executing “game day” exercises
- Learning from controlled failure experiments
SRE as a Pure Form of DevOps
- Integrating SRE into DevOps workflows
- Cultural alignment and collaboration practices
- Driving organizational transformation through SRE
Post-class Exercises
- Large-scale system design case studies
- Advanced instrumentation and monitoring scenarios
- Real-world reliability problem-solving
Review and Exam Preparation
- Final review of the DevOps Institute SRE Practitioner syllabus
- Sample questions and practice tests
- Exam-taking strategies and recommendations
Summary and Next Steps
Requirements
- Understanding of core Site Reliability Engineering principles
- Experience with DevOps practices and related tools
- Familiarity with system monitoring, incident management, and automation
Audience
- SRE professionals seeking DevOps Institute SRE Practitioner certification
- DevOps engineers aiming to expand into reliability-focused roles
- Operations leaders responsible for reliability strategy and execution
Delivery Options
Private Group Training
Our identity is rooted in delivering exactly what our clients need.
- Pre-course call with your trainer
- Customisation of the learning experience to achieve your goals -
- Bespoke outlines
- Practical hands-on exercises containing data / scenarios recognisable to the learners
- Training scheduled on a date of your choice
- Delivered online, onsite/classroom or hybrid by experts sharing real world experience
Private Group Prices RRP from £9500 online delivery, based on a group of 2 delegates, £3000 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.
Contact us for an exact quote and to hear our latest promotions
Public Training
Please see our public courses
Testimonials (4)
The break down of what DevOps can do. Possible Automation Integration.
Adeyinka Adekoya - NTPF
Course - Continuous Testing Foundation (CTF)®
working with DevOps Toolchain
Kesh - Vodacom
Course - DevOps Foundation®
new information
Michael Durisin - Deutsche Telekom IT & Telecommunications Slovakia s.r.o
Course - Site Reliability Engineering (SRE) Foundation®
the topic - SRE