Course Outline

Foundations of Cloud Operations on AWS

  • Operational roles and responsibilities in the cloud
  • AWS account structure, organizations, and multi-account strategy
  • Core operational services: CloudWatch, CloudTrail, AWS Config

Infrastructure as Code and Provisioning

  • Principles of IaC and immutable infrastructure
  • Provisioning with Terraform and AWS CloudFormation
  • Managing state, modules, and environment promotion

CI/CD and Deployment Strategies

  • Designing CI/CD pipelines for cloud-native apps
  • Blue/green, canary, and rolling deployments
  • Automating rollback, health checks, and release validation

Monitoring, Observability, and Alerting

  • Metrics, logs, and traces: ship, store, and analyze
  • Using CloudWatch, X-Ray, and third-party observability tools
  • Defining SLOs/SLIs, alerting policies, and on-call practices

Security Operations and Identity Management

  • IAM best practices, least privilege, and cross-account access
  • Secrets management, KMS, and secure parameter stores
  • Operational security: patching strategies, vulnerability scanning, and audit trails

Resilience, Backup, and Disaster Recovery

  • Designing for fault tolerance and high availability
  • Backup strategies, snapshot automation, and restore procedures
  • Disaster recovery planning and runbook creation

Cost Optimization and Governance

  • Cost visibility: billing, tagging, and cost allocation strategies
  • Rightsizing, reserved instances/savings plans, and budgeting controls
  • Governance: policies, guardrails, and automation for compliance

Containers, Serverless, and Runtime Operations

  • Operational considerations for ECS, EKS, and Lambda
  • Service discovery, autoscaling, and resource limits
  • Logging, tracing, and debugging containerized workloads

Incident Response, Playbooks, and Chaos Engineering

  • Runbook-driven incident response and postmortem practices
  • Automating remediation and self-healing patterns
  • Intro to chaos experiments for validating resilience

Hands-on Workshop: Operate a Sample Workload

  • Deploy a sample application using IaC and a CI/CD pipeline
  • Implement monitoring, alerts, and an automated remediation script
  • Simulate incidents and practice runbook-based response

Summary and Next Steps

Requirements

  • A basic understanding of cloud concepts and networking
  • Familiarity with Linux command line and scripting
  • Experience with source control (Git) and basic CI/CD concepts

Audience

  • Cloud operations engineers
  • SREs and platform engineers
  • DevOps engineers and technical team leads
 21 Hours

Delivery Options

Private Group Training

Our identity is rooted in delivering exactly what our clients need.

  • Pre-course call with your trainer
  • Customisation of the learning experience to achieve your goals -
    • Bespoke outlines
    • Practical hands-on exercises containing data / scenarios recognisable to the learners
  • Training scheduled on a date of your choice
  • Delivered online, onsite/classroom or hybrid by experts sharing real world experience

Private Group Prices RRP from £5700 online delivery, based on a group of 2 delegates, £1800 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.

Contact us for an exact quote and to hear our latest promotions


Public Training

Please see our public courses

Testimonials (5)

Provisional Upcoming Courses (Contact Us For More Information)

Related Categories