Description

Required Skills

 

  • 7+ years of Software Engineering, SRE, Platform Engineering, or DevOps experience
  • 5+ years of Python development experience
  • 3+ years of hands-on AWS experience (EC2, VPC, S3, Lambda, IAM, CloudFormation, EventBridge, Step Functions)
  • Expert-level Terraform and Infrastructure as Code (IaC)
  • Strong CI/CD, DevOps, and automated testing experience
  • Experience with observability tools such as Grafana and CloudWatch
  • Strong knowledge of SRE practices including SLIs, SLOs, error budgets, incident management, and RCA/postmortems
  • Agile/SAFe environment experience

     

Responsibilities

 

  • Develop cloud reliability tools and automation solutions
  • Build and maintain AWS infrastructure using Terraform
  • Create CI/CD pipelines and testing frameworks
  • Define and implement SRE best practices and reliability metrics
  • Support production environments, incident response, and root cause analysis
  • Collaborate with cross-functional teams to improve platform reliability and scalability

Education

Bachelor's degree