You will manage and scale cloud infrastructure on AWS in a Linux-based environment.
Responsibilities
Partner with development and cross-functional teams to provide day-to-day support on DevOps activities, including environment setup and build/deployment troubleshooting.
Automate infrastructure and implement Infrastructure as Code (IaC) using tools like Terraform or Ansible.
Build and maintain CI/CD pipelines to enable frequent, reliable software delivery.
Implement observability and monitoring solutions to proactively identify and resolve performance bottlenecks.
Apply Site Reliability Engineering (SRE) practices to ensure high availability and incident response across production environments.
Required Skills
6+ years of experience in DevOps or Site Reliability Engineering managing AWS cloud infrastructure on Linux.
3+ years of backend development proficiency in TypeScript, Java, Python, or Node.js.
Expertise in SQL-based RDBMS such as PostgreSQL or MySQL.
Proven experience with Infrastructure as Code (IaC) using Terraform or CloudFormation.
Hands-on experience building and maintaining CI/CD pipelines (e.g., GitHub Actions, GitLab CI).
Proficiency with observability and monitoring tools such as DataDog or New Relic.
Strong debugging skills across application layers, OS, and cloud infrastructure components.