You will manage and scale enterprise infrastructure using virtualization, containerization, and automation technologies.
Responsibilities
- Operate and troubleshoot test infrastructure using reliability targets such as SLOs and error budgets.
- Design and maintain enterprise virtualization and container environments on platforms such as VMware, OpenStack, and Kubernetes.
- Build and troubleshoot Jenkins-based CI pipelines using Groovy.
- Manage core network and identity services including LDAP, Active Directory, DNS, DHCP, NIS, and Kerberos.
- Scale systems through Python or Java scripting and automation to improve operational reliability.
Required Skills
- 5+ years of experience as a Site Reliability Engineer, DevOps Engineer, or Infrastructure Engineer.
- Proficiency in coding with Python, Java, C, C++, or Go.
- Hands-on experience with Docker and Kubernetes orchestration.
- Experience with Jenkins and Groovy for CI pipeline delivery.
- Practical knowledge of Infrastructure as Code tools such as Ansible, Terraform, or CloudFormation.
- Experience managing Layer 2/Layer 3 hybrid networks and services like DNS, DHCP, and NTP.
- Experience with virtualization technologies including VMware or OpenStack.
- Ability to analyze performance and debug enterprise systems and applications.
- Ability to work a schedule that overlaps by a few hours with US time zones.
Preferred Skills
- Storage administration experience with Pure Storage.
- Understanding of hardware management services like CIMC or UCS Manager.
- Knowledge of Unix/Linux and Windows operating systems.