You will manage and automate cloud infrastructure, container orchestration, and deployment lifecycles.
Responsibilities
- Build and maintain infrastructure using Terraform and GitOps workflows.
- Manage Kubernetes clusters (GKE) and deployment lifecycles via Helm and ArgoCD.
- Develop internal tools and automation scripts using Python or Go.
- Implement observability, alerting, and performance monitoring using Prometheus and Grafana.
- Support internal product teams with infrastructure requirements and ensure SOC2 conformance.
Required Skills
- 5+ years of experience in technical infrastructure or SRE roles.
- Strong expertise with Kubernetes, including GKE, Helm charts, and Go templates.
- Proven experience with Terraform and Infrastructure as Code principles.
- Hands-on experience with ArgoCD and GitOps methodologies.
- Proficiency in Python or Go for tool building and automation.
- Deep knowledge of Google Cloud Platform (GCP).
- Experience implementing immutable infrastructure and configuration management.
- Familiarity with SOC2 conformance and SLO/SLI monitoring.
- Degree in any field of study.
Preferred Skills
- Experience managing Apache Airflow at scale.