You will own the reliability and automation of the core deployment platform.
Responsibilities
- Monitor Harness pipelines, delegates, and logs for anomalies using observability tools.
- Respond to support tickets, Slack alerts, and incident reports in real-time.
- Assist with delegate upgrades, connector configurations, and onboarding of new services.
- Troubleshoot CICD pipeline failures and integration issues across cloud platforms.
- Collaborate with engineering teams to improve developer workflows and platform reliability.
Required Skills
- 5+ years of experience in a DevOps or Platform Engineering role.
- Hands-on experience with Harness pipelines, delegates, and connectors.
- Proficiency with Infrastructure as Code using Terraform and Helm.
- Strong scripting skills in Bash and Python for automation.
- Experience with CI/CD tools including Jenkins and GitLab.
- Familiarity with observability stacks like Grafana, Prometheus, ELK, and Splunk.
- Solid understanding of AWS services (EC2, IAM, S3, Lambda, CloudWatch).
- Experience managing environments in a private cloud setup.
- Knowledge of GitOps principles and container orchestration (Kubernetes/EKS/GKE).
Preferred Skills