Description
You will lead the planning, execution, and management of observability infrastructure processing trillions of events daily.
Responsibilities
- Manage observability infrastructure including logging, monitoring, and alerting systems.
- Design and develop scalable software observability platforms for logs, traces, and metrics.
- Develop and maintain Kubernetes Helm charts to deploy hundreds of pods daily.
- Design end-to-end Synthetic Tests Monitoring solutions on GCP with self-service capabilities.
- Participate in on-call rotations to ensure high availability and performance.
Required Skills
- 3+ years of experience as a DevOps Engineer or in an equivalent role.
- Proficiency in Kubernetes and containerization technologies.
- Experience with observability tools including ELK, GrafanaLab, Zabbix, Fluentd, Kafka, Prometheus, and OpenTelemetry.
- Hands-on experience with Docker, Bash, and Python.
- Knowledge of cloud platforms including GCP, AWS, and Azure.
- Familiarity with Infrastructure as Code (IaC) tools such as Terraform and Ansible.
- Bachelor's degree in Computer Science, Engineering, or related field.
Preferred Skills
- Experience with OpenTelemetry Collector and Grafana Agent.
- Strong programming skills in Go.