You will provide technical support and operational assistance to customers running on GCP and to PaaS engineering teams.
Responsibilities
- Troubleshoot and resolve complex GCP technical issues through automation and hands-on debugging.
- Build custom tools from scratch to meet specific operational requirements.
- Design and manage predictive alerting platforms using monitoring and observability tools.
- Assist customers with cloud migrations and help them meet their operational needs.
- Document solution specifications, architecture diagrams, operating procedures, and test plans.
Required Skills
- 5+ years of experience in infrastructure engineering, SRE, or GCP cloud support.
- Proficiency in Python and Bash, with solid general scripting fundamentals.
- Hands-on expertise with GCP.
- Experience with infrastructure-as-code tools such as Terraform, Cloud Deployment Manager, Ansible, or Chef.
- Experience with containerization and cluster management using Docker and Kubernetes/GKE.
- Strong Linux system administration skills.
- Knowledge of microservices architecture and modern web services.
- Experience with Git and CI/CD pipelines.
- Proficiency with observability tools for log aggregation, monitoring, and distributed tracing.
Preferred Skills
- Experience with Prometheus, Grafana, Google Cloud Monitoring, or Splunk.
- Familiarity with Pulumi.