Description
You will design, develop, and maintain scalable cloud platforms and drive standardization across development and operations teams.
Responsibilities
- Design and maintain scalable, high-performance platforms.
- Provide technical leadership and mentorship to other engineers.
- Create technical documentation including requirements, test strategies, and deployment plans.
- Troubleshoot and resolve production issues to ensure system stability.
- Optimize system performance, security, and user experience.
Required Skills
- 5+ years of experience in cloud engineering.
- Hands-on experience with AWS and Azure.
- Proficiency with containerization and orchestration using Docker and Kubernetes.
- Strong programming skills in Python, Go, Bash, Java, or C#.
- Experience with monitoring and observability tools such as Dynatrace, Prometheus, Grafana, or Datadog.
- Understanding of distributed systems, high availability, and failure recovery.
- Knowledge of service-level management, incident response, and root cause analysis.
- Familiarity with CI/CD pipelines and automation frameworks.
- Familiarity with chaos engineering practices and tools such as Gremlin or Chaos Monkey.