Design and implement the roadmap for Platform Ops, focusing on AI for IT Operations and machine learning to automate operational workflows.
Responsibilities
Establish architecture governance, design standards, and policies that prioritize security, reliability, and scalability across private cloud and hyperscalers.
Automate IT operational processes for service provisioning, configuration, and management.
Design and implement enterprise-level solutions for infrastructure and observability.
Collaborate with application development and infrastructure teams to support business processes.
Required Skills
5+ years of experience in enterprise architecture or platform engineering.
Infrastructure-as-Code (IaC) expertise using Terraform and Ansible.
Proficiency with CI/CD tools including GitLab CI/CD and Jenkins.
Hands-on experience with Kubernetes and container orchestration at scale.
Observability and performance monitoring experience with Dynatrace, Splunk, Datadog, or Grafana.
Experience managing service workflows within ServiceNow.
Knowledge of both on-premises and cloud-based vendor technologies.