You will provide second-line support for ATM hardware and applications while maintaining operational stability.
Responsibilities
- Monitor production environments for anomalies using Splunk, Dynatrace, and Grafana, addressing identified issues.
- Lead and drive the identification and implementation of automation opportunities across applications and infrastructure.
- Support incident, problem, and change management processes for applications and infrastructure.
- Conduct Root Cause Analysis (RCA) and identify monitoring gaps, implementing solutions using observability tools.
- Partner with various support groups and product owners to resolve defects and address operational support questions from vendors.
Required Skills
- 5+ years of experience with Splunk, Dynatrace, and Grafana.
- Expertise in event, change, incident, and problem management.
- Strong proficiency in at least one scripting language (Java, Python, or PowerShell).
- Working understanding of public cloud environments, specifically AWS.
- Experience working with ticket systems like ServiceNow and Jira.
- Demonstrated automation mindset focused on continuous improvement.
- Ability to work in a production environment and provide weekend support as part of shift coverage.
Preferred Skills
- Familiarity with Kubernetes or Docker.
- Experience with GCP cloud environments.