Description

You will provide second-line support for ATM hardware and applications while maintaining operational stability.

Responsibilities

  • Monitor production environments for anomalies using Splunk, Dynatrace, and Grafana, addressing identified issues.
  • Lead and drive the identification and implementation of automation opportunities across applications and infrastructure.
  • Support incident, problem, and change management processes for applications and infrastructure.
  • Conduct Root Cause Analysis (RCA) and identify monitoring gaps, implementing solutions using observability tools.
  • Partner with various support groups and product owners to resolve defects and address operational support questions from vendors.

Required Skills

  • 5+ years of experience with Splunk, Dynatrace, and Grafana.
  • Expertise in event, change, incident, and problem management.
  • Strong proficiency in at least one scripting language (Java, Python, or PowerShell).
  • Working understanding of public cloud environments, specifically AWS.
  • Experience working with ticket systems like ServiceNow and Jira.
  • Demonstrated automation mindset focused on continuous improvement.
  • Ability to work in a production environment and provide weekend support as part of shift coverage.

Preferred Skills

  • Familiarity with Kubernetes or Docker.
  • Experience with GCP cloud environments.

Education

Any Graduate