You will lead the configuration, maintenance, and optimization of application monitoring systems.
Responsibilities
Drive the standardization and management of the unified configuration management database (CMDB).
Collect and aggregate data to support decisions across ITIL processes (configuration, event, capacity, availability, incident, and problem management).
Assess and fine-tune monitoring capabilities to provide accurate and actionable alerts for 24x7 operations systems.
Configure and maintain monitoring dashboards to track health and performance across diverse IT infrastructure components.
Perform event correlation and filtering to streamline incident triage and ensure timely escalation.
Required Skills
Minimum 7 years of relevant experience.
Minimum 2 years managing OpenText suite tools (AI Operations Management, Operations Bridge, SiteScope, Optic).
Expertise with management protocols including SNMP and WMI.
Proficiency in scripting using PowerShell and/or VBScript.
Experience managing monitoring systems with 250+ hosts and/or 3000+ sensors.
Experience operating monitoring solutions such as Zenoss, PRTG, Zabbix, and/or Nagios.
Extensive experience monitoring servers, storage, databases, networks, and applications.
Knowledge of ITIL processes, including availability and capacity management.
Familiarity with multi-vendor server operating systems.