Provide application support and software development to improve system availability, resiliency, and performance.
Responsibilities
- Design, code, and test software to automate manual operational tasks.
- Troubleshoot production incidents and perform root-cause analysis for permanent closure.
- Implement self-healing patterns and software-driven alerting to meet service level objectives.
- Participate in SR/DR/HA exercises to validate resiliency assumptions.
Required Skills
- 3+ years of experience in application support or development.
- Proficiency with Ab Initio and Hadoop ecosystems.
- Experience with Spark and Scala.
- Strong UNIX Shell scripting skills.
- Hands-on experience with Oracle (v9i/10/11) and SQL/PL/SQL stored procedures.
- Experience using Control-M or AutoSys scheduling packages.
- Knowledge of Hive and Impala.
- Competency in Python, Java, or Maven.