Montreal, QC, Canada
You:
• Are interested in distributed systems and working with highly scalable and reliable services.
• Like to work in a fast-moving environment and aren't afraid to change things to make them better.
• Enjoy new technological challenges and solving hard problems.
• Believe a team working well together is smarter than the single smartest person on that team.
• Have grit, drive and a deep sense of ownership.
Responsibilities:
• Working closely with engineering/development teams to design, build, and maintain systems.
• Troubleshooting issues across the entire technology stack: hardware, software, application, and network.
• Identifying and driving opportunities to improve automation for our platforms; scoping and creating automation for deployment, management, and visibility of our services.
• Proactively identifying and addressing systems reliability risks.
• Working alongside existing global and regional team members on a follow-the-sun basis.
• Representing the RPE organization in design reviews and operational readiness exercises for new and existing services.
Qualifications:
Skill Set
• Demonstrated ability to troubleshoot problems and debug to identify root cause.
• Hands-on experience with enterprise tools such as AppDynamics, Grafana, Splunk, or Dynatrace.
• Experience with Ansible, GitHub, or other automation, configuration, or release management tools.
• Automation experience using scripting languages such as Python or shell is particularly valued. Experience with one higher-level language is desired.
• Awareness of, and ability to reason about, modern software and systems architectures, including load balancing, databases, queueing, caching, distributed systems failure modes, microservices, cloud, etc.
• Practical experience running large scale systems is an advantage.
• Ability to contribute to system design and architecture, backed by strong database knowledge.
Any Graduate