Description
Lead the transition of networking support from an outsourced model to an automated, data-driven Site Reliability Engineering model.
Responsibilities
- Build and lead an in-house team of networking reliability experts.
- Define the technical vision, strategy, and roadmap for network operations in partnership with infrastructure teams.
- Collaborate with Network Architecture and Engineering to establish runbooks and implement self-healing network capabilities.
- Analyze incident root cause analyses (RCAs) to enrich observability tooling and improve full-stack visibility from the network to applications.
- Influence the architecture of on-prem and cloud-based networks.
Required Skills
- 10+ years of experience in system design, network architecture, network engineering, or network operations.
- 8+ years of leadership experience.
- Proven track record of building and growing geographically distributed teams.
- Ability to perform technical deep-dives into code, networking, operating systems, and storage.
- Experience communicating technical strategy to executive leadership and subject matter experts.
- Bachelor’s degree in Computer Science, a related technical field, or equivalent experience.
Preferred Skills
- Experience transforming network operations using software-driven methods.
- Experience working at a hyperscale cloud service provider.
- Knowledge of SRE principles, including observability, SLOs, SLIs, and logging.