Principal SRE based in Charlotte, NC, owning platform automation, distributed systems reliability, and SRE best practices.
Responsibilities
Lead the SRE charter by driving automation, efficiency, and best practices in platform change management and operations.
Design, build, and implement orchestration and tooling solutions to optimize workflows and minimize defects.
Establish operational best practices for structuring, automating, deploying, and monitoring complex distributed software products.
Collaborate with engineering teams to triage alerts, diagnose critical issues, and manage the implementation of changes.
Mentor lead, senior, and staff SREs in adopting a DevSecOps culture and implementing system design improvements.
Required Skills
7+ years of experience automating tasks, building cloud-native software in microservice architectures, and writing tools in Python, Go, or Ruby.
Advanced knowledge in at least three of the following areas: cloud-native/IaaS architecture (Azure preferred, or GCP/AWS), design (compliance, security), cloud engineering, container orchestration, or microservice engineering.
Deep expertise in SRE and DevOps philosophies, technologies, platforms, tools, SLA management, and incident resolution.
Advanced hands-on experience implementing and supporting streaming.
Proven ability to design, build, and maintain cloud-based environments for massive-scale data processing using IaaS, PaaS, and SaaS.
Experience with design specifications and systems engineering.
Authorization to work on the client's W-2 without sponsorship, now or in the future.
Ability to commute to Charlotte, NC; Phoenix, AZ; or Dallas, TX three days a week.