Description
Senior SRE Engineer with Java expertise to own data collection, analysis, and dashboard creation. You will leverage development experience to identify scenarios and improve system reliability.
Responsibilities
- Design and maintain data collection pipelines for system observability.
- Analyze operational data and build actionable dashboards for monitoring.
- Identify failure scenarios using Java development background to strengthen SRE practices.
- Collaborate on infrastructure improvements to support data-intensive workloads.
Required Skills
- 9+ years of total engineering experience.
- At least 5 years of dedicated Site Reliability Engineering (SRE) experience.
- Strong proficiency in Java and J2EE development.
- Hands-on experience with MongoDB for data storage and retrieval.
- Proficiency with Kafka for streaming and data ingestion.
- Experience creating and maintaining operational dashboards.
Preferred Skills
- Experience bridging development and operations teams.
- Background in building data collection tools from scratch.