← Back to jobs

Site Reliability Engineering Senior Lead

CareerNet Technologies Pvt Ltd

Bangalore, Karnataka, India

Posted On: 30+ days ago

Experience: 10+ years

Availability: Onsite

Openings: 1

Category: Site reliability engineering

Tenure: No Preference/Any

Related Jobs

No related jobs found

Description

You own and drive reliability outcomes at scale for real-time, distributed payment and transaction processing platforms meeting strict SLAs, SLOs, and regulatory requirements.

Responsibilities

Define reliability architecture and standards across services, platforms, and infrastructure.
Design and evolve enterprise-grade observability platforms (metrics, logs, traces, SLOs/SLIs).
Lead incident response for high-severity production issues, driving root-cause analysis and fixes.
Set strategy and drive adoption of SRE best practices, including error budgets and capacity modeling.
Architect automation platforms to eliminate toil and enable safe production releases.

Required Skills

10+ years of software engineering experience building large-scale, distributed systems.
Significant experience operating mission-critical systems in Payments, FinTech, or Banking environments.
Expertise in AWS, Azure, and GCP.
Proficiency with Prometheus, Grafana, and Datadog for monitoring.
Strong command of Linux and Python.
Experience with configuration management using Ansible.

Preferred Skills

Experience with Splunk, ELK stack, or Oracle RDMS.
Familiarity with CI/CD platforms and release automation in regulated environments.

Key Skills

Aws Azure Gcp Prometheus Grafana Datadog Linux Python Ansible

Education

Any Graduate

Related Jobs

No related jobs found

← Back to jobs

Site Reliability Engineering Senior Lead

Related Jobs

Description

Responsibilities

Required Skills

Preferred Skills

Key Skills

Education

Related Jobs

Explore More Jobs