Site Reliability Engineer (SQCE)
Role Description
Site Reliability of our Development, Test & Prod environments hosted in Azure
Driving operational excellence for Payments Cloud services to deliver an "always on" operation, year-round, at the right cost
Rollout of Infrastructure, Operating System and Application updates with no impact to consumers
Experience with implementing end to end monitoring & alerting
Implementing and Delivering robust Infrastructure as code
Managing desired state configuration of Java Applications hosted on Cloud
Leading Root Cause Analysis through Blameless Post Mortems of Incidents and Failure Mode Analysis
Should prepare Run Books, Training Material and conducted sessions
Converts OPS issues into Stories to fix root cause
Key Responsibilities
Own value stream and application issue resolution to completion
- Department
- Engineering
- Locations
- Bengaluru
- Remote status
- Hybrid
- Employment type
- Full-time