Lead DevOps Engineer - AKS
Job Description
About Us:
ClearRoute is an is an engineering consultancy bridging Quality Engineering, Cloud Platforms and Developer Experience. We work with technology leaders facing complex business challenges.
We take as much pride in our people, culture and work-life balance as we do in making better software. We’re not just making better software. We’re making the making of software better. Collaborative, entrepreneurial and dedicated to problemSolving, we bring the step change our customer need to sustain innovation. Our values challenge us to do the best we can for ClearRoute, our customers and most importantly our team. We want to create a collaborative team to help build ClearRoute. This is an opportunity for you to build the organization from the ground up, use your voice to drive change and help transform organisations and problem domains.
Job Role
We are seeking a Senior DevOps / Platform Engineer with strong hands-on experience in building and operating scalable, secure, and highly available cloud-native platforms for a SaaS-based retail banking solution. The role involves designing and managing CI/CD pipelines, Kubernetes-based infrastructure, automation, monitoring, and security while closely collaborating with engineering teams to support application, infrastructure, and network needs. You will be responsible for improving platform reliability, observability, and security, enabling developers through tooling and standards around containers, service mesh, and infrastructure as code, and ensuring production systems meet high availability and defect SLAs. This role requires deep expertise in cloud platforms, distributed systems, CI/CD, container orchestration, and scripting, along with the ability to communicate technical solutions effectively in a global, collaborative environment.
Key Requirements:
Designing and managing highly scalable, reliable and fault tolerant CI/CD Pipeline infrastructure & networking that forms the backbone of SaaS Based Retail Banking Solution at Finastra.
Improve overall Monitoring and security posture of infrastructure/application by implementing protective measures, efficiently in tandem with better ROI and TCO.
Work along with the Dev Engineering teams to help with Application / Infrastructure / Network automation and long-term business needs.
Research and evaluate parallel products available, define & govern application/ infrastructure baselines
Communicate, collaborate and work effectively across distributed teams in a global environment.
Operate to strengthen teams across their product with their knowledge base.
Research and implement toolsets that help developers use Containers, Kubernetes and Service Mesh
Develop tools for developers, operations and release teams to use Kubernetes and Service Mesh with ease
Ensure Platform Security and Monitoring using tools like Prometheus/Grafana etc and implement Monitoring and Security best practices
Have a passion for delivering zero-defect and highly resilient code and be responsible for ensuring the team's deliverables exceed the prescribed availability and defect SLA
Present technical solutions, capabilities, considerations, and features in business terms
Groom user stories into detailed development tasks
Effectively communicate status, issues, and risks in a precise and timely manner.
Required Skills & Experience
Should have at least 8 to 15 years of hands-on experience on SaaS / IaaS. Hands-on experience with DevOps techniques building continuous integration solutions
using Ansible, Bash, Docker, Git, MavenExperience with Load Balancing, Rate Limiting, Traffic Shaping and managing connectivity between Applications and Networks.
Deep knowledge of Linux as a production environment. Strong Understanding of Container technologies. e.g. Docker, Infrastructure As Code such as Terraform, K8s administration at large scale.
Strong understanding of cluster orchestrators and schedulers (Kubernetes, Mesos etc)
Excellent bash, and scripting fundamentals and strong hands on with scripting in programming languages such as Spring, Python, Java, Ruby, etc.
Good understanding of distributed system fundamentals and ability to troubleshoot issues in a larger distributed Application infrastructure
Excellent understanding of interactive application development paradigm, memory management, performance/resource optimizations, database interactions, network programming, concurrency and multithreading, fault tolerance, monitoring, security and
operability of systems.Working knowledge on Oracle, DB2, PostgreSQL or Mongo DB databases.
Have worked on production distributed systems and strong understanding of microservices architecture, RESTful services, CI/CD.
Must have Prior experience with at least one Cloud Service provider – Azure AKS, AWS EKS, GCP GKE, OpenShift, Rancher etc.
Experience implementing Kubernetes controllers/operators.
Strong Docker experience
Operational experience deploying and managing Kubernetes
CKA or CKAD certification.
AZ400 (Good to Have)
- Department
- Engineering
- Locations
- Bengaluru
- Remote status
- Hybrid