Lead DevOps Engineer - AKS
Job Description
About Us:
ClearRoute is an engineering consultancy bridging Quality Engineering, Cloud Platforms and Developer Experience. We work with technology leaders facing complex business challenges.
We take as much pride in our people, culture and work-life balance as we do in making better software. We’re not just making better software. We’re making the making of software better. Collaborative, entrepreneurial and dedicated to problem-solving, we bring the step change our customers need to sustain innovation. Our values challenge us to do the best we can for ClearRoute, our customers and most importantly our team. We want to create a collaborative team to help build ClearRoute. This is an opportunity for you to build the organisation from the ground up, use your voice to drive change and help transform organisations and problem domains.
Job Role
We are looking for a skilled Cloud/Platform Engineer with strong expertise in Azure Kubernetes Service (AKS) to design, build, and manage scalable, production-grade infrastructure. The ideal candidate will have hands-on experience with AKS provisioning, GPU-backed environments, and exposure to integrating AI/ML services within cloud-native architectures.
Key Responsibilities:
Design, provision, and manage production-grade AKS infrastructure
Build and maintain scalable, secure, and highly available Kubernetes environments on Azure
Implement and manage GPU-backed infrastructure to support AI/ML workloads
Integrate and manage AI services such as LiteLLM with Azure OpenAI and Google Vertex AI
Develop and maintain infrastructure as code (IaC) using tools like Terraform or ARM templates
Collaborate with engineering teams to optimize application deployment, scalability, and performance
Set up and maintain CI/CD pipelines for automated deployments
Monitor system performance, troubleshoot issues, and ensure reliability and uptime
Implement best practices around security, networking, and cost optimization
Required Skills & Experience
Strong hands-on experience with Azure Kubernetes Service (AKS) in production environments
Experience in provisioning and managing AKS clusters
Solid understanding of Kubernetes architecture, networking, and scaling
Experience working with GPU-enabled infrastructure for compute-intensive workloads
Exposure to AI/ML integrations, including tools like LiteLLM, Azure OpenAI, or Google Vertex AI
Proficiency in Infrastructure as Code (Terraform preferred)
Experience with CI/CD tools (Azure DevOps, GitHub Actions, Jenkins, etc.)
Strong knowledge of Docker and containerization
Familiarity with monitoring and logging tools
Good To Have:
Experience with multi-cloud environments (Azure + GCP)
Knowledge of MLOps practices
Exposure to service mesh, ingress controllers, and API gateways
Understanding of cost optimization strategies in cloud environments
- Department: Engineering
- Locations: Bengaluru
- Remote status: Hybrid