Tech & Product

Staff DevOps Engineer

Pune
Work Type: Full Time

Who We Are: Osmos, a B2B SaaS company founded by ex-Amazon ad-tech experts, is revolutionizing retail media with an AI-powered operating system that increases retailer profitability (by up to 7% of sales) and delivers superior ROAS for brands. By enabling Tier 1 retailers and marketplaces worldwide to activate more brands and leverage advanced targeting, we help them secure a lasting competitive edge.


Your Impact: 

We are seeking a highly skilled Staff DevOps Engineer to architect and maintain a highly available, global infrastructure capable of handling high QPS systems with 99.99% uptime. The role requires expertise in managing deployments across multiple regions, ensuring fault-tolerant systems, and driving scalability for mission-critical applications.


What You'll Do: 

  • Architect, manage, and scale Kubernetes clusters for high throughput and low latency across multiple global regions.

  • Design and maintain Infrastructure as Code (IaC) to support a fault-tolerant, globally distributed architecture.

  • Build and optimize CI/CD pipelines to ensure smooth, zero-downtime deployments.

  • Ensure 99.99% availability for high QPS applications by implementing robust monitoring, incident management, and failover strategies.

  • Manage multi-region deployments to enable low-latency, geo-redundant infrastructure.

  • Collaborate with cross-functional teams to ensure security, scalability, and operational efficiency.

  • Lead and mentor a high-performing DevOps team, fostering a culture of excellence and innovation.


You'll Thrive If You Have

  • 7–10 years of experience managing large-scale, high-availability systems.

  • Proven expertise in Kubernetes administration, including multi-region deployments and scaling for high QPS.

  • Deep experience with IaC tools like Terraform or CloudFormation.

  • Hands-on with CI/CD pipelines for global, multi-region deployments.

  • Strong understanding of cloud platforms (AWS, GCP, or Azure) and geo-redundant architecture.

  • Proficient in Linux, scripting (Bash, Python), and troubleshooting large-scale distributed systems.

  • Experience leading teams and solving complex, production-grade system challenges.


Why Choose Osmos?

  • Startup Energy, Enterprise Scale: Fast-paced innovation with global ambition

  • Revolutionize Retail Marketing: Be at the forefront of AI-powered solutions

  • Meaningful Contribution: Directly impacts major brands' success

  • No Red Tape: Autonomy and empowerment to drive results

  • Growth & Fun: Continuous learning in a vibrant, collaborative culture

  • Competitive Rewards: We value your expertise and offer strong compensation


Ready to champion Infra & Cloud? Let's chat.

Submit Your Application

You have successfully applied
  • You have errors in applying