Who We Are:
Osmos, a B2B SaaS company founded by ex-Amazon ad-tech experts, is revolutionizing retail media with an AI-powered operating system that increases retailer profitability (by up to 7% of sales) and delivers superior ROAS for brands. By enabling Tier 1 retailers and marketplaces worldwide to activate more brands and leverage advanced targeting, we help them secure a lasting competitive edge.
Your Impact:
We are seeking a Staff DevOps Engineer to architect and maintain highly available global infrastructure that serves high-QPS systems at 99.99% uptime. The role requires expertise in managing multi-region deployments, building fault-tolerant systems, and scaling mission-critical applications.
What You'll Do:
Architect, manage, and scale Kubernetes clusters for high throughput and low latency across multiple global regions.
Design and maintain Infrastructure as Code (IaC) to support a fault-tolerant, globally distributed architecture.
Build and optimize CI/CD pipelines to ensure smooth, zero-downtime deployments.
Ensure 99.99% availability for high-QPS applications by implementing robust monitoring, incident management, and failover strategies.
Manage multi-region deployments to enable low-latency, geo-redundant infrastructure.
Collaborate with cross-functional teams to ensure security, scalability, and operational efficiency.
Lead and mentor a high-performing DevOps team, fostering a culture of excellence and innovation.
You'll Thrive If You Have:
7–10 years of experience managing large-scale, high-availability systems.
Proven expertise in Kubernetes administration, including multi-region deployments and scaling for high-QPS workloads.
Deep experience with IaC tools like Terraform or CloudFormation.
Hands-on experience with CI/CD pipelines for global, multi-region deployments.
Strong understanding of cloud platforms (AWS, GCP, or Azure) and geo-redundant architecture.
Proficiency in Linux, scripting (Bash, Python), and troubleshooting large-scale distributed systems.
Experience leading teams and solving complex, production-grade system challenges.
Why Choose Osmos?
Startup Energy, Enterprise Scale: Fast-paced innovation with global ambition
Revolutionize Retail Marketing: Be at the forefront of AI-powered solutions
Meaningful Contribution: Directly impact major brands' success
No Red Tape: Autonomy and empowerment to drive results
Growth & Fun: Continuous learning in a vibrant, collaborative culture
Competitive Rewards: We value your expertise and offer strong compensation
Ready to champion Infra & Cloud? Let's chat.