See all roles

Team Lead, Site Reliability Engineering - Storage Layer Service

Work from home Full-time role Hiring

About the position MongoDB’s Storage Layer Services (SLS) team is re-architecting the MongoDB cloud storage layer and sits at the heart of our next-generation cloud storage architecture. This relatively new team is building performant, multi-tenant distributed storage services that both enhance today’s Atlas storage stack and enable more customer workloads to run more efficiently. As the Lead Site Reliability Engineer for SLS, you will partner with the teams building these storage services to define SLOs, shape capacity plans, and ensure the reliability, durability, and operational safety of the storage layer that underpins Atlas. You’ll help grow and lead a small, senior team of SREs as founding members of this organization, playing a crucial role in executing on a multi-year roadmap for MongoDB’s cloud storage architecture. This role can be based in our New York City office (hybrid working model) or remotely on the East Coast.

Responsibilities

  • Build and lead a team of 6-8 engineers, fostering a positive culture, handling career growth and performance conversations, and proactively removing blockers
  • Define and drive a clear technical vision and comprehensive roadmap for our multi-tenant distributed storage systems, balancing long-term strategic infrastructure goals with immediate engineering needs
  • Contribute through hands-on technical work, such as leading architectural design reviews, reviewing PRs, and stepping in to guide the team through complex operational challenges
  • Act as the primary liaison for the Storage Layer Services SRE team, collaborating closely with other engineering leaders to ensure platform alignment and manage stakeholder expectations

Requirements

  • Have 10+ years of experience working on software and operating distributed systems, with 2+ years managing engineering teams
  • Possess a customer-focused mindset, treating internal developers as your primary users
  • Value efficiency in processes and operations, and have a track record of optimizing team workflows
  • Prefer automation over manual processes, fostering a culture of building software solutions to eliminate toil
  • Have deep technical familiarity with Kubernetes ecosystems, containerization technologies, and modern IaC tooling (e.g., Terraform, Crossplane, or Operators) so you can effectively guide the team's technical decisions
  • Have operated or supported stateful storage or database systems at scale and are comfortable with durability, consistency and recovery trade-offs
  • Excel at translating complex business and engineering requirements into actionable, phased technical roadmaps
  • Have a high level of empathy, responsibility, ownership, and accountability
  • Excellent verbal and written technical communication skills

Nice-to-haves

  • Leading major architectural shifts, such as moving from legacy storage stacks to new multi-tenant storage architectures, including planning and executing large-scale data and workload migrations with tight availability and durability requirements
  • Managing and scaling infrastructure across multi-cloud environments (AWS, GCP, or Azure)
  • Designing secure, multi-tenant runtime environments at scale

Benefits

  • equity
  • participation in the employee stock purchase program
  • flexible paid time off
  • 20 weeks fully-paid gender-neutral parental leave
  • fertility and adoption assistance
  • 401(k) plan
  • mental health counseling
  • access to transgender-inclusive health insurance coverage
  • health benefits offerings

Apply tot his job Apply To this Job

You might like

Site Reliability Engineer-SkillBridge Intern

Work from home Full-time role

SRE Architect + Strong Dynatrace exp

Work from home Full-time role

Software Engineer – Java, Spring Boot, Kubernetes, AWS

Work from home Full-time role

Senior DevOps Engineer (Kubernetes, Docker, Jenkins)

Work from home Full-time role

Staff Software Development Engineer-Kubernetes

Work from home Full-time role

Principal Cisco Network Systems Engineer

Work from home Full-time role

Cloud Engineer -(Network Cloud Engineer – Alibaba Cloud)_(Memphis, TN -Only USC/GC on W2)

Work from home Full-time role

Network Engineer job at AnewHealth in OH

Work from home Full-time role

Network Engineering with AI - (Hyperscaler, HyperNet) - (REMOTE)

Work from home Full-time role

Senior Network Engineer | Upto $70/hr

Work from home Full-time role

Enterprise Sales Lead Crypto Native

Work from home Full-time role

Experienced Virtual Chat Assistant – Delivering Exceptional Customer Service in a Fast-Paced Environment

Work from home Full-time role

AI & Software Senior Developer – Integration

Work from home Full-time role

Student Records Data Entry Clerk

Work from home Full-time role

Experienced Spanish Bilingual Customer Service Representative – Remote Customer Care Opportunities

Work from home Full-time role

Experienced Part-Time Remote Customer Support Representative – Delivering Exceptional Travel Experiences with arenaflex

Work from home Full-time role

Experienced Customer Service Representative – Remote Opportunity at arenaflex

Work from home Full-time role

Experienced Part-Time Remote Data Entry Clerk – Endless Opportunities for Growth and Development at arenaflex

Work from home Full-time role

Experienced and Enthusiastic Chat Online Support Representative – Part-Time Remote Opportunity at arenaflex

Work from home Full-time role

Sr. Account Manager, Grocery - Active Nutrition

Work from home Full-time role