See all roles

Manager, Site Reliability Engineering and Incident Management

Work from home Full-time role Hiring

Planet DDS is a leading provider of a platform of cloud-based solutions that empowers growth-minded dental businesses. Now serving over 13,000 practices and 118,000 customers in North America, Planet DDS delivers a comprehensive suite of solutions, including Denticon Practice Management, Cloud 9 Ortho Practice Management, and Apteryx Cloud Imaging. Planet DDS is dedicated to enabling dental support organizations (DSOs) and groups to grow and thrive with technology that delivers seamless integrations, improved workflows, and future-proof scalability. We are seeking a Manager, Site Reliability Engineering and Incident Management, to manage our Site Reliability Engineering function as well as our external incident response function for our production operations. To be successful, the manager will need to be self-motivated, communicate clearly, and operate with a sense of urgency in a fast-paced environment. Providing operational support means that you will leverage your customer empathy to production incidents and to any other internal engineering-related support requests. It will be crucial for you to gain a deep understanding of our systems and architecture and build a hands-on knowledge of support and observability tooling. You will need to be available to engage in any incident escalations 24x7. You will need to seek answers from subject matter experts in a variety of positions from architects to support staff, business leaders, and technically minded developers.

  • Location: East Coast (US)

Job Duties

  • Team Leadership & Development
  • Lead and mentor a team of SREs, Incident Managers, and Release Managers.
  • Foster a culture of reliability, accountability, and continuous improvement.
  • Collaborate with engineering teams to design resilient platform architectures.
  • Incident Management
  • Oversee the incident response process for outages and service disruptions.
  • Ensure timely detection, escalation, and resolution of incidents.
  • Drive post-incident reviews (PIRs) and root cause analysis.
  • Implement improvements based on lessons learned to prevent recurrence.
  • Operational Excellence
  • Mature and enforce best practices for incident response and observability.
  • Automate operational tasks to reduce toil and improve efficiency.
  • Maintain observability tools (monitoring, alerting, logging).
  • Process & Governance
  • Define and maintain incident management policies and escalation procedures.
  • Drive initiatives for chaos engineering, capacity planning, and disaster recovery testing.

Skills and Qualifications

  • 7+ years in SRE, DevOps, or Infrastructure roles.
  • 3+ years in Incident Management leadership.
  • Deep understanding of reliability, scalability, and performance optimization.
  • Multi-cloud expertise in AWS, Azure, or GCP.
  • Understanding of DNS, load balancing, firewalls, and compliance frameworks.
  • Security is part of everything we do and will require your knowledge of fundamental cloud security (e.g., identity and access management, firewalls, etc.)
  • Deep understanding of logging and monitoring and security best practices
  • Strong collaboration and communication skills
  • Bachelor’s Degree in a relevant major or equivalent years of experience is a plus

Any of the following would be a plus:

  • Dental industry knowledge
  • Experience working in B2B SaaS companies
  • Experience with cloud containers, specifically Kubernetes

Benefits:

  • Medical, dental and vision insurance
  • Health Savings Account
  • Flexible Spending Accounts
  • Telehealth
  • 401(k) and 401(k) match
  • Life and AD&D insurance
  • Short-Term and Long-Term Disability
  • FTO or Vacation
  • Sick Time
  • Employee Well-Being program
  • 11 paid holidays
  • Volunteer Time Off
  • Employee Referral program
  • Additional perk and voluntary benefit programs

Salary is based on a number of factors and may vary depending on job-related knowledge, skills, and experience. This position is also eligible for variable pay as part of the total compensation package. PLANET DDS CORE IDEOLOGY To encourage measurable progress toward our vision and make the best decisions on behalf of employees and customers, we adopted a set of common values:

  • Collaborative – Working independently and across teams, we create scalable solutions to enable company growth
  • Empathetic – We are educated on the experience of our customers and feel vested in their success
  • Accountable – We feel ownership for the quality of our work and take pride in the positive outcomes
  • Trustworthy – We operate with integrity and honest, making promises we know that we can keep
  • Ambitious – We are driven by our ability to make a long-term, positive impact on the lives of dental market leaders

Planet DDS is an Equal Opportunity Employer – Including Disability/Veterans Apply tot his job Apply To this Job

You might like

XTN-B1B2293 | SITE RELIABILITY ENGINEER

Work from home Full-time role

Senior Software Engineer: Site Reliability (SRE)

Work from home Full-time role

Kubernetes & Database Migration Engineer- Remote EST hours

Work from home Full-time role

Site Reliability / Gitops Engineer

Work from home Full-time role

Senior Integration - Site Reliability Engineer (SRE)

Work from home Full-time role

SRE Engineer

Work from home Full-time role

Senior Site Reliability Engineer (Compute Node Team)

Work from home Full-time role

Senior Principal Site Reliability Engineer | Oracle Health Federal Operations Team

Work from home Full-time role

DevOps Site Reliability Engineer

Work from home Full-time role

Cloud Site Reliability Engineer

Work from home Full-time role

EdTech Co-Founder / CTO (100 % remote) (m/f/d)

Work from home Full-time role

Experienced Full Stack Data Entry Specialist – Remote Operations Support

Work from home Full-time role

Registered Nurse - Behavioral Health Utilization Review Consultant

Work from home Full-time role

Senior Clinical Project Manager - $15K sign on bonus being offered!

Work from home Full-time role

Experienced Email and Chat Support Professionals Wanted – Remote Opportunities for Career Growth and Flexibility

Work from home Full-time role

Experienced Customer Experience Specialist, Part-Time (Remote) - Beauty and Grooming Industry

Work from home Full-time role

Commercial Contracts Manager, Ford Energy

Work from home Full-time role

Strategic Marketing Expert - Category Growth & Market Development Specialist (Part Time)

Work from home Full-time role

ClearML – GenAI Consultant

Work from home Full-time role

Senior Software Engineer, Core Experiences - Miami, FL, USA

Work from home Full-time role