Sr. Staff Site Reliability Engineer
Description:
- Define and drive the company-wide reliability strategy across services.
- Establish end-to-end system visibility frameworks for observability, detection, and resilience.
- Partner with DevOps and Platform Engineering leadership to standardize SLI/SLOs and improve reliability practices across teams.
- Serve as a technical escalation expert for reliability issues and incident response.
- Build intelligent detection systems, including anomaly detection and connector health models.
- Enable self-service observability for engineering teams.
- Define and evolve a tiered incident communication strategy.
- Lead postmortems and improve incident response practices to strengthen customer trust.
- Contribute hands-on to system design, monitoring, and debugging across distributed systems and data pipelines.
Requirements:
- 5+ years of experience in SRE, Production Engineering, or a related role.
- 3+ years of experience operating at a senior or technical leadership level, such as Staff scope or equivalent.
- Deep expertise with AWS and/or GCP.
- Experience with Kubernetes and Helm.
- Experience with observability stacks such as Prometheus and Grafana, or equivalent tools.
- Experience with CI/CD systems such as GitLab CI/CD and ArgoCD, or similar tools.
- Proven experience designing and scaling reliability systems for multi-tenant SaaS platforms.
- Strong debugging and systems thinking across distributed microservices and legacy systems.
- Demonstrated ability to lead initiatives that improve incident detection, response, and system resilience.
- Hands-on engineering approach with a track record of building reliability systems, not just configuring them.
- Experience in B2B SaaS serving enterprise or financial customers, preferred.
- Familiarity with third-party SaaS connector architectures and ingestion patterns, preferred.
- Experience building anomaly detection or intelligent alerting systems, preferred.
- Experience designing customer-facing status pages and incident communication frameworks, preferred.
Benefits:
- Competitive compensation with equity and 401(k).
- Comprehensive healthcare with dental and vision coverage.
- Flexible paid time off and paid holiday time off.
- 12 weeks of new parent or family leave.
- Personal and professional development resources.
- Base salary range of $232,000 to $263,000 USD.
- Eligibility for equity awards and possible sales commission or incentive compensation, depending on role or function.
Apply tot his job Apply To this Job