See all roles

Senior Database Site Reliability Engineer

Work from home Full-time role Hiring

About the position We are seeking a Database Site Reliability Engineer who demonstrates a strong skill set in managing PostgreSQL. In this role, you will own the reliability and operability of our PostgreSQL services supporting a growing 24x7 SaaS platform, with an emphasis on availability, performance, observability, incident response, and automation. You will partner with cross-functional teams—including developers, operations, and infrastructure—to ensure that our database services run smoothly and efficiently. If you are passionate about operational excellence and continuous improvement, we want to hear from you.

Responsibilities

  • Responsible to design, implement, and maintain high-availability, high throughput, data and compute intensive, critical database systems running PostgreSQL which supports a growing 24x7 SaaS platform.
  • Define and improve database service reliability through monitoring/alerting, SLO-oriented metrics, and operational readiness.
  • Participate in and help drive incident response, root cause analysis, and post-incident corrective actions for database-related production events.
  • Partner with other technical leaders to ensure all newly introduced systems are supportable and maintainable by both development and operations.
  • Provides escalated technical guidance and support to other technology teams throughout the organization
  • Provides on-call coverage for production support and other duties as required.
  • Accountable for complying with HIPAA security policies within the database platform
  • Ensure all solutions and operational activities adhere to the security and operating policies established by the organization
  • Own and continuously improve our Datadog database observability by building actionable dashboards, alerts, and service-level views using an observability stack (e.g., Prometheus, Grafana, New Relic, or equivalent). Familiarity with PGAnalyze or Percona a plus.
  • Automate system maintenance tasks using Bash, Powershell, Python, or Ansible. Manage infrastructure as code (IaC) writing Ansible playbooks. Some exposure to Terraform a plus.
  • Understand and maintain various PostgreSQL ecosystem components like: PgBouncer, PgBackrest, HaProxy, RepMgr a plus

Requirements

  • BS degree in Information Systems, Engineering, or equivalent experience
  • 7-10+ years of Engineering experience with Database Engineering, Systems Engineering, DevOps and/or SRE
  • Experience in cloud-based compute, storage, and containerization solutions (Azure & Kubernetes preferred)
  • Expertise with an observability/monitoring platform (e.g., Prometheus/Grafana, New Relic, Datadog, or equivalent); Datadog experience is a plus.
  • Experience working in Agile/DevOps environments and operating production services with ITSM practices where applicable
  • Excellent communication and interpersonal skills.

Nice-to-haves

  • Proficiency with operating PostgreSQL in a Linux environment is a plus
  • Experience with writing & designing ETL pipelines using Python a plus
  • Familiarity with PGAnalyze or Percona a plus.
  • Some exposure to Terraform a plus.
  • Understand and maintain various PostgreSQL ecosystem components like: PgBouncer, PgBackrest, HaProxy, RepMgr a plus

Benefits

  • Competitive salary - $120,000-160,000
  • Employer sponsored health, dental, vision, life, and disability insurance
  • Retirement plan with company contribution
  • Annual company profit sharing
  • Personal development/training budget
  • Open, collaborative work environment
  • Extensive 2-week onboarding plan
  • Comprehensive mentorship program

Apply tot his job Apply To this Job

You might like

Secure Site Reliability Engineer

Work from home Full-time role

Cloud Site Reliability Engineer

Work from home Full-time role

Senior Site Reliability Engineer -AI Infrastructure Operations

Work from home Full-time role

SRE – OPENSTACK / PRIVATE CLOUD OPERATIONS

Work from home Full-time role

Site Reliability Engineer Intern

Work from home Full-time role

IT Site Reliability Engineer

Work from home Full-time role

Site Reliability Engineer 3

Work from home Full-time role

Sr. Site Reliability Engineer

Work from home Full-time role

Senior SRE - INTL MX

Work from home Full-time role

Junior Site Reliability Engineer

Work from home Full-time role

Experienced Customer Service Representative – Participant Svcs Rep III (Remote)

Work from home Full-time role

Experienced Customer Service Representative – Delivering Exceptional Arenaflex Client Experiences

Work from home Full-time role

Product Manager, Geo Expansion

Work from home Full-time role

Teletherapy Speech Therapy in NE

Work from home Full-time role

Experienced Live Chat Agent – Day & Night Shift at arenaflex

Work from home Full-time role

Associate Project Leader - Vaccines & Infectious Diseases

Work from home Full-time role

Experienced Student Customer Service Coordinator – Recreational Facilities and Events

Work from home Full-time role

Web Designer / Developer (USA BASED ONLY)

Work from home Full-time role

Experienced Remote Data Entry Specialist – Work From Home Opportunity with arenaflex

Work from home Full-time role

Kundensupport-Mitarbeiter (m/w/d) im E-Commerce (Teilzeit, Remote)

Work from home Full-time role