See all roles

[Remote] Staff Site Reliability Engineer, Core AI Infrastructure

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Coinbase is a leading company focused on increasing economic freedom through innovative financial solutions. They are seeking a Staff Site Reliability Engineer to join their IT Operations team, responsible for ensuring the reliability and automation of critical AI infrastructure while collaborating with various teams to enhance operational workflows.

Responsibilities

  • Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros
  • Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments
  • Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines
  • Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence
  • Develop full-stack applications that power internal AI products and infrastructure with Go or Python

Skills

  • 8+ years of experience automating and supporting cloud infrastructure (AWS) and network environments, with hands-on use of infrastructure-as-code tools (Terraform, Ansible, Chef, Puppet, or Salt)
  • Proven experience deploying, managing, and troubleshooting containerized workloads using Docker and Kubernetes in production environments
  • Proficiency in at least one scripting or programming language (Python, Bash, Ruby, or Go) and version control workflows using Git-based CI/CD pipelines
  • Track record of leading incident response in environments with strict SLAs, including root cause analysis, blameless retros, and measurable reliability improvements
  • Utilizes generative AI responsibly, maintaining human oversight to deliver business-ready outputs and drive measurable improvements in workflow efficiency, cost, and quality
  • Expertise with linux, bash, ruby, python and/or go
  • Expertise automating EC2 or containers deployment with terraform
  • Strong network security fundamentals
  • Experience managing and leveraging log aggregation
  • Experience working in a highly regulated environment
  • Experience in a fast-paced, high-growth company
  • Experience in a Remote-first IT environment

Benefits

  • Equity and bonus eligibility
  • Benefits (medical, dental, vision, 401(k))

Company Overview

  • Coinbase is a crypto exchange and wallet platform that allows merchants and consumers to buy, sell, and store digital currencies. It is a sub-organization of Coinbase. It was founded in 2012, and is headquartered in San Francisco, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.coinbase.com.
  • Apply To This Job

    You might like

    [Remote] Staff Machine Learning Engineer(Platform - Identity)

    Work from home Full-time role

    [Remote] Product Marketing Manager, Prediction Markets

    Work from home Full-time role

    [Remote] Staff Software Engineer, Backend (Consumer - Advanced Trading)

    Work from home Full-time role

    [Remote] Senior Software Engineer, Infra - Compute Platform

    Work from home Full-time role

    [Remote] Accounting Manager, Prime Financing

    Work from home Full-time role

    [Remote] Data Protection Engineer

    Work from home Full-time role

    [Remote] Senior Software Engineer, Backend - Identity

    Work from home Full-time role

    [Remote] Senior Analytics Engineer (Platform - Financial Analytics)

    Work from home Full-time role

    [Remote] Senior Site Reliability Engineer, Workforce Identity

    Work from home Full-time role

    [Remote] Senior Manager, Finance & Strategy

    Work from home Full-time role

    Experienced Full Stack Customer Service Representative – Data Information Processor – US Remote

    Work from home Full-time role

    Experienced Customer Service Representative – Temporary to Permanent, Remote Opportunity in Harrisburg, PA

    Work from home Full-time role

    Experienced Customer Success Specialist – Remote Opportunity for Employee Experience Solutions

    Work from home Full-time role

    Integrated Behavioral Health Clinician

    Work from home Full-time role

    Experienced Remote Data Entry Specialist – Sustainable Energy & Innovation Sector | Work From Home Opportunity with arenaflex

    Work from home Full-time role

    Experienced Customer Service Supervisor – Remote Work Opportunity in the USA at arenaflex

    Work from home Full-time role

    Delta (Remote Jobs) – Customer Service Representative – Apply Now

    Work from home Full-time role

    Experienced Remote Data Entry Specialist – Entry Level Typist Opportunity for Career Growth and Flexible Work Arrangements at blithequark

    Work from home Full-time role

    Director, Ethics and Compliance

    Work from home Full-time role

    Telemedicine Work comp Nurse Practitioner

    Work from home Full-time role