Senior Site Reliability Engineer (Linux, Kubernetes, Go & Python)

Work from home, full-time role

Red Hat is seeking a Senior Site Reliability Engineer (SRE) to develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hat’s enterprise Kubernetes distribution. As an SRE you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through automation. On the SRE team, you will have the opportunity to influence the complex challenges of scale which are unique to Red Hat managed cloud services, while using your skills in coding, operations, and large-scale distributed system design.

What you will do:

The day-to-day responsibilities of an SRE involve working with live systems and coding automation. As an SRE you will be expected to:

Contribute code to increase the scalability and reliability of the service
Contribute software tests and participate in peer review to increase the quality of our codebase
Help and develop peers’ capabilities through knowledge sharing, mentoring, and collaboration
Participate in a regular on-call schedule, including occasional paid weekends and holidays
Practice sustainable incident response and blameless postmortems
Resolve customer issues escalated from the Red Hat Global Support team
Work within a small agile team to develop and improve SRE software, support your peers, plan and self-improve
Collaborate with cross-functional teams to identify opportunities for AI integration within the software development lifecycle, driving continuous improvement and innovation in engineering practices; share use cases for successful experiments with stakeholders for broader use.

What you will bring:

A bachelor's degree in Computer Science or a related technical field involving software or systems engineering is required. However, hands-on experience that demonstrates your ability and interest in Site Reliability Engineering is valuable to us, and may be considered in lieu of degree requirements.
3+ years of experience programming with at least one object-oriented language; both Golang and Python are required
5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure
1+ year(s) of experience with Kubernetes is a MUST
3+ years of experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef
2+ years of experience delivering a hosted service
3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus
1+ year(s) of experience with Docker-based containers is a plus
Demonstrated ability to quickly and accurately troubleshoot system issues
Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
Solid communication skills and experience working directly with and presenting to customers
