See all roles

Site Reliability Engineer L4/L5 – Live Cloud Platform SRE

Work from home Full-time role Hiring

Job Description:

  • Drive continual improvement in observability, monitoring, and scalability with the primary goal to solve the thundering herd problem with cloud traffic (API gateway, IPC between microservices) for live streaming.
  • Implement, automate, execute, and analyze the results from a broad range of live streaming delivery focused functional, performance, resilience, and fault injection testing.
  • Write and review code, develop documentation and capacity plans, and debug the hardest problems on some of the largest and most complex systems in the world.
  • Coordination, collaboration, and partnership across multiple stakeholders for the smooth execution of live-streaming events.
  • Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule.

Requirements:

  • 5+ years of service reliability/operational experience running large-scale, high-performance systems & internet services with a focus on traffic at scale.
  • Knowledge of and proven experience with L4 Load Balancer, HTTP cache, and reverse proxy technologies.
  • Expert-level knowledge of Unix or Linux systems and TCP/IP network fundamentals.
  • Proficient understanding of networking principles, transport, and application protocols, especially DNS, TLS, and HTTP(s) etc.
  • Proficient in a programming language such as Go, Python, Rust etc.
  • Experience with using real-time and Big Data analytic processing technologies (Kafka, time series database and Presto/Trino, Spark SQL, etc)
  • Ability to work in a highly collaborative environment and to communicate effectively with internal and external partners.
  • Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience).

Benefits:

  • Health Plans
  • Mental Health support
  • 401(k) Retirement Plan with employer match
  • Stock Option Program
  • Disability Programs
  • Health Savings and Flexible Spending Accounts
  • Family-forming benefits
  • Life and Serious Injury Benefits
  • Paid leave of absence programs
  • 35 days annually for paid time off

Apply tot his job Apply To this Job

You might like

Senior Site Reliability Engineer – Compute Platforms

Work from home Full-time role

Network Engineer II (Remote/ Dallas TX)

Work from home Full-time role

Senior Site Reliability Engineer (Linux, Kubernetes, Go & Python)

Work from home Full-time role

Senior Site Reliability Engineer, Platform & Cloud FinOps

Work from home Full-time role

(SME)Senior Kubernetes Architecture Engineer

Work from home Full-time role

Delivery Cloud Network Engineer | Remote

Work from home Full-time role

Network Engineer - Consultant (Senior Cloud Network Engineer )

Work from home Full-time role

Systems Admin - RamQuest Admin (Pittsburgh, PA; Remote)

Work from home Full-time role

Low Latency Network Engineer

Work from home Full-time role

Network Engineer (Microsoft, Broadcom, Dynatrace, Nexthink), Secret, Remote (DC MD VA)

Work from home Full-time role

Partnerships Consultant, Partnerships and Convening Unit, OSE, 11 months, Florence, Italy (Remote) #593035

Work from home Full-time role

Secondary School Teacher. Job in Witney Educati...

Work from home Full-time role

Experienced Social Media Virtual Chat Assistant – Remote Work Opportunity with arenaflex

Work from home Full-time role

Digital Product Analyst (Remote)

Work from home Full-time role

Sr. Commissions & Order Operations Manager | LATAM

Work from home Full-time role

Coder II

Work from home Full-time role

Scrum Master (Remote - United States)

Work from home Full-time role

Job Title:

Work from home Full-time role

Registered Mental Health Counselor Intern (RMHCI) - Remote

Work from home Full-time role

Experienced Remote Data Entry Specialist – Logistics and Supply Chain Data Management

Work from home Full-time role