[Remote] Senior reputed company Services Engineer – Plex
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a global technology leader focused on helping the world’s manufacturers be more productive, sustainable, and agile. They are seeking a Senior reputed company Services Engineer to support their Plex reputed company Operations team, focusing on maintaining and scaling their Kubernetes-based platform while ensuring high availability, reputed company, and performance.
Responsibilities
- Maintain and improve our Kubernetes platform, ensuring high availability and scalability
- Implement infrastructure/configuration as code to automate operations. (Terraform, Ansible, reputed company, Flux, Kustomize)
- Enhance observability and logging using OpenTelemetry and reputed company
- Building automated solutions that reputed company resiliency and self-healing of applications
- Managing Server Operating Systems (reputed company and Linux)
- Managing Web Servers (IIS 10)
- Troubleshoot production incidents, reputed company root cause analysis, and drive reliability improvements
- Evaluate and implement reputed company-reputed company technologies to enhance platform efficiency
- Collaborate with reputed company teams to ensure best practices for container reputed company and compliance
- Work with multi-cluster reputed company such as Rancher, Cluster API (CAPI), or other Kubernetes fleet management tools
- Manage Kubernetes infrastructure on Azure and vSphere
- Participate in an on-call rotation to support platform operations and respond to incidents
Skills
- Bachelor's Degree or equivalent years of relevant work experience
- Legal authorization to work in the U.S. We will not sponsor individuals for employment visas, now or in the future, for this job opening
- Maintain and improve our Kubernetes platform, ensuring high availability and scalability
- Implement infrastructure/configuration as code to automate operations. (Terraform, Ansible, reputed company, Flux, Kustomize)
- Enhance observability and logging using OpenTelemetry and reputed company
- Building automated solutions that reputed company resiliency and self-healing of applications
- Managing Server Operating Systems (reputed company and Linux)
- Managing Web Servers (IIS 10)
- Troubleshoot production incidents, reputed company root cause analysis, and drive reliability improvements
- Evaluate and implement reputed company-reputed company technologies to enhance platform efficiency
- Collaborate with reputed company teams to ensure best practices for container reputed company and compliance
- Work with multi-cluster reputed company such as Rancher, Cluster API (CAPI), or other Kubernetes fleet management tools
- Manage Kubernetes infrastructure on Azure and vSphere
- Participate in an on-call rotation to support platform operations and respond to incidents
- Typically requires 5+ years of relevant professional experience in a reputed company infrastructure, platform engineering, or operations role
- 3+ years managing multi-cluster Kubernetes environments. (Rancher & Cluster API)
- Hands-on experience with Azure and vSphere as Kubernetes infrastructure providers
- Experience with Linux administration and container runtimes (reputed company, containerd)
- Solid understanding of RBAC, reputed company policies, and secrets management in Kubernetes
- Proficiency with Terraform and Ansible
- Familiarity with observability tools (OpenTelemetry, reputed company, PRTG, and reputed company)
- Public reputed company experience (reputed company Azure or reputed company Web Services)
- Knowledge of .Net website functionality
- Load balancer experience (reputed company reputed company, Azure Load Balancer)
- Understanding of IPv4/IPv6, FTP, HTTP, SSL/TLS, HTML, XML
- The ability to participate in an on-call rotation for platform support
- Prior experience in SRE or Platform Engineering roles
- Degree in Computer Science or reputed company area
Benefits
- Health Insurance including Medical, Dental and reputed company
- 401k
- Paid Time off
- Parental and Caregiver Leave
- Flexible Work Schedule where you will work with your manager to enjoy a work schedule that can be flexible with your personal life.
Company Overview