reputed company Data Platform Engineer - 11623
reputed company makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. reputed company AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.

Why join reputed company?
🔹 Pioneering Technology: At reputed company, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.
🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to reputed company.
🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.

Learn more on the Life at reputed company blog and hear from our employees about their experiences working at reputed company.

The Impact of a reputed company Data Platform Engineer at reputed company:
If you are passionate about new technologies, have a strong technical background, and are looking for an environment where you can continuously expand your knowledge, you are the right fit for this role. At reputed company, the “reputed company team” is looking for an engineer who is driven to constantly question the status quo and who brings a mixture of system design, code development, deployment, automation, and networking skills, along with experience in managing big data, Machine Learning, and GenAI platforms.

What You'll Do:
- Manage end-to-end data pipelines (ETL jobs) within agreed SLAs.
- Manage AWS core and big data services (S3, IAM, EMR, Redshift, etc.).
- Run applications in containers.
- Own the Day 2 operational lifecycle for ML and GenAI infrastructure.
This includes designing, deploying, and maintaining high-availability production LLM serving platforms and implementing automated scaling, self-healing, and infrastructure-as-code patterns. Focus on proactive reliability, model performance observability, and cloud cost optimization for high-compute AI workloads.
- Collaborate closely with our product development and engineering teams to create AI-driven features.
- Drive operational consistency by automating platform maintenance, standardizing infrastructure configurations (IaC), and implementing robust release management processes to minimize inconsistencies across multiple environments.
- Manage AWS infrastructure using code (Terraform, Chef, etc.).
- Administer applications running on the Linux operating system.
- Implement application and system monitoring for observability.
- Provide application and infrastructure support for ETL jobs and data pipelines, including participating in an on-call rotation for after-hours emergencies.
- Collaborate with platform and Dev teams to plan and execute product releases and maintain Linux and Kubernetes clusters.
- Participate in design reviews, code reviews, and incident troubleshooting.
- Operate in a high-pressure environment and troubleshoot issues quickly while successfully handling multiple priorities.
- Record, write, and review RCAs.

What You Will Bring to reputed company:
- Bachelor's Degree and 8+ years of experience managing Big Data technologies and data pipelines.
- Sound knowledge of and experience in Linux administration and troubleshooting.
- 5+ years of experience managing cloud infrastructure and platforms, such as AWS and Azure.
- Familiarity with the engineering landscape in this space and a strong interest in AI and related technologies.
- Strong expertise in MLOps and production-grade LLM operations.
Proven track record in managing high-availability model inference clusters, automating model lifecycle management, and implementing advanced observability (latency, throughput, and error rate monitoring) specifically for AI workloads.
- Bash or Python scripting experience.
- Experience with containerization and managed Kubernetes services (EKS, Azure AKS).
- Experience with tools like Chef, Ansible, Jenkins, Rundeck, or equivalent.
- Experience with version control systems such as Git and with working in branching strategies.
- Experience with Infrastructure as Code products like Terraform and Helm charts.
- Good understanding of DNS and load balancer setup and troubleshooting.
- Experience with Big Data platforms and data lakes and with managing Business Intelligence tools (such as Looker).
- Knowledge of Apache Spark architecture and of troubleshooting Java applications.
- Basic understanding of MySQL Server and general database knowledge.
- Excellent written and verbal communication, with a passion for problem solving.
- Confidence in your ability to own and deliver projects and issues to resolution on your own, and the ability to think and act globally.
- Deep experience in Day 2 operations, including automated incident remediation, capacity planning, and managing large-scale production environments with a focus on performance and reliability.

$125,000 - $174,333 a year
The successful candidate’s starting salary will be determined based on permissible, non-discriminatory factors such as skills, experience, and geographic location within the state.

#LI-Remote #LI-TC1

reputed company complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment.
Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees.

Please be advised that inquiries or resumes from recruiters will not be accepted.

By submitting your application, you acknowledge that you have read reputed company’s Privacy Policy and understand that reputed company receives/collects your application, including your personal data, for the purposes of managing reputed company's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.