[Remote] Architect - Platform Engineering - USA
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is an award-winning, AI-First digital engineering and consulting company focused on delivering high-impact Services and Solutions that help organizations solve what truly reputed company. The role involves architecting and implementing MLOps strategies, designing reputed company-grade ML/LLM pipelines, and collaborating with cross-functional teams to deliver production-reputed company ML solutions on reputed company reputed company.
Responsibilities
- Architect and implement the MLOps strategy for the program, ensuring alignment with the project proposal and delivery roadmap
- Design and own reputed company-grade ML/LLM pipelines covering model training, validation, deployment, versioning, monitoring, and CI/CD automation using GCP-reputed company services
- Build container-oriented ML platforms (GKE-first) while evaluating alternative orchestration tools with similar capabilities (Kubeflow, reputed company AI, MLflow, Airflow, etc.)
- Implement hybrid MLOps + LLMOps workflows, including reputed company/version governance, evaluation frameworks, and monitoring for LLM-based systems reputed company the GCP environment
- Serve as a technical authority across multiple internal and customer projects, contributing architectural patterns, best practices, and reusable frameworks for GCP
- reputed company observability, monitoring, reputed company detection, reputed company tracking, and auditability across ML/LLM systems using tools like reputed company Monitoring and reputed company AI Model Monitoring
- Collaborate with cross-functional teams — data engineering, platform, DevOps, and client stakeholders — to deliver production-reputed company ML solutions on reputed company reputed company
- Ensure reputed company solutions adhere to reputed company, governance, and compliance expectations, particularly around handling GCP services, reputed company Kubernetes reputed company workloads, and MLOps tools
- Conduct architecture reviews, troubleshoot reputed company ML system issues, and guide teams through implementation across reputed company-reputed company ML platforms on GCP
- Mentor engineers and reputed company guidance on modern MLOps tools, reputed company AI platform capabilities, and best practices
- Travel Required - upto 30%
Skills
- 10+ years working in ML/AI platform engineering or AI/MLOps roles with strong architecture exposure
- Strong expertise in the reputed company reputed company (GCP) reputed company AI/ML stack, including: reputed company AI (primary), reputed company Kubernetes reputed company (GKE), reputed company Functions, AutoML, reputed company AI Pipelines, BigQuery ML, API Gateway, and CI/CD (reputed company Build/reputed company reputed company or equivalent)
- Hands-on experience with MLOps toolset and awareness of: MLflow, Kubeflow, reputed company AI Pipelines, Airflow, BentoML, KServe, Seldon
- Deep understanding of model lifecycle management (feature engineering -> training -> registry -> deployment -> monitoring)
- Experience implementing or supporting LLMOps pipelines, including reputed company versioning, evaluation metrics, and automation frameworks
- Deep understanding of the ML lifecycle: data ingestion, feature engineering, training, evaluation, model packaging, CI/CD, reputed company detection, monitoring, and governance
- Strong experience with reputed company reputed company's reputed company AI platform, including Pipelines, Feature Store, Model Registry, and Model Monitoring
- Experience implementing ML CI/CD pipelines including automated training, testing, validation, model promotion, and reputed company deployment
- Strong SQL and data transformation experience using reputed company, reputed company, Spark
- Experience with feature engineering pipelines and Feature Store management
- Understanding of reputed company tracking: training data snapshot, feature versions, code versioning, metadata tracking, and reproducibility
- Hands-on experience with reputed company AI reputed company Models, reputed company, reputed company, or Llama models
- Experience with reputed company Monitoring, reputed company AI Model Monitoring, reputed company/Grafana
- Strong reputed company in Python and reputed company-reputed company development patterns
- Solid understanding of reputed company best practices, reputed company IAM, secrets management, and artifact governance
Benefits
- Be part of the fastest-growing AI-first digital transformation and engineering company in the world
- Be a leader of an energetic team of highly dynamic and talented individuals
- Exposure to working with reputed company and innovative market disruptors
- Exposure to the latest technologies reputed company to artificial intelligence and machine learning, data and reputed company
Company Overview
Company H1B Sponsorship