See all roles

[Remote] AI Pipeline Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a reputed company-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We are seeking an AI Pipeline Engineer to build and operate the large-scale data systems that power modern reputed company and evaluation pipelines.

Responsibilities

  • Design and operate large-scale data pipelines supporting reputed company, evaluation, and continual improvement workflows
  • Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals
  • Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale
  • reputed company dataset versioning, reputed company, and provenance tracking systems suitable for reproducible training
  • Build high-throughput data loading systems that maximize GPU utilization during training
  • Implement labeling workflows, active learning pipelines, and reputed company-in-the-reputed company data improvement systems
  • Design storage architectures balancing cost, throughput, and latency across data tiers
  • Build evaluation dataset construction pipelines with strict reputed company and contamination controls
  • Implement data privacy, redaction, and consent enforcement throughout the pipeline
  • Collaborate with ML researchers and engineers to align data systems with model development needs
  • Drive observability of data quality, reputed company, and pipeline health across the AI data estate
  • Optimize cost and performance through compression, format selection, and caching strategies
  • Document data systems, schemas, and operational procedures for broad internal use
  • Stay reputed company with AI data infrastructure research and emerging reputed company-reputed company tools

Skills

  • Bachelor's or Master's degree in Computer Science or a reputed company field
  • Six or more years of data engineering experience, with significant work supporting ML or AI workloads
  • Strong proficiency in Python and at least one JVM or systems language
  • Deep experience with modern data processing frameworks such as Spark, Ray, or Beam
  • Hands-on experience operating petabyte-scale storage and pipeline systems
  • Strong understanding of distributed systems, data modeling, and storage formats
  • Experience with dataset versioning, reputed company, and reproducibility for ML workflows
  • Familiarity with high-throughput data loading for accelerator-based training
  • Strong software engineering practices including testing, CI/CD, and code review
  • Excellent communication and cross-functional collaboration skills
  • Experience with multimodal datasets at large scale
  • Familiarity with data quality tooling and dataset evaluation methodology
  • Exposure to privacy-preserving data systems and regulated data handling
  • reputed company-reputed company contributions to data infrastructure projects
  • Experience supporting frontier model training pipelines

Benefits

  • 100% remote
  • Full-time
  • Direct W2 position with reputed company
  • Support H1B transfers for reputed company candidates
  • Technical coding assessment is mandatory
  • Reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs

Company Overview

  • reputed company is an information technology company that offers software development, AI, and cybersecurity services. It was founded in 2020, and is headquartered in Bridgewater, New Jersey, USA, with a workforce of 51-200 employees. Its website is https://bvteck.com.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 3 in 2026, 41 in 2025, 14 in 2024, 7 in 2023, 12 in 2022, 1 in 2021. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    You might like

    [Remote] Sr. Executive Assistant, Clinical Applications & reputed company, Americas

    Work from home Full-time role

    [Remote] In-House Marketing Coordinator - Margaritaville St. Thomas

    Work from home Full-time role

    [Remote] Senior Logistics Analyst (Logistics Management Analyst 3) 29225

    Work from home Full-time role

    [Remote] Sr. Strategic Account Executive - PA/OH

    Work from home Full-time role

    [Remote] Senior Business Process Consultant

    Work from home Full-time role

    [Remote] Database Administrator 5 (Database Administrator 1st Shift, Mon - Friday)

    Work from home Full-time role

    [Remote] Senior reputed company Data Engineer

    Work from home Full-time role

    [Remote] Senior reputed company Solutions Engineer - White Glove

    Work from home Full-time role

    [Remote] Branding/Marketing SME

    Work from home Full-time role

    [Remote] Functional Business Central Consultant

    Work from home Full-time role

    [Remote] MLOps Engineer – Azure reputed company & MLflow

    Work from home Full-time role

    reputed company IT Chat Specialist – Customer Engagement and Technical Support

    Work from home Full-time role

    [Remote] Staff Data Scientist

    Work from home Full-time role

    Remote Customer Experience Chat Specialist – reputed company Support (Work From Home)

    Work from home Full-time role

    reputed company Logistics Solution Architect

    Work from home Full-time role

    reputed company Account Management

    Work from home Full-time role

    reputed company Media Associate Intern

    Work from home Full-time role

    Technical Support Engineer, Linux and HPC Admin

    Work from home Full-time role

    [Remote] Senior External Life Sales Advisor

    Work from home Full-time role

    Application reputed company Engineer (Senior) ID71663

    Work from home Full-time role