[Remote] Research Scientist, Data

Work from home Full-time role Hiring

Note: This is a remote position open to candidates in the USA. The company is building creative infrastructure around real-time, multimodal AI and intelligent agentic platforms. It is looking for a staff-level or more senior Research Engineer, Data to architect and scale data engineering systems supporting model training for advanced multimodal AI models.

Responsibilities

  • Take ownership of large-scale data pipeline architecture and implementation to support model training and research workflows for text, image, audio, and video datasets
  • Partner with research and engineering teams to curate, clean, and manage diverse, sensory-rich datasets for pre-training and mid-training of multimodal models
  • Develop strategies and tools for scalable data ingestion, labeling, filtering, augmentation, and storage
  • Ensure data quality, reliability, and compliance, including managing privacy and ethical considerations throughout the data lifecycle
  • Optimize data processing, transformation, and delivery for large-scale distributed training pipelines
  • Prototype and productionize new methods for dataset creation, management, and improvement in response to researcher needs
  • Contribute to the integration of research-driven data advancements into production-grade systems
  • Stay informed on emerging data engineering and ML data management developments, bringing best practices to our systems

Skills

  • 5+ years of experience building and scaling data pipelines for machine learning applications at staff engineer level or above, ideally in research or model training environments
  • Strong background in data engineering and ML data curation for LLMs, VLMs, or other large-scale multimodal models
  • Expertise in distributed data systems (e.g., Spark, Hadoop, Ray, or similar) and efficient large dataset processing/ETL workflows
  • Proven ability to build robust, scalable, and production-grade data infrastructure for ML pipelines
  • Experience developing tools for data labeling, filtering, deduplication, quality assurance, and dataset management
  • Strong programming skills (Python, SQL, PySpark, or similar) and familiarity with cloud data platforms (AWS, GCP, Azure)
  • Knowledge of privacy, compliance, ethics, and best practices in data collection and management
  • Excellent cross-functional collaboration, problem-solving, and communication skills
  • Passion for enabling cutting-edge AI and creative technology through data

Benefits

  • Competitive salary and substantial equity in a high-growth startup
  • Full health benefits, 401k matching, and more
  • Collaborative, mission-driven team environment with major growth opportunities
  • Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

Company Overview

  • The company is an AI platform that lets users create videos from text prompts, with text-to-video, image-to-video, and editing tools. It was founded in 2023 and is headquartered in Palo Alto, California, USA, with a workforce of 2-10 employees. Its website is https://reputed company.art.

Company H1B Sponsorship

  • The company has a track record of offering H1B sponsorships, with 9 in 2025. Please note that this does not guarantee sponsorship for this specific role.