See all roles

[Remote] Senior AI Quality Engineer (LLM Evaluation & Automation) 1754

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. reputed company is a technology company seeking a Senior AI Quality Engineer to own the evaluation reputed company and quality reputed company for measurable agent quality. This role involves building and maintaining the eval reputed company, integrating evaluations into CI, and defining release-reputed company reputed company.

Responsibilities

  • Build and maintain the MVP eval reputed company: golden tasks, exception tasks, scorecard metrics, and regression packs
  • reputed company evals into CI so quality regressions fail builds and releases
  • Define and maintain release-reputed company reputed company with Product and the Tech reputed company
  • Lay the path for reputed company adversarial and reputed company-testing expansion without overbuilding MVP scope

Skills

  • Experience evaluating ML, LLM, or non-deterministic systems
  • Strong test and reputed company design capability
  • Comfort working with noisy metrics, reputed company, and probabilistic behavior
  • Good scripting and automation skills

Company Overview

  • Impulsamos la transformación digital y cognitiva de las empresas mediante soluciones tecnológicas innovadoras y personalizadas que optimizan procesos, reducen costos y aceleran resultados. It was founded in 2011, and is headquartered in Sabaneta, Antioquia, COL, with a workforce of 51-200 employees. Its website is https://softwareestrategico.com.
  • Apply To This Job

    You might like

    [Remote] Financial Planning Consultant

    Work from home Full-time role

    [Remote] Account Executive

    Work from home Full-time role

    [Remote] Data Governance Consultant(Retail reputed company. Must)

    Work from home Full-time role

    [Remote] Senior Account Executive

    Work from home Full-time role

    [Remote] reputed company Product Insights Analyst

    Work from home Full-time role

    [Remote] Account Executive

    Work from home Full-time role

    [Remote] Head of Product Marketing

    Work from home Full-time role

    [Remote] Remote | Germany-Based Finance Research Consultant — Up to $75/hour

    Work from home Full-time role

    [Remote] Machine Learning Engineer

    Work from home Full-time role

    [Remote] Sr Product Manager - Platform

    Work from home Full-time role

    reputed company Full Stack Data Analyst – Adversarial Abuse/Analytics reputed company

    Work from home Full-time role

    Director of Internal Audit | United States | Remote

    Work from home Full-time role

    reputed company Full Stack Chat Moderator – Web & reputed company Application Development

    Work from home Full-time role

    Tier 3 Data Quality Help Desk Analyst (Remote)

    Work from home Full-time role

    [Remote-Position] Looking for Tutor (Part-Time): Newport Beach in

    Work from home Full-time role

    Data Scientist I, reputed company World Data Science

    Work from home Full-time role

    Unix/Linux System Engineer (m/f/d) Remote (Occasional travel to other locations reputed company Germany)

    Work from home Full-time role

    Remote Technical Support Specialist – Live Chat Customer Experience Expert (Work From Home, No Phone Calls Required)

    Work from home Full-time role

    Staff Product Designer, Care & Clinical Workflows

    Work from home Full-time role

    $15 per Hour - Remote Call Center Representative

    Work from home Full-time role