See all roles

Italian Audio Evaluations Specialist - Freelance AI Trainer Project

Work from home Full-time role Hiring

Project Overview We are sourcing independent Audio Evaluation Specialists for an AI benchmark evaluation project assessing advanced agentic audio models. As AI models increasingly handle complex workflows in this domain—specifically real-world customer support scenarios like flight bookings, financial services, and telecommunications—their accuracy relies entirely on robust, expert-crafted training data. The objective of this project is to autonomously produce high-quality evaluation tasks through simulated interactions, audit conversational AI outputs, and generate clean, reliable datasets to optimize model performance. Project Deliverables & Scope Operate autonomously to design complex evaluation frameworks and provide structured training data. Expected deliverables include: Role-Play Scenario Execution: Creating and executing complex, role-play-based evaluation scenarios that simulate realistic customer service interactions across travel, finance, and technical support domains. Model Performance Auditing: Evaluating AI model performance across standardized qualitative and quantitative metrics, focusing strictly on task completion accuracy, conversational naturalness, and audio comprehension. Technical Metric Evaluation: Assessing the model's basic computer programming literacy, including its understanding of JSON structures, functions, methods, and ability to reason about structured data within a support context. Representative Dataset Generation: Contributing to the development of diverse, high-quality audio datasets that accurately reflect real customer expectations for clarity, efficiency, and natural conversational flow. Required Expertise To successfully fulfill the deliverables of this project, Contractors must possess deep industry knowledge to craft realistic professional scenarios. Core skillset includes: Demonstrable professional expertise in complex customer support, technical troubleshooting, or conversational AI evaluation. Native or bilingual proficiency in the target language, including fluency across all language skills (reading, listening, writing, and speaking), alongside strong analytical and verbal communication skills to confidently conduct simulated customer support role-plays. Basic computer programming literacy, specifically a comfortable understanding of JSON structures, functions, methods, and simple logic. A meticulous, detail-oriented approach to working with structured prompts, complex evaluation rubrics, and technical guidelines. Required Equipment: Access to a high-quality microphone to ensure clean, reliable audio input during voice evaluations. We offer a pay range of $6-to-$65 per hour, with the exact rate determined after evaluating your experience, expertise, and geographic location. Final offer amounts may vary from the pay range listed above. As a contractor you’ll supply a secure computer and high‑speed internet; company‑sponsored benefits such as health insurance and PTO do not apply. Engagement Type: Freelance / Independent Contractor Workplace Type: Remote Seniority Level: Mid-Senior Level Apply To This Job

You might like

Welsh Audio Specialist - Freelance AI Trainer Project

Work from home Full-time role

Dutch Audio Specialist - Freelance AI Trainer Project

Work from home Full-time role

In-House Clinical Research Associate

Work from home Full-time role

Insurance Producer - Evansville, IN

Work from home Full-time role

Insurance Producer - Greenwood, IN

Work from home Full-time role

Customer Growth Representative (Hybrid)

Work from home Full-time role

Director of Operations, Creative (Internal Agency) (Hybrid: Onsite and Remote)

Work from home Full-time role

Jr. Operations Analyst

Work from home Full-time role

Machine Learning Engineer (Platform)

Work from home Full-time role

Strategic Advisor (Execution)

Work from home Full-time role

Experienced Full Stack Data Entry Specialist – Virtual Assistant for arenaflex

Work from home Full-time role

NEXCOM Computer-Aided Design Draftsperson

Work from home Full-time role

Mechanical Engineer (REMOTE)

Work from home Full-time role

Eating Disorder and OCD Therapist

Work from home Full-time role

Bilingual Mortgage Loan Officer

Work from home Full-time role

Experienced Part-Time Remote Data Entry Specialist – Amazon Operations Support

Work from home Full-time role

Ecommerce - Website Testers Needed (with 20 circle of friends required)

Work from home Full-time role

Strategic Sourcing Specialist

Work from home Full-time role

Senior Software Engineer (JAVA)

Work from home Full-time role

Experienced Full Stack Data Entry Specialist – Virtual Operations and Data Management

Work from home Full-time role