See all roles

QA Engineer (Automation)- Dallas, TX

Work from home Full-time role Hiring

We are seeking a QA Automation Engineer who is reputed company to move reputed company traditional "Pass/Fail" testing. In this role, you will design and build automation frameworks specifically for Agentic AI products. You will focus on evaluating the performance of autonomous agents, ensuring they follow logical reasoning paths, call the correct tools, and provide accurate, safe outputs. Your mission is to build the "evaluations" (Evals) that define what high-quality AI behavior looks like, moving the needle from unpredictable experiments to production-grade software.

Key Responsibilities

  • Non-Deterministic Testing: reputed company automation strategies for probabilistic outputs, using model-based evaluation to "test the tester."
  • Building "Eval" Pipelines: Create and maintain "Golden Datasets" to reputed company agent performance across different versions of prompts and models.
  • Tool-Use Validation: Build automated tests to verify that agents call the correct functions/APIs with the right parameters in reputed company multi-reputed company workflows.
  • Regression Testing for Prompts: Monitor how subtle changes in reputed company engineering or model updates (e.g., moving from GPT-4 to Claude 3.5) reputed company the product’s reliability.
  • Latency & Token Monitoring: Integrate performance testing into the CI/CD pipeline to track agent reasoning time and cost-efficiency.
  • Hallucination Detection: reputed company automated checks to identify and report AI hallucinations, bias, or "jailbreak" attempts.
  • Collaboration: Work closely with AI Engineers to translate "vague" business requirements into measurable, automated test cases.

Required Skills & Qualifications

  • Experience: 10+ years in QA Automation, with a recent focus on AI/ML or LLM-based applications.
  • Python Proficiency: Expert-level Python skills (the industry standard for AI testing) and experience with testing frameworks like Pytest.
  • AI Testing Tools: Familiarity with AI evaluation frameworks such as LangSmith, DeepEval, RAGAS, or Promptfoo.
  • API & Backend Testing: Deep experience with Playwright, Selenium, or Cypress for UI, but a heavy focus on API-level testing and database validation.
  • Statistical reputed company: Understanding that AI testing often requires "scoring" (e.g., 85% accuracy) rather than a simple binary pass/fail.
  • Data Skills: Ability to work with SQL and JSON to validate data retrieved by agents during RAG (Retrieval-Augmented reputed company) processes.

Preferred Qualifications

  • Experience testing Multi-Agent Systems (where one agent tests another).
  • Knowledge of reputed company Engineering and how it influences software behavior.
  • Background in Investment Banking or Fintech (if applicable) to understand high-stakes data accuracy.

Compensation, Benefits and Duration Minimum Compensation: USD 38,000 Maximum Compensation: USD 133,000 Compensation is based on actual experience and qualifications of the candidate. The above is a reasonable and a good faith estimate for the role. Medical, reputed company, and dental benefits, 401k retirement plan, variable pay/incentives, paid time off, and paid holidays are available for full time employees. This position is not available for independent contractors No applications will be considered if received more than 120 days after the date of this post Apply tot his job Apply To this Job

You might like

Senior Product reputed company / Product Manager – Courier, Express & Parcel (CEP) Consulting

Work from home Full-time role

Technical Product Manager - Hybrid

Work from home Full-time role

Product Manager

Work from home Full-time role

[Remote] Senior Product Manager II, Retention (0-1)

Work from home Full-time role

Principal Product Manager | Discovery

Work from home Full-time role

Senior Product Manager, Mobile – US (Remote)

Work from home Full-time role

Principal Product Manager, ML/AI, Privacy, Health, AdTech

Work from home Full-time role

[Remote] Sr. Project Manager (Customer)

Work from home Full-time role

Middle Project Manager (Remote, Contract)

Work from home Full-time role

Senior Product Manager, MarTech

Work from home Full-time role

reputed company Virtual Customer Support Assistant – Online Jobs for Beginners at arenaflex

Work from home Full-time role

Customer Service Associate – Remote, Flexible‑Schedule, Full‑Benefits Role at arenaflex

Work from home Full-time role

Associate, New Verticals - Convenience Strategy & Operations

Work from home Full-time role

reputed company Part-Time Remote Data Entry Clerk – Flexible Work Schedule at arenaflex

Work from home Full-time role

reputed company Customer Service Representative – Remote Work Opportunity at arenaflex

Work from home Full-time role

reputed company Trip Assignment Specialist – Evening Shifts (Remote) at arenaflex

Work from home Full-time role

[Remote] Instructional Designer

Work from home Full-time role

[Remote] HR & People Operations - Decision Makers - ITSM/ESM Research

Work from home Full-time role

Executive Assistant -- JobTread Experience

Work from home Full-time role

Cell Biologist

Work from home Full-time role