See all roles

Python Engineer, AI

Work from home Full-time role Hiring
Software Engineer, AI

Train large-language models (LLMs) to write production-grade code:

  • Compare & rank multiple code snippets, explaining which is best and why.

  • Repair & refactor AI-generated code for correctness, efficiency, and style.

  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly. End result: the model learns to propose, critique, and improve code the way you do.

RLHF in one line

Generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship.

What is Needed

  • 4+ years of professional software-engineering experience using Python and Constraint.

  • Extreme attention to detail and excellent writing skills—most of the job is explaining why one solution is better than another. This requirement cannot be overstated!

  • You actually enjoy reading documentation and specs.

  • Proven ability to thrive in a fully asynchronous, low-oversight remote environment.

  • Strong code-review instincts: can spot logic errors, performance traps, and security issues quickly.

What is Not Needed
  • No prior RLHF or AI-training experience required.

  • You don’t need deep machine-learning knowledge—if you can review code and explain your reasoning, we’ll teach you the RLHF bits.

Logistics

  • Location: Fully remote (work from anywhere).

  • Hours: Minimum 15 hrs/week with the ability to work up to 40 hours per week

  • Engagement: 1099 contract

Straightforward impact, zero fluff. If this fits your profile, apply here.

Apply to this Job

You might like

Product Engineer

Work from home Full-time role

Senior FullStack Engineer (Creator Team)

Work from home Full-time role

Senior Account Executive, Data Solutions

Work from home Full-time role

Solutions Engineer

Work from home Full-time role

Senior Software Engineer (Platform)

Work from home Full-time role

Account Manager, Mid-Market

Work from home Full-time role

Join Our Talent Pool

Work from home Full-time role

(Plugins) Senior Software Engineer

Work from home Full-time role

Strategic Outreach Specialist

Work from home Full-time role

Associate, Investment - North America

Work from home Full-time role

Reimbursement Specialist- Remote- CSR

Work from home Full-time role

Medical Transcriptionist - Remote - Pathology Reports Transcription Specialist with Competitive Salary and Comprehensive Benefits

Work from home Full-time role

Guitar instructor needed in New York City, NY

Work from home Full-time role

(Work From Home Jobs Part Time) Walmart Work From Home jobs

Work from home Full-time role

Experienced Data Entry Specialist – Remote Opportunity for Detail-Oriented Individuals to Join arenaflex Team

Work from home Full-time role

Remote Customer Experience Specialist – Premium Technology Support (Work From Home) at arenaflex

Work from home Full-time role

Ambulatory LPN - Rheumatology Clinic - FT

Work from home Full-time role

PEZA Compliance Officer (6 Months Contract)

Work from home Full-time role

Virtual Mental Health Group Facilitator

Work from home Full-time role

Insurance Agent Apprenticeship

Work from home Full-time role