See all roles

Python Engineer, AI

Work from home Full-time role Hiring
Software Engineer, AI

Train large-language models (LLMs) to write production-grade code:

  • Compare & rank multiple code snippets, explaining which is best and why.

  • Repair & refactor AI-generated code for correctness, efficiency, and style.

  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly. End result: the model learns to propose, critique, and improve code the way you do.

RLHF in one line

Generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship.

What is Needed

  • 4+ years of professional software-engineering experience using Python and Constraint.

  • Extreme attention to detail and excellent writing skills—most of the job is explaining why one solution is better than another. This requirement cannot be overstated!

  • You actually enjoy reading documentation and specs.

  • Proven ability to thrive in a fully asynchronous, low-oversight remote environment.

  • Strong code-review instincts: can spot logic errors, performance traps, and security issues quickly.

What is Not Needed
  • No prior RLHF or AI-training experience required.

  • You don’t need deep machine-learning knowledge—if you can review code and explain your reasoning, we’ll teach you the RLHF bits.

Logistics

  • Location: Fully remote (work from anywhere).

  • Hours: Minimum 15 hrs/week with the ability to work up to 40 hours per week

  • Engagement: 1099 contract

Straightforward impact, zero fluff. If this fits your profile, apply here.

Apply to this Job

You might like

Product Engineer

Work from home Full-time role

Senior FullStack Engineer (Creator Team)

Work from home Full-time role

Senior Account Executive, Data Solutions

Work from home Full-time role

Solutions Engineer

Work from home Full-time role

Senior Software Engineer (Platform)

Work from home Full-time role

Account Manager, Mid-Market

Work from home Full-time role

Join Our Talent Pool

Work from home Full-time role

(Plugins) Senior Software Engineer

Work from home Full-time role

Strategic Outreach Specialist

Work from home Full-time role

Associate, Investment - North America

Work from home Full-time role

Data Entry Specialist

Work from home Full-time role

Experienced Full Stack Data Entry Specialist – Remote Opportunity at arenaflex

Work from home Full-time role

Entry Level Sales Rep, Work from Home Remotely

Work from home Full-time role

Virtual Assistant: Join a Small B2B Software Company

Work from home Full-time role

senior administrative assistant, MidAmerica (Remote, U.S.)

Work from home Full-time role

Associate Biostatistician- fully remote east coast

Work from home Full-time role

Junior Program Analyst with Security Clearance

Work from home Full-time role

Experienced Remote Customer Service Representative - Earn $19/hour or More with blithequark

Work from home Full-time role

Experienced Call Center Agent/Data Entry Clerk – Remote Opportunity with arenaflex

Work from home Full-time role

Assistant/Associate Professor, Counseling and Human Development - Walsh University

Work from home Full-time role