Back to all roles

Python - Software Engineer, AI

Remote-first Full-time Now hiring

This a Full Remote job, the offer is available from: United States, Turkey, Greece, Estonia, Latvia, North Macedonia, Hungary, Bulgaria, Albania, Poland, Romania, Kosovo, Portugal, Spain, Malta, United Kingdom, Austria, Belgium, Germany, Ireland, France, Slovakia, Czechia, Italy, Montenegro, Bosnia and Herzegovina, Paraguay, Uruguay, Brazil, Dominican Republic, Venezuela, Ecuador, Colombia, Argentina, Chile, Bolivia, Peru, Mexico, Puerto Rico, Canada, Serbia, Arizona (USA), California (USA), Colorado (USA), Delaware (USA), District of Columbia (USA), Florida (USA), Georgia (USA), Idaho (USA), Illinois (USA), Indiana (USA), Louisiana (USA), Nevada (USA), North Carolina (USA), Ohio (USA), Oklahoma (USA), Pennsylvania (USA), Tennessee (USA), Texas (USA), Virginia (USA), Washington (USA) Before applying This role is open to contractors in accepted locations only. Please confirm your country is on the list before applying — we're unable to process applications from unlisted locations. List of accepted countries and locations. For US applicants This is a 1099 independent contractor role. It is not compatible with F-1 OPT, STEM OPT, or any visa status that requires W-2 employment, guaranteed hours, or employer sponsorship. We are unable to provide offer letters or employment verification for this role. What You'll Be Doing Help train large language models (LLMs) to write production-grade code across a wide range of programming languages:

  • Compare and rank multiple code snippets, explaining which is best and why
  • Repair and refactor AI-generated code for correctness, efficiency, and style
  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly

End result: the model learns to propose, critique, and improve code the way you do. RLHF in one line: Generate code → expert engineers rank, edit, and justify → convert that feedback into reward signals → reinforcement learning tunes the model toward code you'd actually ship. What You'll Need

  • 3+ years of professional software engineering experience in Python (constraint programming experience is a bonus, but not required)
  • Strong code-review instincts — you can spot logic errors, performance traps, and security issues quickly
  • Extreme attention to detail and excellent written communication skills. Much of this role involves explaining why one approach is better than another. This cannot be overstated.
  • Comfortable reading documentation and language specs, and able to work well in an asynchronous, low-oversight environment

Identity verification: Applicants will be required to verify their identity and confirm they have valid documentation to work as an independent contractor in their country of residence. What You Don't Need

  • No prior RLHF or AI training experience

Logistics

  • Location: Fully remote — work from anywhere on the accepted locations list
  • Compensation: $30–$70/hr based on location and seniority. Note: the majority of projects run at around $30/hr — higher rates apply to senior profiles and specific project types
  • Hours: Minimum 15 hrs/week, up to 40+ hrs/week available — hours vary by project and are not guaranteed week to week
  • Engagement: 1099 independent contractor
  • Payment: Weekly via PayPal or Stripe

⚠️ Important: Hours are project-dependent and can vary week to week. We recommend keeping other work options open alongside this engagement rather than relying on it as your sole source of income. This offer from "G2i Inc." has been enriched by Jobgether.com and got a 75% flex score. Apply tot his job Apply To this Job

More remote roles

Lead Software Engineer (Clojure, Python, AWS) - Open to remote

Remote-first Full-time

Multi Asset Python Platform Developer PM, Associate

Remote-first Full-time

Python Developer with IBM MQ

Remote-first Full-time

Software Developer (Python, SQL)

Remote-first Full-time

Python Developer / API Engineer (Hybrid)

Remote-first Full-time

On W2 - Python Dev w/ Gunicorn/Unicron WSGI/ASGI, AsynCIO - Remote

Remote-first Full-time

Java Developer Remote

Remote-first Full-time

Junior java software developer/Remote-Junior data AI scientisit

Remote-first Full-time

Pyspark/Java Developer

Remote-first Full-time

Senior Full-Stack Java Developer

Remote-first Full-time

Experienced Data Entry Associate – Entry-Level Opportunity at arenaflex

Remote-first Full-time

V101 - Legal Practice Assistant

Remote-first Full-time

Streamer Relationship Manager (Brazil)

Remote-first Full-time

Part-Time Instructor | Liberal Studies

Remote-first Full-time

Care Advice Line Health Advisor

Remote-first Full-time

Experienced Customer Relations Representative - arenaflex Agent Team Member

Remote-first Full-time

Experienced Customer Service Representative for arenaflex Call Center

Remote-first Full-time

Experienced Data Entry Specialist – Remote Opportunity at arenaflex

Remote-first Full-time

Experienced Part-Time Remote Data Entry Specialist – Focus Group Research Support

Remote-first Full-time

Experienced Customer Service/Data Entry Specialist – Delivering Exceptional Service and Data Accuracy in a Dynamic Remote Environment

Remote-first Full-time