Back to all roles

AI Research Resident

Remote-first Full-time Now hiring

About Polymath Polymath is an applied research lab focused on advancing long-horizon agent capabilities through reinforcement learning. We design and scale simulation environments where agents learn to operate safely and autonomously. We work with the world’s leading model labs to push the frontier of agent capabilities. Polymath is backed by Base10, Founders Future, Y Combinator, and other incredible investors & angels. We've raised an $8M seed, and are growing out the team. About the role We’re looking for talented researchers currently enrolled in MS / PhD programs to collaborate on a research project focused around frontier benchmarks and environments for long-horizon AI agents. This will require 1) identifying failure modes in frontier models, 2) developing rigorous benchmarks that evaluate how well frontier agents perform on complex, realistic tasks requiring long-horizon reasoning and tool use in dynamic environments, and 3) training autonomous agents that can reason, plan, and act over extended time horizons. We can accommodate full-time or part-time engagements. The goal of the residency is to culminate in a publication, and if there is a mutual fit, transition into a full-time role. If you’re interested in joining Polymath but are not currently a student, please apply to the Member of Technical Staff role. You’ll be a good fit if you: Are currently pursuing an MS or PhD program in Computer Science or a related field Have experience with reinforcement learning, benchmarking frontier models, or model post-training Have experience with systems engineering and can write production-quality code Have a strong track record of publications Have high agency, move quickly, and enjoy working on open-ended research problems Culture Polymath is a team of researchers, engineers, and operators focused on advancing the frontier of safe, superintelligent AI agents. We have a flat organizational structure. We believe that people do their best work when they’re self-motivated and driven by a desire to learn, contribute to the team’s goals, and advance scientific progress. We’re looking for folks who ship fast, set high standards for themselves, and are great team players. Apply To This Job

More remote roles

D365 BC Functional Consultant

Remote-first Full-time

YouTube Video Creator and Manager

Remote-first Full-time

Creative Strategist

Remote-first Full-time

Founding Growth Product Manager

Remote-first Full-time

Full-Stack Product Designer

Remote-first Full-time

Head of Product

Remote-first Full-time

Senior Frontend Engineer (product-minded)

Remote-first Full-time

Senior Software Engineer

Remote-first Full-time

Software Engineer

Remote-first Full-time

Senior/Mid Frontend Engineer - React

Remote-first Full-time

Remote Sports Events Coordinator

Remote-first Full-time

Experienced Customer Service Representative - Home-Based Opportunity with arenaflex

Remote-first Full-time

Experienced Full Stack Data Entry Specialist – Remote Work Opportunity with arenaflex

Remote-first Full-time

Remote Customer Service Representative – Global Travel Support at arenaflex – $25/Hour

Remote-first Full-time

Customer Support Executive – Remote Live‑Chat Specialist (Entry‑Level, No Calling Required) – $35 per Hour – Join arenaflex’s Growing Support Team

Remote-first Full-time

Experienced Live Chat Customer Support Representative – Part-Time Remote Opportunity at arenaflex

Remote-first Full-time

Healthcare Sales Representative — Commission Only

Remote-first Full-time

Experienced Customer Service Representative – Delivering Exceptional Experiences for arenaflex Clients

Remote-first Full-time

[Entry Level/No Experience] Chewy Data Entry Remote Jobs

Remote-first Full-time

Online Amazon Chat Jobs Work From Home (REMOTE) Part-Time

Remote-first Full-time