About Polymath

Polymath is an applied research lab building the training grounds for the next generation of autonomous agents. We design and scale simulation environments where agents learn to operate over long horizons, use tools, and improve through reinforcement learning. We work with the world's leading model labs to push the frontier of agent capabilities. Polymath is backed by Base10, Founders Future, Y Combinator, and other incredible investors & angels. We've raised an $8M seed and are actively growing the team.

About the role

We're hiring an AI Research Resident to work on some of the most important open problems in long-horizon agent research. This is a flexible role for MS and PhD students or recent graduates, with residencies ranging from 3 to 12 months. We can accommodate full-time or part-time roles. The residency is designed to culminate in a publication, with the goal of producing meaningful research outcomes and, if there is a mutual fit, transitioning into a full-time role.

Focus Areas

Residents will have the opportunity to work on one or more of the following focus areas:

Frontier Agent Benchmarks: Develop rigorous benchmarks that evaluate how well frontier agents perform on complex, realistic tasks requiring long-horizon reasoning, tool use, and adaptation in dynamic environments.
Long-Horizon Agents: Study and train autonomous agents that can reason, plan, and act over extended time horizons across multi-step tasks and tool-rich environments.
Continual Learning: Explore methods that enable agents to improve continuously through experience, retain and build on prior capabilities, and adapt efficiently to new tasks, domains, and environments.

You'll be a good fit if you:

Are currently pursuing an MS or PhD program in a Computer Science or related field
Have experience with post-training and reinforcement learning
Can write production quality code
Have a strong track record of publications
Are excited about long-horizon reinforcement learning and autonomous agents
Have high agency, move quickly, and enjoy working on open-ended research problems

Culture

Polymath is a team of researchers, engineers, and operators focused on advancing the frontier of safe, superintelligent AI agents.
We have a flat organizational structure. We believe that people do their best work when they're self-motivated and driven by a desire to learn, contribute to the team's goals, and advance scientific progress.
We're looking for folks who ship fast, set high standards for themselves, and are great team players.