<aside> 👉

The program runs for 5 weeks. Each week focuses on a different phase of building and testing an alignment method. The goal is to embed values in a system in a way that generalizes and can’t be easily gamed. Participants will form teams during the application process. We recommend teams of 3–5. You can apply solo or with collaborators.

Mentors may support specific teams depending on availability. Teams are expected to coordinate independently and meet regularly. If someone drops out, we’ll help rebalance teams where needed.

</aside>


Week 1: Scoping

In the first week, teams will examine previous attempts within their chosen method. This includes reviewing what has been done before, why those attempts failed or succeeded, and which directions are promising to explore. The goal is to understand what makes this iteration different and to define what would count as success for this experiment. Teams with assigned mentors will coordinate with them throughout the week to stress-test their direction.

By the end of Week 1, the team should have:

<aside> ✍️

Track-specific Examples


Weeks 2–3: Experimentation

Teams will begin implementation, running tests and iterating based on the research they reviewed in Week 1. We will provide TPU credits and mentorship to help teams build their project from the ground up. In general, every team is expected to test whether their method actually moves the needle on alignment.

By the end of Week 3, the team should have:

<aside> ✍️

For the Agent Foundations track

If you’re in this track, you’ll follow one of two subtracks:


Week 4: Testing

Teams will critique their alignment method, attempt to break their own evals, and run tests on larger or more adversarial setups. Mentors and the AI-Plans team will advise teams, drawing on experience from prior alignment evals hackathons.

By the end of Week 4, the team should have:

Week 5: Wrap-up

Teams will write their final summary. This includes the method, evidence, assumptions, failure analysis, and proposed next steps. The summary should stand on its own as a falsifiable alignment contribution. Teams will also prepare their poster and get final feedback.

By the end of Week 5, the team should have:


Final Day Presentation + Job Fair

The program ends with a public poster session and a job fair.

<aside> 🍿

Attendance and Pricing

All funds go toward program costs and participant stipends.