Skip to content

Projects

The projects are the runnable heart of the course. Each one changes one harness lever, asks you to inspect the trace, and leaves behind a policy or artifact you can carry forward.

Project Path

ProjectFocus
P01: Agent TraceRead the agent loop from its event trace
P02: Model RoutingRight-size the model to the task
P03: RetrievalDecide when search tools earn their slot
P04: DecompositionSplit large work into smaller runs
P05: MemoryAdd durable context to cut re-discovery
P06: SafetyBound the agent's blast radius and sandbox it
P07: Critic CapstoneAdd evaluation and compose the full harness
P08: Dynamic WorkflowsMove orchestration into reusable workflows
P09: Model Routing BenchmarkSelect the most appropriate model
P10: Indexing Agent HistoryAdd a database/index over conversation traces
P11: SubagentsWhen do subagents make sense?

How To Work

  1. Open the project page.
  2. Open the starter/ files in GitHub.
  3. Predict the trace.
  4. Run the smallest experiment.
  5. Compare with solution/.
  6. Save the result table or policy.

The full project READMEs live in the repo: projects/.

Built as a friendly front door for the runnable OpenHands harness lab.