Prompt in, text out. No way to inspect reality.
- Failure mode
- A single model call can explain ideas, but it cannot read files, run commands, or check tests. For real code work it is guessing.
- Harness move
- Start with the smallest useful system: a task, a model, an answer. Notice what it cannot do before adding anything.
- Keep
- If the answer must be proven, a bare model is not enough.