what if the AI didn't just learn to do the task better — what if it rewrote the part of itself that decides how to learn, and then rewrote that too. hyperagents are self-referential programs where the task agent (does the thing) and the meta agent (improves the thing) are folded into one editable blob of code. crucially, the meta-level improvement process is itself editable. so the system isn't just learning — it's learning how to learn how to learn. and somehow, it actually works.
step through the cycle to see how a hyperagent improves itself.
the task agent attempts the problem and produces an output. its performance is scored and logged for the meta agent to read.
drag the slider to see how performance diverges as improvement cycles stack up.
illustrative curves based on paper trends — not exact reported numbers
[REVIEWER 2 DEMANDS YOU ANSWER THESE]
what makes a hyperagent fundamentally different from a regular self-improving AI?
the paper drops an assumption made by the original Darwin Gödel Machine. which one?
which of these emergent behaviors appeared in DGM-Hyperagents without being explicitly programmed?
what happens to meta-level improvements across different domains and runs?