# Compression Progress

Jürgen Schmidhuber, *Formal Theory of Creativity* (1990–2010). Beauty = compressibility given prior knowledge. Interestingness = the *change* in compressibility as you learn. A worthwhile project is one that, on completion, would compress your model of the world.

## Core formula

```
I(D, O(t)) = B(D, O(t)) − B(D, O(t−1))
```

Interestingness = first derivative of beauty over time. Pure noise (no learnable pattern) and fully-known pattern (already compressed) are both boring. Beauty lives between.

## When to use

- Picking a research question
- Selecting between candidate projects ("which would teach me the most?")
- Diagnosing aesthetic dissatisfaction ("this is fine but not interesting")
- Choosing what to read

## Don't use when

- Fast generation (this is reflective, not generative)
- Group decisions where audiences differ (single-observer model)

## Procedure

### For picking a research question
1. List 5–10 things you currently *cannot predict well* in your domain. Be specific: not "the future of AI", but "why X 7B model trained with technique A performs worse than Y 1.3B model with technique B on benchmark Z".
2. For each: would understanding it compress only this fact, or re-organize a broader domain? Prefer the latter.
3. For each: is the answer learnable from where you are? (Not noise; not too far above your prior.)
4. Pick the highest learnable compression-progress potential.

### For evaluating ideas
For each candidate, ask:
- What would I understand differently if this were complete?
- Would that understanding compress this domain or only this idea?
- Is it currently learnable from where I am?

Highest answers across all three = pursue.

### For aesthetic critique
Where is the work entirely predictable? (too known) Entirely unpredictable? (too random) Where does it sit in the learnable-but-not-yet-learned zone? Strong work has more of the third.

## Worked example

User has three options:
- A. Build a habit tracker.
- B. Build a tool that explains why a `git rebase --interactive` produced its conflicts, by reconstructing the commit graph mid-rebase.
- C. Read Lacan.

Analysis:
- A: no compression progress; user already has model of habit trackers. Reject.
- B: high. User doesn't currently have strong model of how rebase constructs intermediate states; building this requires learning that, and the resulting model re-organizes how the user thinks about all VCS internals.
- C: real compression-progress potential, but prior is missing. Long path to get there. Worthwhile if on the prerequisite track; otherwise read Žižek/Bruce Fink first as scaffolding.

Recommend B.

## Anti-slop notes

- "Compression progress" as slogan ≠ doing the analysis. State the actual model gaps you'd close.
- Don't claim every idea has high compression-progress. Most don't. The framework is useful because it discriminates.
- Don't impose this lens on artistic work without acknowledging its limits.

Source: people.idsia.ch/~juergen/creativity.html