Continual Learning

Learning from deployment — the way a human learns on the job — rather than arriving fully trained. Ilya Sutskever uses it to reframe what superintelligence is: not a finished, omniscient artefact, but a fast learner that keeps learning after release.

AGI and pre-training ‘overshot’

Sutskever argues two words distorted the field’s thinking. AGI arose as a reaction to ‘narrow AI’ (the chess engine that beats Kasparov but can do nothing else), so it came to mean a system that can already do everything. Pre-training reinforced the expectation, because more of it improved a model ‘at everything, more or less uniformly’. Together they overshot the target: ‘a human being is not an AGI.’ We have a foundation of skills plus a great deal we don’t know — and we rely on continual learning to fill the rest.

The superintelligent 15-year-old

So the right target is not a mind that already knows every job, but one that can learn any job: ‘a superintelligent 15-year-old that’s very eager to go… a great student… go and be a programmer, go and be a doctor, go and learn.’ Deployment becomes a trial-and-error learning period — ‘a process, as opposed to you dropping the finished thing.’ Even a straight-shot superintelligence would be released gradually.

Why this could compound into superintelligence

Dwarkesh Patel draws out the implication and Sutskever accepts it: take one model that learns as efficiently as a human, run instances across the economy each learning a different job, and amalgamate their learnings — something humans cannot do, because we can’t merge minds. Such a system becomes functionally superintelligent even without any recursive self-improvement in software, and creates strong economic pressure to deploy it. Sutskever expects rapid (if hard-to-rate) economic growth, faster wherever regulation is friendlier. He resists the conclusion that one model would win everything: his intuition is that competition drives specialisation, leaving many narrow superintelligences occupying different niches.

Continual Learning

AGI and pre-training ‘overshot’

The superintelligent 15-year-old

Why this could compound into superintelligence

See also