Concept

Theory of Constraints

conceptmanagementproductivitydecision-makingsystems-thinking

Theory of Constraints

A management framework developed by Israeli physicist Eliyahu Goldratt, first published in his 1984 novel The Goal. The central claim: the output of any system is determined by its single least efficient step — the bottleneck. Improving any other step cannot increase total output until the bottleneck is addressed.

Goldratt arrived at this insight while analysing a small chicken-coop assembly business. Work was piling up at one stage; moving one worker there tripled total output. The same logic applies to manufacturing lines, software development pipelines, creative studios, and individual knowledge workers.

The core logic

In any system with sequential steps, effort expended at non-bottleneck stages produces no additional output — it simply creates a larger queue in front of the bottleneck. This is counterintuitive: working harder at a fast step feels productive but changes nothing for total throughput.

The operating framework:

  1. Identify the constraint (the step where work piles up)
  2. Exploit it (maximise the throughput of that step without other changes)
  3. Subordinate everything else to the constraint (don’t let other steps run faster than the bottleneck can absorb)
  4. Elevate the constraint (add resources if still needed after steps 1–3)
  5. Repeat (once you fix the old constraint, a new one becomes the bottleneck)

Goldratt’s summary: “Can I summarise all of Theory of Constraints in one sentence? I think I can do it in one word: focus.”

Organisational applications

Custom gearbox manufacturer: a 15-person design office (the bottleneck) was switching between projects an average of 55 times a day, generating errors and delays. Rule implemented: “stop starting, start finishing” — no new design is begun until the current one is complete. Result: tripled design throughput, total lead time from 12 months to 2 months.

Amazon / Jeff Bezos: Bezos reportedly applied TOC thinking to identify customer friction points as system bottlenecks, restructuring operations around the assumption that the customer experience — not internal efficiency at other stages — was the system’s limiting constraint.

Ed Catmull / Pixar: without using the vocabulary, Catmull’s system embodies TOC logic. Directors spend years in story development (apparently slow) to avoid the bottleneck at production (catastrophically expensive). The popsicle-stick system made the trade-offs between different tasks visible, forcing the organisation to decide which bottleneck mattered most.

Application to individuals

David Epstein draws on Sheila Taormina’s case as the archetype: an elite swimmer who discovered her bottleneck was strength/power, not aerobic capacity — which her coaches had been training. Redirecting effort to the actual constraint produced three seconds of improvement and an Olympic gold medal; subsequent applications to triathlon and modern pentathlon produced three distinct Olympic sport appearances across four Summer Games.

For most knowledge workers, the bottleneck is attention: the scarcest and most fragile resource in the system. Gloria Mark’s 25-year study of workplace attention shows average task-switching every 45 seconds (2022), with measurable physiological stress and productivity costs from every toggle. The TOC fix: batch work by type, eliminate unnecessary switches, protect monotasking blocks. See David Epstein on Constraints, Attention, and Narrative Values.

The personal bottleneck question: in creative work, the constraint is rarely “more research” or “more ideas” — it is typically the narrowing step (writing a one-page architectural outline, setting a word limit, choosing what not to pursue).

Connection to satisficing

Herbert Simon’s satisficing principle (bounded rationality) is a constraint applied to the decision-making process itself. By setting a “good enough” threshold for a class of decisions (same socks, same breakfast), Simon freed attention for problems where more deliberation actually changes the outcome. This is TOC applied to cognitive bandwidth: identify what decisions are genuinely bottlenecks on your life’s output, and satisfice everything else. See the same David Epstein on Constraints, Attention, and Narrative Values episode for the full Herbert Simon treatment.

Where mainstream views differ

Lean manufacturing / Six Sigma: these traditions optimise many steps simultaneously rather than concentrating on the single binding constraint. Goldratt explicitly argued this was a mistake — local efficiencies that don’t address the bottleneck create inventory and complexity without improving output.

Agile / parallel workstreams: breaking work into parallel tracks can look like it bypasses TOC, but it simply moves the bottleneck to the integration step. Goldratt’s insight applies at each level: within any parallel track and at the integration point.

Attention maximalism: the dominant culture of knowledge work (full inbox access, continuous availability, open-plan offices) is structurally opposed to protecting attention as a resource. Gloria Mark’s research quantifies the cost; the resistance comes from signalling norms (being seen to be responsive) rather than productivity evidence.