-
-
Notifications
You must be signed in to change notification settings - Fork 163
Perlis Thompson Principle
UNDER CONSTRUCTION
The Perlis-Thompson Principle is an idea in software architecture. It's arguably the most important one, because it explains the evolution and scaling of the largest and longest-lived software systems, like Unix and the Web. It also suggests where they fall short.
I wrote 4 blog posts that mention it, starting in July 2021, but the series isn't done yet:
- Summer Blog Backlog: Understanding and Using Shell. This post gives a short outline of the principle and its motivation.
- Summer Blog Backlog: Distributed Systems. This was widely discussed as Kubernetes Is Our Generation's Multics. Claim: In the future, we'll use a distributed OS that adheres to the Perlis-Thompson Principle. It will have fewer concepts and compose naturally. It should use Unix-style language-oriented composition.
- Blog Review: Distributed Systems
-
Unix Shell: History and Trivia
- My comments on this article explaining the narrow waist idea from various angles
Here's a literal summary of what Perlis and Thompson said:
Write software where everything is an X (e.g. a string, pointer, cons cell, table, vector, window, etc.)
They say to use a single data structure, or single notion. However I think that is too absolute, and there are caveats.
Here's a revised, short definition of the principle:
Software with fewer concepts composes, scales, and evolves more easily.
Here's a long definition:
Consider using fewer concepts, data structures, and types in foundational software like programming languages and operating systems.
This style allows for more composition and ad hoc reuse. It scales in multiple dimensions. It evolves gracefully (and messily) over decades.
When introducing a new concept, define a way to reduce it to an existing concept.
- "Everything is an X"
- Writing
O(M + N)
code instead ofO(M * N)
. Huge difference! - Coding the Perimeter vs. Coding the Area (from the Unix vs. Google video)
A software ecosystem that uses a narrow waist is following the Perlis-Thompson Principle in a particular way. The narrow waist idea relates to data: data structures, interchange formats, and network protocols. The Perlis-Thompson principle is arguably more general and refers to "concepts" like Unix processes and Emacs buffers that aren't quite data.
It's also known as:
- Thin Waist
- Hourglass Model
- Lowest Common Denominator
- Networking Terms: "Distinguished Layer" or "Spanning Layer"
- This 2019 article tries to formalize it with math, but I'm not sure how useful that idea is: https://cacm.acm.org/magazines/2019/7/237714-on-the-hourglass-model/fulltext
This idea spans operating systems, networking, and programming languages.
- TCP/IP is a narrow waist between physical issues like wired/wireless and application protocols like the web, e-mail, IRC, Usenet, etc.
- The narrow waist of the Internet has arguably moved to HTTP over time, with e-mail, chat, Usenet-style discussions all migrating to web apps
- An operating system API like POSIX is a narrow waist between applications and hardware
- LLVM is a narrow waist between programming languages and computer architectures
- Byte streams / Unix-style text are a narrow waist between storage/networking abstractions and application-specific schemas
- Pandoc is a narrow waist between document formats. Instead of writing
O(N^2)
document converters, you design an IR, and write N translators to the IR, and N translators from the IR. - SQL is a Narrow Waist (Jamie Brandon)
Just to show the the concept isn't vacuous, here are things that are not narrow waists:
-
No: MS Word files or Excel spreadsheets. These formats are too complicated, to the point that they're tied to specific applications. On the other hand, a CSV file is a narrow waist -- you export it from Excel so you can read it into other applications.
- CSV is also an example of a bad narrow waist. Not all narrow waists are well designed! Sometimes they just evolve. JSON is a better one that was explicitly designed.
- No: JPEG. It's optimized for small size and display. To support other operations, like various kinds of editing, it's converted to a format that's easier to work with.
- TODO: There should be a networking textbook that talks about narrow waists / hourglass
- TODO: There should also be a compiler textbook. I know I've seen a diagram of IRs as a waist between programming languages and CPU architectures.
Not sure about OS textbooks, but the concept definitely applies there too.
- HTTP: An Evolvable Narrow Waist For a Future Internet (PDF, 2010). This and Van Jacobsen's Named Data Networking introduced me to the concept of a narrow waist. (Even though it's in both networking and compiler textbooks.)
- My HN Comment on the Kubernetes-Multics analogy (2021): I trace the narrow waist to Kleinrock via Brewer, but there might be a more original reference
-
Unix vs. Google (2016): a video by Kevin Greer. Has visualizations of Unix vs. Multics: coding the area vs. coding the perimeter.
O(M * N)
problems. -
Always Bet on Text (Graydon Hoare, 2014). 196 HN Comments.
- Text is everything. My thoughts on this are quite absolute: text is the most powerful, useful, effective communication technology ever, period.
- It can be indexed and searched efficiently, even by hand. It can be translated. It can be produced and consumed at variable speeds. It is asynchronous. It can be compared, diffed, clustered, corrected, summarized and filtered algorithmically. It permits multiparty editing.
- This is a restatement of what Perlis said: It's better to have 100 functions on 1 data structure than 10 functions and 10 data structures.
It doesn't appear that the operating systems community does! And definitely not the "cloud" operating system vendors.
- https://www.iab.org/wp-content/IAB-uploads/2010/11/hourglass-london-ietf.pdf (2001)
- https://arxiv.org/ftp/arxiv/papers/1607/1607.07183.pdf
- https://rh.gatech.edu/news/69297/study-shows-how-internets-architecture-got-its-hourglass-shape
- https://people.eecs.berkeley.edu/~kubitron/cs262/lectures/lec02-E2E-SystemR.pdf
Common threads here: either "everything is an X", or explicit nonlinear codebase scaling problems -- O(M * N)
or O(N^2)
.
- Extremist Programming (2012). What if everything were a function? What if everything were an object?
- Everything is an X by Luke Plant (11/2020). Talks about programming languages and user interfaces like Emacs.
-
Are There Any Interesting Programming Languages Based on Particular Data Structures? (2021) -- My comment with 70 upvotes. Most good programming languages have this flavor!
- C has an equivalence between arrays and pointers:
a[i] == *(a + i) == *(i + a) == i[a]
! Doug McIlroy mentioned this in regard to Dennis Ritchie's genius.
- C has an equivalence between arrays and pointers:
- Unix and Microservice Platforms (2021) by Brandon Bloom. Refers to the Unix vs. Google video. Good example of microservices vs. auth/monitoring/metrics/etc.
- My Comment on "On Unix Composability" (2021) -- SSH transport solves and O(M * N * K) problem: (apps like ssh, scp, git, rsync) X (auth methods like password or public key) X method of encryption
-
Glue: The Dark Matter of Software (2021). Makes an explicit claim that N features requires O(N^2) glue code. References the Unix vs. Google video.
- Author is working on a programming language Objective S.
-
Why file systems have loose coupling and your protocol doesn't (2013)
- RPC style vs. File system style. Plan 9 uses the latter.
- This design is often called the “Uniform Interface Constraint”, and is present in one of the more popular protocols, HTTP.. This has a bearing on whether you can write transparent middleboxes, e.g. a cache. You can write a generic HTTP cache, but not a generic RPC cache (because you don't know what the methods are and if they're idempotent, etc.)
- Fielding's REST dissertation has a fairly clear description of the Uniform Interface Constraint and the resulting tradeoffs: https://www.ics.uci.edu/~fielding/pubs/dissertation/rest_arch_style.htm
I've been thinking about this for a long time!
-
Comment echoing an objection to Bret Victor's work (2013). I make an explicit
O(M * N)
argument.- I was waiting for him to mention what I think of as the Unix/Plan 9/REST principle the whole time -- that is what I was calling the Perlis-Thompson principle in my head in 2013
- Comment on Doug McIlroy's Unix Taste Comments (2016). Everything is a file, etc.
- The web was a modest and humble (and thus brilliant) extension of Unix (2021). Unlike Kubernetes and Unix, the new concepts compose with the old ones! It doesn't "shit all over" OS abstractions.
The Perlis-Thompson Principle goes against the intuition of many programmers. They are weighing the tradeoffs from a code perspective, not a systems perspective. Those perspectives can lead to opposite conclusions.
- A Case Against Text Protocols (2020). My "Unix text as a narrow waist" comment argues against a common misunderstanding. Text is the reason we have good tools. As an example, it doesn't make sense to make an WASM editor and WASM version control system. Instead the binary WASM format is projected on to text, and we use text editors and SCMs like git.
- On the Missed Opportunities of Static Types (2021). There is a tradeoff: more types give you some easy safety checks, but can inhibit composition!
-
FFI Fallacy on the Vale Programming Language (2020)
- People who advocate "standardized" RPC systems are often attached to a similar fallacy.
- A function in C is not a function in Python is not a function in Rust. You need a lowest common denominator, or you require O(N^2) glue. I remember someone actually writing to write a specific R-Java bridge rather than using IPC.
-
Programmer's Critique of Missing Structure of Operating Systems (2020). A long article lamenting the repetitiveness of parsing a plethora of data formats. It's reaching toward ideas like a single-level store.
- My response: This could be done in an "application framework". The problem is nobody can agree on the lowest common denominator. There's no reason to think that putting more code in the kernel will solve this problem.
- An operating system has two "sides": one part faces the hardware, and one part faces applications. He wants the OS to do more work for applications. I think a tighter goal for the operating system is to proivde a minimal abstraction to virtualize and share hardware, while providing safety and isolation. It should make applications "possible"; making applications "easy" and factoring into shared libraries is a separate job.
-
Shells Should Shell Out (2021). I question the two-tiered design of PowerShell, Elvish, and nushell. cmdlets aren't processes. They need semantic rules to convert between internal and external representations.
- Argument: The lowest common denominator between a PowerShell script and an Elvish script is a Bourne shell script :) Byte streams are the lowest common denominator.
- "Desugaring" in compilers. Design an IR that has fewer concepts that your language reduces to. This simplifies the back end of the compiler and aids reasoning about correectness. (I recall Walter Bright of D saying that Andrei Alexandrescu really pushed for more desugaring in the semantics of in D 2.0, the second version of the D language.)
- Slogans, Fallacies, and Concepts -- Understanding shell.
- Patterns and Anti-Patterns -- Using shell.
- Composable Distributed OS -- Applying the Perlis-Thompson Principle