“No man can cross the same river twice, because neither the man nor the river are the same.”
Heraclitus
The following post is extracted & paraphrased from Rich Hickey's excellent Are We There Yet? - specifically the section of the talk that focuses on the model for discussing and thinking about the titular concepts. These concepts are in turn taken from the philosophical writings of Alfred North Whitehead (a co-author of Principia Mathematica).
I often find myself wanting to explain this core concept to people who are new to Clojure, and particularly people who I am trying to make into people who are new to Clojure. While I think I have a good handle on this concept in my head - I sometimes struggle to explain it succinctly, hopefully this post achieves that goal.
Definitions
These definitions are not really globally applicable, but they represent the precise meaning I try to capture when discussing values changing over time in the context of software development and programming.
Value
A value is some measurable quantity, magnitude or amount for which its equivalence to some other value can be determined. A value can also be some immutable composite of other values. The number 42 is a value, the string "Hello World" is a value, the list (1, 2, 3) is also a value.
Identity
An identity is some composite psychological concept we apply to a succession of values over time where they are in some way causally related. You are an identity, I am an identity, a river is an identity (see below).
Name
A name is simply a label that we apply to an identity or a value to make it easier to refer to. The same identity can have many names. "Glen", "Glen Mailer", "Mr Mailer", "Glenjamin" and "@glenathan" are all names which could legitimately be considered to refer to the identity that is myself in the appropriate context. Likewise the "Answer to the Ultimate Question of Life, The Universe, and Everything" is a name for the value 42.
State
A state is simply the value of an identity at a particular time. A snapshot, if you will. Under this definition state does not change, and thus there is no such thing as "mutable state".
Time
Purely relative, it has no dimensions - it can only tell us whether something happened before or after some other thing (or at the same time).
The River
Let us consider the title quote in the context of these definitions. To help us examine the proverbial river under this light, we shall give ourselves the same powers as when running a computer program but in the real world - which requires us to sidestep some fairly fundamental physics - hopefully this will not cause any lasting damage.
The third-longest river in Asia runs through China. Depending on context it is known as the "Yellow River", "Huang He", "རྨ་ཆུ།", "the cradle of Chinese civilization" and "China's Sorrow". All of these are names for the same river, which itself is an identity.
If we were to freeze an instant in time into a snapshot of our proverbial river crossing, this state would contain a value composed of a large number of atomic (in the irreducible sense) smaller values. For simplicity, lets assume that water molecules are immutable. In this case then the state of the river we are crossing can be said to be the current arrangement of all these immutable water molecule values.
At some point in the future when returning for our second crossing, we take another snapshot of the river as our new state. The river's value is again the arrangement of all the immutable water molecules - but this time they are all different molecules with different values.
The state of the identity which is the river named "Huang He" at this later point in time is measurably different from the value we took during the first crossing.
In Clojure
Since immutability is at it's core, we'll start here for some code examples.The following code should work correctly when pasted into a running Clojure REPL.
; We'll use numbers to represent our water molecules | |
; Numbers are values, because they cannot change | |
1 | |
2 | |
; These can be assembled into a river-state snapshot | |
; In Clojure vectors are values because they are immutable | |
[1 2 3] | |
; We'll use this to represent a (small) river with 3 water molecules in a row | |
; The river itself is an identity, we'll use an atom to represent it | |
(atom [1 2 3]) | |
; and so we have a way to refer to it later, lets give it a name | |
(def yellow-river (atom [1 2 3])) | |
; The same identity can have multiple names | |
(def huang-he yellow-river) | |
(def རྨ་ཆུ། yellow-river) | |
; Clojure provides (deref) take the current value out of an identity | |
(deref yellow-river) | |
; => [1 2 3] | |
; Lets throw in a little trick to make the river flow downstream | |
; Don't worry too much about how this works | |
(def flow | |
(future (while true | |
(swap! yellow-river #(mapv inc %)) | |
(Thread/sleep 1000)))) | |
; Now when we look at our river, it has changed | |
(deref yellow-river) | |
; => [3 4 5] | |
; We can also look at it via a different name | |
(deref རྨ་ཆུ།) | |
; => [4 5 6] | |
; Now that we've got a flowing river, lets cross it. | |
; We'll take a note of the value during the first crossing | |
(def first-crossing (deref yellow-river)) | |
; And some time later, we'll cross it again | |
(def second-crossing (do (Thread/sleep 1000) (deref yellow-river))) | |
; And lo, we didn't cross the same river twice | |
(= first-crossing second-crossing) | |
; => false |
In JavaScript
JavaScript doesn't have the same set of immutable primitives, but we can achieve a similar effect with a little sleight of hand.
The following code should work correctly when pasted into the browser console or a Node.js REPL line-by-line.
// Water molecules can still be Numbers | |
// since they're immutable values in JS | |
1 | |
2 | |
// The story is not so good with arrays | |
// JS Arrays are mutable by default | |
// and so we start with an identity | |
[1, 2, 3] | |
// As before, lets give this a name | |
var yellow_river = [1, 2, 3]; | |
// The identity can still have many names | |
var huang_he = yellow_river; | |
// var རྨ་ཆུ། = yellow_river; but tibetan doesn't work | |
// We're still missing our river value representation. | |
// Here's a snapshot function that extracts an | |
// immutable value from a mutable javascript array | |
function snapshot(array) { | |
return Object.freeze(array.slice()); | |
} | |
// Now we can extract the state at point in time from our river | |
snapshot(yellow_river); | |
// => [1, 2, 3] | |
// As before, a little trick to make our river flow | |
var timer = setInterval(function() { | |
yellow_river.forEach(function(v, i) { | |
yellow_river[i]++; | |
}); | |
}, 1000); | |
// And now as we look at the state of our river at different | |
// times, we see that the value has changed | |
snapshot(yellow_river); | |
// => [4, 5, 6] | |
// And we can still look via a different name | |
snapshot(huang_he); | |
// => [5, 6, 7] | |
// Now that we have all of the same tools, lets cross our river | |
var first_crossing = snapshot(yellow_river); | |
// And wait for a bit... | |
var d = Date.now(); while(Date.now() - d < 1200); | |
// Then cross it again | |
var second_crossing = snapshot(yellow_river); | |
// Now theres a slight problem - we made up our own | |
// representation of the river's value, so we have | |
// no way to compare the two crossing values. | |
// Let's write one: | |
function equal(a1, a2) { | |
return Object.isFrozen(a1) && | |
Object.isFrozen(a2) && | |
a1.every(function(v, i) { return v == a2[i] }); | |
} | |
// And as expected, our two values aren't the same | |
equal(first_crossing, second_crossing); | |
// => false |
Summary
The ancient Greeks knew about the perils of mutable state a long time ago - we're only now rediscovering this for ourselves.
In a language like Clojure, that was designed from the ground up with this in mind, it's easy to take back control and tease apart the separate concepts I've described. Even in a language like JavaScript, designed in a week at a time when mutability was commonplace, we can achieve a similar effect with a measure of self-control. There are also libraries like mori and immutable-js which provide much fuller implementations of the data-types required to avoid mutability.
If you remain unconvinced, I recommend watching Are We There Yet?. If you're still not sure after that, you're either a far better programmer than me, or you're yet to experience the pain of trying to reason about a large codebase riddled with unchecked mutability.
Addendum
As well as the above definitions, Are We There Yet? contains this gem, which is Rich visualising the idea of obvious complexity while saying "Grrrrr".