1.2.1 Extending by linearity
How should we define when is an arbitrary operation on a bit? As before, we assume that we know how acts on the two possible values of the bit, and we write for the result of the operation when the bit is zero, and for the result of the operation when the bit is one. (For the NOT operation, this was precisely what we did in Eq. 1.17.) Let us try to apply the same reasoning as above. With probability , the bit is zero and so we obtain by applying the operation . With probability , the bit is one and we obtain instead . Together, we see that we should define
where means that we are multiplying the vector by the probability .
Exercise 1.5 (NOT on probabilistic bits).
Note that the difference between the two sides is in the order of operations: on the left-hand side we first take a linear combination and then act with , while on the right-hand side we first act with and only then take the linear combination. This equation looks very similar to the familiar rule for numbers (the “distributive law”).