Partial Order Theory of Recursion And Semantics

This is an introduction to a mathematical theory of recursion that covers:

- modelling “undefined” as an extra value ⊥ in every type
- giving meaning to recursive definitions as limits of successive approximations
- the supporting theory: partial orders, continuous functions, least fixed points

In addition to being of theoretical interest, it also helps with understanding code like

ns = 0 : map (+ 1) ns
fs = 1 : 1 : zipWith (+) fs (tail fs)

Although in principle you could trace lazy evaluation step by step, that would be a big mess in these cases. The theory here offers a more practical way, shown below. Reason: finding the answer seldom requires following execution steps; there is always a higher-level way.

On the flip side, this high-level theory ignores time and space complexities; those would be the strong suit of following execution steps.

50 Shades of Undefinedness

Every type is modelled as a set consisting of the usual values you expect, plus an additional element we write as “⊥” to stand for “undefined” or “unknown”, so that “f 5 is undefined” is expressible as “f 5 = ⊥”. Some types have additional elements; we will get there. People also pronounce ⊥ as “bottom”.

Beginners can interpret ⊥ as non-termination and/or runtime error. But eventually you should accept the fact that this theory only addresses “will there be an answer? if so, what is it?”, so ⊥ means only “no answer”. Since this theory does not address execution steps, “non-termination” does not have a meaning here.
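For concreteness, here is a small Haskell sketch (names mine) of two expressions that both denote ⊥: one by looping forever, one by a runtime error. The theory identifies them; both mean “no answer”.

```haskell
-- Two expressions denoting ⊥ :: Integer; the theory does not distinguish them.
bottomLoop :: Integer
bottomLoop = bottomLoop          -- would loop forever if demanded

bottomError :: Integer
bottomError = error "no answer"  -- would raise an error if demanded

-- Laziness lets us mention them without demanding their answers:
main :: IO ()
main = print (length [bottomLoop, bottomError])
```

Since length only walks the list spine, neither ⊥ is ever demanded, and the program prints 2.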

(The full theory upgrades “set” to “set with partial order”. ⊥ is the lowest element in the partial order, hence “bottom”. The partial order justifies the magical methods in this lecture.)

Basic Types

Example: Integer is {⊥}∪ℤ.

Algebraic Data Types

Unsurprising without fields:

data Uno = MkUno
data Bool = False | True
data Trio = T1 | T2 | T3

The sets are respectively

{⊥, MkUno}
{⊥, False, True}
{⊥, T1, T2, T3}

More interesting when you have fields:

data E = C1 Bool | C2 Integer

The set is

{⊥} ∪ {C1 b | b ∈ {⊥, False, True}} ∪ {C2 i | i ∈ {⊥}∪ℤ}

Note that C1 ⊥ and C2 ⊥ are possible in Haskell, in addition to ⊥. Idea: Knowing which of the two cases you are in is a little bit better than not even knowing. This is different from many other languages; there will be a remark on this later.
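A small Haskell sketch (helper name mine) showing that C1 ⊥ really is better than ⊥: pattern matching can report which constructor it sees without touching the undefined field.

```haskell
data E = C1 Bool | C2 Integer

-- Reports the constructor without inspecting the field.
whichCase :: E -> String
whichCase (C1 _) = "C1"
whichCase (C2 _) = "C2"

main :: IO ()
main = putStrLn (whichCase (C1 undefined))
```

This prints "C1": knowing the case needs no knowledge of the field.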

data B2 = MkB2 Bool Bool

The set is

{⊥} ∪ {MkB2 b c | b∈{⊥, False, True}, c∈{⊥, False, True}}

E.g., MkB2 ⊥ ⊥, MkB2 False ⊥, MkB2 ⊥ False are possible in Haskell. Idea: Some programs successfully output the constructor and may just have problems with some fields; some other programs have so much trouble they can’t even output that little.
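Likewise for MkB2 (sketch, helper name mine): a program can use the defined field of MkB2 False ⊥ even though the other field is ⊥.

```haskell
data B2 = MkB2 Bool Bool

-- Projects the first field; never touches the second.
firstOfB2 :: B2 -> Bool
firstOfB2 (MkB2 b _) = b

main :: IO ()
main = print (firstOfB2 (MkB2 False undefined))
```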

Recursive Algebraic Data Types

Recursive types have the above plus infinitely long/big values, e.g., infinite lists. Using [Bool] for example:

data [a] = [] | a : [a]
-- equiv to: data List a = Nil | Cons a (List a)

The set for [Bool] is Fin ∪ Inf, where
Fin is the smallest set satisfying Fin = {⊥, []} ∪ {x:xs | x ∈ {⊥, False, True}, xs ∈ Fin}
Inf is the set of infinite lists, where each element is in {⊥, False, True}
The rigorous construction is complicated and shown in the Theory section.

Examples of elements in Fin:

⊥ : ⊥
⊥ : []
False : ⊥
⊥ : ⊥ : ⊥
⊥ : ⊥ : []
⊥ : False : ⊥

Examples of elements in Inf:

⊥ : ⊥ : ⊥ : … infinite list of undefined Bools
False : ⊥ : True : ⊥ : … infinite list with some undefined Bools
False : False : False : … infinite list of False’s

Fin has the finite values; you may have guessed it from the non-recursive examples. The surprising addition is Inf, the infinite values; their necessity will become clear later (defining recursion as limits of sequences).
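A sketch of how an element of Fin with ⊥ parts behaves in Haskell: lazy observation can use the defined prefix of False : ⊥ without demanding the ⊥ tail.

```haskell
-- False : ⊥, an element of Fin with an undefined tail.
partialList :: [Bool]
partialList = False : undefined

main :: IO ()
main = print (take 1 partialList)  -- uses only the defined prefix
```

take 1 forces only the first cons cell, so this prints [False]; take 2 would hit the ⊥ tail.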

Function Types

In a function type X -> Y, the ⊥ element is the function that maps all inputs to ⊥.

Reminder: X and Y themselves have extra elements as described here, e.g., Integer -> Bool becomes {⊥}∪ℤ → {⊥, False, True}. This already has (λx → ⊥) that maps all of {⊥}∪ℤ to Bool’s ⊥. We equate it with ⊥ of the function space, i.e., we assert ⊥ = (λx → ⊥).

Calculating with Undefinedness

What should happen to, e.g., f ⊥ and C ⊥ (where f is a function, C is a data constructor of 1 field)? Programming languages take one of two stances:

- Strict: f ⊥ = ⊥ and C ⊥ = ⊥ always; an undefined part makes the whole undefined. Most languages take this stance.
- Non-strict: f ⊥ and C ⊥ can be better than ⊥, e.g., when f does not need its argument; in particular C ⊥ is a defined value with an undefined field. Haskell takes this stance.

This note focuses on Haskell. Examples:

const 5 ⊥ = (λx y → x) 5 ⊥ = 5
MkB2 (div 1 0 == 0) False = MkB2 ⊥ False

But here are the cases where a ⊥ subexpression leads to a ⊥ answer:

- pattern matching on ⊥ (e.g., case ⊥ of …) gives ⊥; this is why map f ⊥ = ⊥
- a primitive operation that needs the value gives ⊥, e.g., ⊥ + 1 = ⊥, ⊥ == 0 = ⊥, if ⊥ then … else … = ⊥

Evaluation order: in this theory we are doing math and algebra; all evaluation orders give the same final answer.

Successive Approximations and Taking Limit

This theory treats recursion by defining a sequence of successive approximations and then taking a “limit”.

If the recursive program goes like

foo = … foo …

then we define the sequence of approximations foo₀, foo₁, foo₂, … by

foo₀ = ⊥
fooₙ₊₁ = … fooₙ …

Idea: fooₖ is like setting the recursion depth quota to k; this is why fooₙ₊₁ may only call fooₙ: the call costs 1 unit of your quota. When your quota runs out, you give up and use ⊥; this is why foo₀ = ⊥.

Then we figure out what the sequence converges to, and that becomes the meaning of foo. Idea: Extrapolate to unlimited recursion depth.

Examples to show it in action:

Very Basic Infinite List

Program:

zs = 0 : zs

Sequence:

zs₀ = ⊥
zs₁ = 0 : zs₀ = 0 : ⊥
zs₂ = 0 : zs₁ = 0 : 0 : ⊥
zs₃ = 0 : zs₂ = 0 : 0 : 0 : ⊥

Note that zsₙ₊₁ is a compatible improvement upon zsₙ. I mean: where zsₙ has a non-⊥ part, zsₙ₊₁ must agree (compatible); where zsₙ has a ⊥ part, zsₙ₊₁ may improve it to a non-⊥ value (improvement). This can be formalized and proved to hold for all programs.

“Limit” means the minimal compatible improvement over every zsₖ. The infinite list 0 : 0 : 0 : … does the job, so it is now what zs means. (“Minimal” means: don’t improve a ⊥ part to a non-⊥ value unless you have to.)

Equivalently, think of how zsₖ gives up after k nodes because it hits the recursion depth quota. If we allowed unlimited quota, it would keep going. So the answer is the infinite list.

These ideas can be formalized and proved to hold for all programs.
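The approximation sequence can itself be sketched in Haskell (names mine), modelling the quota as an explicit argument and ⊥ as undefined; taking a short prefix of a later approximation shows the limit emerging.

```haskell
-- Body of the recursion zs = 0 : zs, with the recursive call abstracted out.
h :: [Integer] -> [Integer]
h r = 0 : r

-- approx k plays the role of zs_k: give up with ⊥ when the quota runs out.
approx :: Int -> [Integer]
approx 0 = undefined
approx k = h (approx (k - 1))

main :: IO ()
main = print (take 3 (approx 4))
```

Laziness never demands the ⊥ at depth 4, so this prints [0,0,0], agreeing with a prefix of the limit.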

1, 2, 3,…

Program:

ns = 0 : map (\x -> x + 1) ns

Sequence:

Recall map f ⊥ = ⊥. This will help with working out the sequence.

ns₀ = ⊥
ns₁ = 0 : map (λx → x + 1) ⊥
    = 0 : ⊥
ns₂ = 0 : map (λx → x + 1) (0 : ⊥)
    = 0 : 1 : ⊥
ns₃ = 0 : map (λx → x + 1) (0 : 1 : ⊥)
    = 0 : 1 : 2 : ⊥
ns₄ = 0 : 1 : 2 : 3 : ⊥

The limit is the infinite list 0 : 1 : 2 : 3 : 4 : …
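You can check the extrapolation by running the program and taking a finite prefix:

```haskell
ns :: [Integer]
ns = 0 : map (\x -> x + 1) ns

main :: IO ()
main = print (take 5 ns)
```

This prints [0,1,2,3,4], matching the limit.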

Fibonacci Again

This is a famously confusing example in some Haskell tutorials, but we are now equipped to deconfuse it! Program:

zipPlus [] _ = []
zipPlus _ [] = []
zipPlus (x:xs) (y:ys) = x+y : zipPlus xs ys
tail (_:xs) = xs
fs = 1 : 1 : zipPlus fs (tail fs)

Sequence:

Note that fsₙ₊₁ is defined to be

fsₙ₊₁ = 1 : 1 : zipPlus fsₙ (tail fsₙ)

It may make multiple calls to fsₙ: the quota limits recursion depth, not the total number of calls.

fs₀ = ⊥
fs₁ = 1 : 1 : zipPlus ⊥ (tail ⊥)
    = 1 : 1 : ⊥
fs₂ = 1 : 1 : zipPlus (1 : 1 : ⊥) (1 : ⊥)
    = 1 : 1 : 2 : ⊥
fs₃ = 1 : 1 : zipPlus (1 : 1 : 2 : ⊥) (1 : 2 : ⊥)
    = 1 : 1 : 2 : 3 : ⊥
fs₄ = 1 : 1 : zipPlus (1 : 1 : 2 : 3 : ⊥) (1 : 2 : 3 : ⊥)
    = 1 : 1 : 2 : 3 : 5 : ⊥

The limit is the infinite Fibonacci list.
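Again checkable by taking a finite prefix of the program’s fs:

```haskell
zipPlus :: [Integer] -> [Integer] -> [Integer]
zipPlus [] _ = []
zipPlus _ [] = []
zipPlus (x:xs) (y:ys) = x + y : zipPlus xs ys

fs :: [Integer]
fs = 1 : 1 : zipPlus fs (tail fs)

main :: IO ()
main = print (take 7 fs)
```

This prints [1,1,2,3,5,8,13].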

Unproductive Recursion

Program:

x = 5 * x

Sequence:

x₀ = ⊥
x₁ = 5 * x₀ = ⊥
x₂ = 5 * x₁ = ⊥

Stuck at ⊥, so converges to ⊥. This is why x = 5 * x does not “solve” the equation the way you might expect (in ordinary algebra, x = 0 would work)!

Unparadoxical Recursion

Program:

b && c = case b of False -> False
                   True  -> c
x = False && x

Sequence:

x₀ = ⊥
x₁ = False && ⊥ = False
x₂ = False && False = False

Converges to False.
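This one runs fine in Haskell, because && pattern-matches only its first argument here:

```haskell
-- Recursion on the right of && is never demanded once the left is False.
x :: Bool
x = False && x

main :: IO ()
main = print x
```

This prints False, agreeing with the limit.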

Function Example

Program:

f :: Integer -> Integer
f x = if x==0 then 0 else 1 + f (x - 2)

Sequence:

Reminder: The approximation method translates to:

f₀ x = ⊥
fₙ₊₁ x = if x==0 then 0 else 1 + fₙ (x - 2)

In this kind of work, it is best to work out each fₙ for all inputs, expressing it as a formula or a table, before tackling fₙ₊₁.

Table:

x      -2   -1    0    1    2    3    4
f₀ x    ⊥    ⊥    ⊥    ⊥    ⊥    ⊥    ⊥
f₁ x    ⊥    ⊥    0    ⊥    ⊥    ⊥    ⊥
f₂ x    ⊥    ⊥    0    ⊥    1    ⊥    ⊥
f₃ x    ⊥    ⊥    0    ⊥    1    ⊥    2

Brief illustration of how to obtain some of those entries:

f₀ x = ⊥  regardless of x

f₁ ⊥ = if ⊥==0 …
     = ⊥
f₁ 0 = 0
f₁ x = 1 + f₀ (x-2)  for other x’s
     = 1 + ⊥
     = ⊥

f₂ ⊥ = ⊥  similarly
f₂ 0 = 0
f₂ 2 = 1 + f₁ 0
     = 1
f₂ x = 1 + f₁ (x-2)  for other x’s
     = 1 + ⊥
     = ⊥

Formula:

fₖ x = { ⊥    if x = ⊥
       { x÷2  if 0 ≤ x < 2k and x is even
       { ⊥    otherwise

Remark: Since this is doing math, any common math notation or description that gets your point across is OK. So my “if x=⊥” uses the math equals sign, not the computer ==; for example, 0=⊥ is well-defined and false (whereas 0==⊥ is undefined, i.e., ⊥).

Extrapolating, the limit is

f x = { ⊥    if x = ⊥
      { x÷2  if 0 ≤ x and x is even
      { ⊥    otherwise
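A quick check of the limit against the program, on defined even inputs (odd or negative inputs would recurse forever, i.e., ⊥):

```haskell
f :: Integer -> Integer
f x = if x == 0 then 0 else 1 + f (x - 2)

-- f x should equal x ÷ 2 for even x ≥ 0.
main :: IO ()
main = print (map f [0, 2, 4, 6])
```

This prints [0,1,2,3], matching x÷2.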

Theory (Not on Test/Exam)

Partial Order

The set corresponding to a type is given a partial order. The notation is usually ⊑. I pronounce it “under”. (And I confusingly overload it for all types/sets.)

Partial order means:

- reflexive: x ⊑ x
- antisymmetric: if x ⊑ y and y ⊑ x, then x = y
- transitive: if x ⊑ y and y ⊑ z, then x ⊑ z

Difference from total order: two values may be incomparable, i.e., both x ⋢ y and y ⋢ x may hold.

Interpret x ⊑ y as: either they have the same information, or y has more information—a compatible improvement upon x. Hence people say “information order”.

Idea: fooₙ₊₁ can give more information than fooₙ because fooₙ₊₁ works harder: it is allowed one more level of recursion depth.

Information order for Integer: Smallest partial order that satisfies ⊥ ⊑ everyone.

Information order for an algebraic data type: Smallest partial order that satisfies:

- ⊥ ⊑ everyone
- component-wise for constructor terms: C x₁ … xₘ ⊑ C y₁ … yₘ iff xᵢ ⊑ yᵢ for every i

(Rigorous construction for a recursive type is described in the next subsection.)

Information order for a function type X→Y: “pointwise order”: f ⊑ g iff ∀x. f x ⊑ g x.

Theorem: For every recursive program foo,

foo₀ ⊑ foo₁ ⊑ foo₂ ⊑ …

So the successive approximation method gives compatible improvements.

Recursive Algebraic Types

Construction of the set for a recursive type is complicated, but both finite and infinite values emerge naturally. There are two stages, and each value is represented by an infinite sequence.

With [Bool] as an example:

Stage 1: Define sequence S of sets by:

S₀ = {⊥}
Sₙ₊₁ = {⊥, []} ∪ {x:xs | x ∈ {⊥, False, True}, xs ∈ Sₙ}

Each Sₙ is also given the component-wise partial order (and ⊥ ⊑ everyone) for its information order.

You may recognize that their union gives Fin. But Fin is not enough. We will not use their union; instead we go out of our way to construct infinite sequences as follows. Preview:

False:False:[] will be represented by the infinite sequence
⟨⊥, False:⊥, False:False:⊥, False:False:[], False:False:[], …⟩
which increases until False:False:[], then plateaus.

False:False:False:… (infinite list) will be represented by
⟨⊥, False:⊥, False:False:⊥, False:False:False:⊥, …⟩
which keeps increasing and never plateaus.

Stage 2:

For each n, define eₙ: Sₙ → Sₙ₊₁ to be the inclusion function. (e for embedding.)

It is almost invertible, except for the extra elements in Sₙ₊₁ not in Sₙ; fortunately those extra elements have greatest approximations in Sₙ, e.g., False : False : ⊥ is in S₂, and its greatest approximation in S₁ is False : ⊥.

Define pₙ: Sₙ₊₁ → Sₙ by pₙ(y) = the greatest x s.t. eₙ(x) ⊑ y. (p for projection.)

Important properties: eₙ and pₙ are monotonic; pₙ(eₙ(x)) = x; eₙ(pₙ(y)) ⊑ y.

I will need sequences in which the n-th item comes from Sₙ. Let me write Πₙ·Sₙ for the set of all such sequences.

Then the set for [Bool] is represented by

{s ∈ Πₙ·Sₙ | ∀n· sₙ = pₙ(sₙ₊₁)}

Its information order is the pointwise order.

Some non-examples, because they violate ∀n· sₙ = pₙ(sₙ₊₁):
⟨⊥, False:⊥, False:⊥, False:[], …⟩ because p₂(s₃) = False:[] ≠ s₂
⟨⊥, ⊥:⊥, False:⊥, …⟩ because p₁(s₂) = False:⊥ ≠ s₁

The point is that a valid sequence can only do one of two things: strictly increase forever, or strictly increase for a while and then plateau and never increase again.

Examples of plateauing (they represent finite values):

⟨⊥ and stays there⟩
⟨⊥, ⊥:⊥, ⊥:False:⊥ and stays there⟩
⟨⊥, ⊥:⊥, ⊥:False:⊥, ⊥:False:[] and stays there⟩

We rename them to their plateaued values, respectively: ⊥, ⊥:False:⊥ and ⊥:False:[]. They become our Fin.

Examples of strictly increasing forever (they represent infinite values):

⟨⊥, ⊥:⊥, ⊥:⊥:⊥, ⊥:⊥:⊥:⊥, keeps growing⟩
⟨⊥, False:⊥, False:False:⊥, False:False:False:⊥, keeps growing⟩

We rename them to a convenient infinite list notation, respectively ⊥:⊥:… and False:False:…. They become our Inf.

Thus, although the construction is complicated, we get both finite and infinite values in one fell swoop.

Limits of Sequences

Definition: An ascending sequence is a sequence foo₀, foo₁, foo₂, … that also satisfies

foo₀ ⊑ foo₁ ⊑ foo₂ ⊑ …

Each set is required to have enough additional elements so that every ascending sequence has a limit. Precisely,

If foo₀ ⊑ foo₁ ⊑ foo₂ ⊑ …, then the sequence has a least upper bound.

(It is easy to prove that least upper bounds are unique.)

Notation (with dummy variable k): limₖ fooₖ

This is why [Bool] needs infinite lists as additional elements; for example, the sequence for zs needs one. Similarly for other recursive algebraic data types.

Continuous Functions

The set for a function type X -> Y does not actually have all functions in (the set for X) → (the set for Y) allowed by set theory. We restrict it to continuous (limit-preserving) functions for the theory here to work.

h is continuous iff: for every ascending sequence s, h (limₖ sₖ) = limₖ (h sₖ)

Theorem: All programmable functions are continuous.

Theorem: All continuous functions are order-preserving (if x ⊑ y, then h x ⊑ h y).

Limit = Least Fixed Point

Definition: x is a fixed point of function h iff x = h x.

I.e., a solution of the equation x = h x.

There may be multiple solutions (or none), but we are interested in:

Definition: x is a least fixed point of h iff: x = h x, and x ⊑ every fixed point of h.

(It is easy to prove that least fixed points are unique.)

Example: x = 0 * x has two solutions, ⊥ and 0, but ⊥ is the least because ⊥ ⊑ 0.

Relevance: Recursive programs are fixed points. Examples:

-- zs = 0 : zs
h0 r = 0 : r
zs = h0 zs

-- ns = 0 : map (\x -> x + 1) ns
h1 r = 0 : map (\x -> x + 1) r
ns = h1 ns

-- f = \x -> if x==0 then 0 else 1 + f (x - 2)
hf r = \x -> if x==0 then 0 else 1 + r (x - 2)
f = hf f

Moreover, the computer can at most compute least fixed points; this is why x = 0 * x can only give you ⊥.

The following theorem backs the successive approximation method.

Theorem: If h is continuous, then limₖ sₖ is a least fixed point of h, where

s₀ = ⊥
sₙ₊₁ = h sₙ
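In Haskell, the limit-of-iterates construction is packaged as fix from Data.Function; a sketch applying it to the ns example:

```haskell
import Data.Function (fix)

-- h1 is the body of ns = 0 : map (\x -> x + 1) ns,
-- with the recursive call abstracted out.
h1 :: [Integer] -> [Integer]
h1 r = 0 : map (\x -> x + 1) r

-- fix h1 is the least fixed point of h1, i.e., ns.
main :: IO ()
main = print (take 5 (fix h1))
```

This prints [0,1,2,3,4], the same prefix as ns itself.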

Extra Theory: Lies

I used simpler (but possibly weaker) definitions than the standard ones. In the standard definitions:

Each set is required to have least upper bounds for not just ascending chains, but also non-empty totally-ordered subsets.

Accordingly, continuous functions preserve least upper bounds of non-empty totally-ordered subsets.

I still have to find out how much the difference matters.

Further Reading

My 2021 talk in CMS Seminars

The Haskell Wikibook has a chapter on denotational semantics.

Introduction to lattices and order, 2ed, by Davey and Priestley.