Evaluation Order

Call-by-value

Most languages use call-by-value for evaluation order: to evaluate "f (foo, bar)", first evaluate foo and bar (which one first depends on the language), then plug the resulting values into f's body and evaluate the body.

Example: If there is a function defined as f(x, y) = x:

  f (3+4, div(4, 2))    eval a parameter, arithmetic
→ f (7, div(4, 2))      eval the other parameter, arithmetic
→ f (7, 2)              ready to plug in at last
→ 7

A problematic parameter can cause an error/exception even if it would be unused:

  f (3+4, div(1, 0))    eval a parameter, arithmetic
→ f (7, div(1, 0))      eval the other parameter, arithmetic
→ error: division by zero (even though that parameter is never used)

Lazy evaluation (a.k.a. call-by-need)

Let me mess with you first.

The standard library has a list function “take” that behaves like this:

take 3 [a,b,c,d,e] = [a,b,c]
take 3 [a,b] = [a,b]

In general, take n returns the first n items of the input list, or as many as are available. The implementation goes like this:

take 0 _ = []
take _ [] = []
take n (x:xs) = x : take (n-1) xs

Do the following terminate?

doITerminate = take 2 (from 0)
  where
    from n = n : from (n + 1)

doIEvenMakeSense = take 2 zs
  where
    zs = 0 : zs
    -- At low level this is one node pointing back to itself. O(1)-space.
in HaskellEval.hs

Lazy evaluation in Haskell (sketch):

Very drastic example: The standard library has

const x y = x

Evaluation of const (3+4) (div 1 0):

  const (3+4) (div 1 0)    plug in
→ 3+4                      arithmetic
→ 7

(No error about dividing by zero.)
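A sketch of the contrast, using the Prelude's strict application operator ($!), which forces its argument before the call; catching the exception needs Control.Exception, and the names lazyCall and strictCall are mine:

```haskell
import Control.Exception (ArithException, evaluate, try)

-- Under lazy evaluation the second argument of const is never needed,
-- so the division by zero never runs.
lazyCall :: Int
lazyCall = const (3 + 4) (div 1 0 :: Int)   -- 7

-- ($!) forces div 1 0 before const is applied, so the divide-by-zero
-- exception fires, as it would under call-by-value.
strictCall :: IO (Either ArithException Int)
strictCall = try (evaluate (const (3 + 4) $! (div 1 0 :: Int)))
```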

Evaluation of doITerminate, carried out until the answer is fully printed (e.g., when the REPL needs to print the full result):

  take 2 (from 0)                      eval "from 0" for pattern matching
→ take 2 (0 : from (0 + 1))            match, plug in
→ 0 : take (2-1) (from (0 + 1))        eval "2-1" for pattern matching
→ 0 : take 1 (from (0 + 1))            eval "from (0 + 1)" for pattern matching
→ 0 : take 1 (n : from (n + 1))        match, plug in
  with n = 0 + 1
→ 0 : n : take (1-1) (from (n + 1))    eval element for printing
  with n = 0 + 1
→ 0 : 1 : take (1-1) (from (1 + 1))
(It is also OK to write
  0 : n : take (1-1) (from (n + 1))
  with n = 1
)
                                       eval "1-1" for pattern matching
→ 0 : 1 : take 0 (from (1 + 1))        match, plug in
→ 0 : 1 : []
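Both puzzles above do terminate; here is a self-contained sketch (repeating the definitions) that you can load into GHCi to check:

```haskell
from :: Int -> [Int]
from n = n : from (n + 1)

doITerminate :: [Int]
doITerminate = take 2 (from 0)   -- [0,1], as the trace above shows

doIEvenMakeSense :: [Int]
doIEvenMakeSense = take 2 zs     -- [0,0]
  where
    zs = 0 : zs  -- at low level, one node pointing back to itself
```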

Newton's method with lazy lists, as in Hughes's «Why Functional Programming Matters». Approximately solve x^3 - b = 0, i.e., compute the cube root of b.

So f(x) = x^3 - b, f'(x) = 3x^2

x1 = x - f(x)/f'(x)
   = x - (x^3 - b)/(3x^2)
   = x - (x - b/x^2)/3
   = (2x + b/x^2)/3

The local function “next” below is responsible for computing x1 from x.

cubeRoot b = within 0.001 (iterate next b)
    -- From the standard library:
    -- iterate f z = z : iterate f (f z)
    --             = [z, f z, f (f z), f (f (f z)), ...]
  where
    next x = (2*x + b/x^2) / 3
    within eps (x : x1 : rest)
        | abs (x - x1) <= eps = x1
        | otherwise = within eps (x1 : rest)
in HaskellEval.hs
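A quick usage check, with the definition repeated so this block stands alone; Newton's method converges fast, so the answer is very close to the true cube root:

```haskell
cubeRoot :: Double -> Double
cubeRoot b = within 0.001 (iterate next b)
  where
    next x = (2 * x + b / x ^ 2) / 3
    within eps (x : x1 : rest)
      | abs (x - x1) <= eps = x1
      | otherwise           = within eps (x1 : rest)
    within _ _ = error "within: finite list"  -- unreachable with iterate
```

For example, cubeRoot 27 evaluates to approximately 3.0.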

Equivalently, the body can be written with the function composition operator (.):

cubeRoot b = (within 0.001 . iterate next) b

(Note that next still refers to b, so cubeRoot cannot be made fully point-free here.)

With this, you really have a pipeline like Unix pipelines.
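A small sketch of such a pipeline: a producer, three transducer stages, and a consumer, with laziness ensuring that only the first ten squares are ever computed:

```haskell
-- producer: [1 ..]; transducers: map, filter, take; consumer: sum.
pipeline :: Int
pipeline = sum (take 5 (filter even (map (^ 2) [1 ..])))
-- = 4 + 16 + 36 + 64 + 100 = 220

-- The same pipeline with (.), read right to left like a Unix pipe:
pipeline' :: Int
pipeline' = (sum . take 5 . filter even . map (^ 2)) [1 ..]
```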

Singly-linked lists are a very space-consuming data structure (in all languages). And if you ask for "the ith item", you're doing it wrong.

However, if you use lists lazily in Haskell, they make an excellent control structure: a better for-loop than for-loops. List-processing functions then become pipeline stages. If you do it carefully, it is even O(1)-space. If furthermore you're lucky (i.e., the compiler can optimize your code), it can even fit entirely in registers, with no node allocation or GC overhead.

Thinking in high-level pipeline stages is both more sane and more efficient—with the right languages.

Some very notable list functions when you use lists lazily as for-loops, or when you think in terms of pipeline stages:

Producers: repeat, cycle, replicate, iterate, unfoldr, the [x..], [x..y] notation (backed by enumFrom, enumFromTo)

Transducers: map, filter, scanl, scanr (foldr too, sometimes), take, drop, splitAt, takeWhile, dropWhile, span, break, partition, zip, zipWith, unzip

Consumers: foldr, foldl, foldl', length, sum, product, maximum, minimum, and, all, or, any

Sometimes you have to custom-make your own, of course.

And don't forget that list comprehension helps a lot too.
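For instance, a comprehension packs a producer, generators, and a filter into one expression (a small sketch, not from the notes above):

```haskell
-- All Pythagorean triples with hypotenuse at most 20.
triples :: [(Int, Int, Int)]
triples =
  [ (a, b, c)
  | c <- [1 .. 20]           -- producer
  , b <- [1 .. c]
  , a <- [1 .. b]
  , a * a + b * b == c * c   -- filter
  ]
```

GHCi gives six triples, the first being (3,4,5).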

When lazy evaluation hurts

Define mySumV2 to sum a list using a helper function as a while-loop and an accumulator parameter:

mySumV2 xs = g 0 xs
  where
    g accum [] = accum
    g accum (x:xs) = g (accum + x) xs
in HaskellEval.hs

Evaluation of mySumV2 [1,2,3] (similarly foldl (+) 0 [1,2,3]):

  mySumV2 (1 : 2 : 3 : [])    plug in
→ g 0 (1 : 2 : 3 : [])        match, plug in
→ g (0 + 1) (2 : 3 : [])      match, plug in
→ g ((0 + 1) + 2) (3 : [])    match, plug in
→ g (((0 + 1) + 2) + 3) []    match, plug in
→ ((0 + 1) + 2) + 3           arithmetic at last
→ (1 + 2) + 3                 ditto
→ 3 + 3                       ditto
→ 6

This takes Ω(n) space for the postponed arithmetic.

Exercise: Go through the evaluation of mySumV1 [1,2,3] (defined below) or foldr (+) 0 [1,2,3] to see how it also takes Ω(n) space, though through a different journey.

mySumV1 [] = 0
mySumV1 (x:xs) = x + mySumV1 xs
in HaskellEval.hs

Summing a list does not need lazy evaluation for the accumulator. There is “seq” for killing lazy evaluation where you deem it unsuitable.

To evaluate “seq x y”: evaluate x to “weak head normal form”, then continue with y.

Weak head normal form (WHNF) means: evaluated until the outermost part is a data constructor (whose fields may still be unevaluated thunks), a lambda, or a primitive value such as a number. Nothing underneath that outermost layer needs to be evaluated.

Naturally, “seq x y” is most meaningful when x is something that y will need.
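A few sketches of what evaluation to WHNF does and does not force (using undefined to stand in for an erroring computation):

```haskell
-- seq stops at the outermost constructor: Just is already in WHNF,
-- so the undefined field is never touched.
shallow :: Int
shallow = seq (Just undefined :: Maybe Int) 5   -- 5

-- 1 + 2 has no outermost constructor yet, so seq computes it to 3,
-- then continues with the second argument.
forced :: Int
forced = seq (1 + 2 :: Int) 5                   -- 5

-- seq undefined 5, by contrast, raises the error:
-- undefined never reaches WHNF.
```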

Here is a summing algorithm using seq to kill accumulator laziness:

mySumV3 xs = g 0 xs
  where
    g accum [] = accum
    g accum (x:xs) = seq accum (g (accum + x) xs)
    -- Note: accum is something that "g (accum + x) xs" will need.

    -- Alternative:
    -- g accum (x:xs) = let a1 = accum + x
    --                  in seq a1 (g a1 xs)
    -- Note: a1=accum+x is something that "g a1 xs" will need.
in HaskellEval.hs

Data.List has foldl' for this as well. Its definition is essentially:

foldl' f z [] = z
foldl' f z (x:xs) = let z' = f z x in seq z' (foldl' f z' xs)

Note it forces the new accumulator before recursing, like the alternative version of mySumV3.
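A usage sketch: with foldl' the accumulator is forced at every step, so summing a long list runs in constant space rather than piling up postponed additions:

```haskell
import Data.List (foldl')

-- Strict left fold: no Omega(n) chain of thunks for the accumulator.
bigSum :: Integer
bigSum = foldl' (+) 0 [1 .. 1000000]
-- = 1000000 * 1000001 / 2 = 500000500000
```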