Learning the conditional probability table
Naïve: Just observe a lot of strings and set the conditional probabilities
equal to the observed relative frequencies.
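Written out, the naïve estimate is a ratio of counts. The notation here is an assumption, not taken from the slide: N(a, b) is the number of times symbol b immediately follows symbol a in the observed strings.

```latex
\hat{P}(b \mid a) = \frac{N(a, b)}{\sum_{c} N(a, c)}
```

Any transition that never appears in the data has N(a, b) = 0, so its estimated probability is exactly zero.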
But do we really believe it if we get a zero?
Better: add 1 to the count on top and the number of symbols to the total on
the bottom. This is like having a weak uniform prior over the transition
probabilities.
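A minimal sketch of the add-one estimate, assuming the table is for a first-order model in which each symbol is conditioned on the symbol before it. The function name `smoothed_transition_table` and the toy strings are hypothetical, chosen only for illustration.

```python
from collections import Counter


def smoothed_transition_table(strings, alphabet):
    """Estimate P(next | prev) with add-one smoothing (illustrative sketch)."""
    pair_counts = Counter()     # times `nxt` follows `prev`
    context_counts = Counter()  # times `prev` is followed by anything
    for s in strings:
        for prev, nxt in zip(s, s[1:]):
            pair_counts[(prev, nxt)] += 1
            context_counts[prev] += 1

    k = len(alphabet)  # number of symbols, added to the bottom
    table = {}
    for prev in alphabet:
        denom = context_counts[prev] + k
        table[prev] = {
            nxt: (pair_counts[(prev, nxt)] + 1) / denom  # add 1 to the top
            for nxt in alphabet
        }
    return table


# Toy data: 'a' is followed by 'a' once and by 'b' twice; 'b' is never followed by anything.
table = smoothed_transition_table(["aab", "ab"], "ab")
print(table["a"])  # smoothed row for context 'a'
print(table["b"])  # unseen context falls back to the uniform distribution
```

On this toy data the naïve estimate for 'a' followed by 'a' would be 1/3, while the smoothed estimate is (1 + 1) / (3 + 2) = 0.4. The context 'b' has no observations at all, so its row is uniform, which is the weak uniform prior showing through.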