The probability of generating a visible
sequence from an HMM
The same visible sequence can be produced by
many different hidden sequences
This is just like the fact that the same datapoint
could have been produced by many different
Gaussians when we are doing clustering.
But there are exponentially many possible
hidden sequences.
It seems hard to figure out