next up previous
Next: Unigram Algorithms Up: SEGMENTATION Previous: Probabilistic Language Models

Unigram Idea

Imagine that a sentence is produced by choosing a random word, or ``.'' from a particular distribution.

Continue generating random words until a period is chosen (probability 0.1).

Picture. Expected sentence length? Mostly likely sentence? Total probability over all sentences?



Guangwei Yuan
12/4/1999