next up previous
Next: Unigram Idea Up: SEGMENTATION Previous: Problems with Path Approach

Probabilistic Language Models

A popular idea in computational linguistics is to create a probabilistic model of language. Such a model assigns a probability to every sentence in English in such a way that more likely sentences (in some sense) get higher probability. If you are unsure between two possible sentences, pick the higher probability one.

Comment: A ``perfect'' language model is only attainable with true intelligence. However, approximate language models are often easy to create and good enough for many applications.

Some models:

We'll use unigrams.



Guangwei Yuan
12/4/1999