Natural language processing keywords, what is add-1 smoothing, what is Laplace smoothing, explain add-1 smoothing with an example, unigram and bi-gram with add-1 laplace smoothing

Add-1 (Laplace) smoothing

We have used Maximum Likelihood Estimation (MLE) to train the parameters of an N-gram model. The problem with MLE is that it assigns zero probability to unknown (unseen) words. This is because MLE estimates probabilities from counts in a training corpus. If a word in the test set does not occur in the training set, its count is zero, and so its estimated probability is zero.

To eliminate these zero probabilities, we can apply smoothing. Smoothing takes some probability mass from the events seen in training and assigns it to unseen events. Add-1 smoothing (also called Laplace smoothing) is a simple smoothing technique that adds 1 to the count of every n-gram, seen or unseen, before normalizing the counts into probabilities.

Example:

Recall that the unigram and bigram probabilities for a word w are calculated as follows:

P(w) = C(w) / N

P(w_n | w_{n-1}) = C(w_{n-1} w_n) / C(w_{n-1})

where P(w) is the unigram probability, P(w_n | w_{n-1}) is the bigram probability, C(w) is the number of occurrences of w in the training set, C(w_{n-1} w_n) is the count of the bigram (w_{n-1} w_n) in the training set, and N is the total number of word tokens in the training set.
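To make the zero-probability problem concrete, here is a minimal Python sketch of the two MLE formulas. The toy corpus and the function names mle_unigram and mle_bigram are made up for illustration; they are not from any particular library.

```python
from collections import Counter

# Hypothetical toy training corpus (an assumption for illustration).
tokens = "the cat sat on the mat".split()

unigram_counts = Counter(tokens)                  # C(w)
bigram_counts = Counter(zip(tokens, tokens[1:]))  # C(w_{n-1} w_n)
N = len(tokens)                                   # total word tokens

def mle_unigram(w):
    """P(w) = C(w) / N"""
    return unigram_counts[w] / N

def mle_bigram(prev, w):
    """P(w | prev) = C(prev w) / C(prev); 0.0 if prev was never seen."""
    if unigram_counts[prev] == 0:
        return 0.0
    return bigram_counts[(prev, w)] / unigram_counts[prev]

print(mle_unigram("the"))        # C(the)/N = 2/6
print(mle_unigram("dog"))        # unseen word: 0/6 = 0.0
print(mle_bigram("the", "cat"))  # C(the cat)/C(the) = 1/2
print(mle_bigram("the", "dog"))  # unseen bigram: 0/2 = 0.0
```

The unseen word "dog" and the unseen bigram "the dog" both receive probability zero, which is exactly the problem smoothing is meant to address.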

Add-1 smoothing for unigrams

P_Laplace(w) = (C(w) + 1) / (N + |V|)

Here, N is the total number of tokens in the training set and |V| is the size of the vocabulary, that is, the number of unique words in the training set.

Because we add 1 to the count of each of the |V| words, the counts now sum to N + |V| rather than N, so we add |V| to the denominator to keep the probabilities summing to 1.
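As a rough sketch of the unigram formula, the following Python snippet applies add-1 smoothing to a small made-up corpus. The corpus and the function name laplace_unigram are assumptions for illustration, and |V| is taken to be the number of unique training words, as defined above.

```python
from collections import Counter

# Hypothetical toy training corpus (an assumption for illustration).
tokens = "the cat sat on the mat".split()

counts = Counter(tokens)  # C(w)
N = len(tokens)           # total tokens: 6
V = len(counts)           # unique training words |V|: 5

def laplace_unigram(w):
    """P_Laplace(w) = (C(w) + 1) / (N + |V|)"""
    return (counts[w] + 1) / (N + V)

print(laplace_unigram("the"))  # (2 + 1) / (6 + 5) = 3/11
print(laplace_unigram("cat"))  # (1 + 1) / (6 + 5) = 2/11
print(laplace_unigram("dog"))  # unseen: (0 + 1) / (6 + 5) = 1/11

# The smoothed probabilities of the training vocabulary still sum to 1.
total = sum(laplace_unigram(w) for w in counts)
print(total)
```

Compared with MLE, the seen word "the" drops from 2/6 to 3/11, and that freed probability mass is what lets an unseen word such as "dog" receive a non-zero estimate.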

Add-1 smoothing for bigrams

P_Laplace(w_n | w_{n-1}) = (C(w_{n-1} w_n) + 1) / (C(w_{n-1}) + |V|)
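The bigram case can be sketched the same way. The toy corpus and the function name laplace_bigram below are assumptions for illustration, with |V| again taken as the number of unique words in the training set.

```python
from collections import Counter

# Hypothetical toy training corpus (an assumption for illustration).
tokens = "the cat sat on the mat".split()

unigram_counts = Counter(tokens)                  # C(w_{n-1})
bigram_counts = Counter(zip(tokens, tokens[1:]))  # C(w_{n-1} w_n)
V = len(unigram_counts)                           # vocabulary size |V|: 5

def laplace_bigram(prev, w):
    """P_Laplace(w | prev) = (C(prev w) + 1) / (C(prev) + |V|)"""
    return (bigram_counts[(prev, w)] + 1) / (unigram_counts[prev] + V)

print(laplace_bigram("the", "cat"))  # (1 + 1) / (2 + 5) = 2/7
print(laplace_bigram("the", "sat"))  # unseen bigram: (0 + 1) / (2 + 5) = 1/7

# For a fixed history, the probabilities over the training vocabulary sum to 1.
total = sum(laplace_bigram("the", w) for w in unigram_counts)
print(total)
```

The bigram "the sat" never occurs in the toy corpus, yet it now gets probability 1/7 instead of zero, while the seen bigram "the cat" falls from its MLE value of 1/2 to 2/7.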


Saturday, October 17, 2020

Explain add-1 (Laplace) smoothing with an example
