How to understand this formula in the Lingpipe language model?

Question

How to understand this formula in the Lingpipe language model?

This is from the Lingpipe doc tutorial on building a language model. But I only partially understand the theory underlying it.

I especially do not know the basic probability.

enter image description here

Here's how to get the base p (d). If below - part of the token and their frequency in the unigram file.

ab 20 aba 3 abd 2 abef 2 abkk 3

Under such a condition, what are lamda (), 1-lamda (), extcount, numExtentions and Base P (ab)? This is one question, but they are connected by a chain.

Thank you very much.

+1

java nlp

Warren May 29 '12 at 11:04

source share

No one has answered this question yet.

See similar questions:

0

Separation of English words into graphemes corresponding to different sounds

or similar:

3799

How do I read / convert an InputStream to a string in Java?

3324

How to generate random integers in a specific range in Java?

3073

How to efficiently iterate over each entry on a Java map?

2853

How to convert String to int in Java?

2284

How can I fix the 'android.os.NetworkOnMainThreadException'?

2240

How to create an executable dependency JAR using Maven?

2171

How to determine if an array contains a specific value in Java?

2108

How can I name one constructor from another in Java?

1915

How to declare and initialize an array in Java?

1818

How to get enum value from string value in Java?

How to understand this formula in the Lingpipe language model?

More articles: