Latent Dirichlet Allocation (LDA) . , ( ), "". , /.
If you run LDA in your paragraph collection, then by looking at the similarity of the vector of hidden topics, you can find out if these two paragraphs are related or not.
Of course, the baseline is to not use LDA and instead use the term frequency (supplemented by tf / idf) to measure similarity (vector space model).
source
share