|
计算机科学 2008
Topic-based Probabilistic Document Correlation Model
|
Abstract:
Existing models on document relationship analysis have a difficulty in learning document correlation from topic level.To overcome this difficulty,a topic-based probabilistic document correlation model(TPDC)was proposed.The model learns the topic structure of a document through the latent dirichlet allocation model,infers the posterior probability of a document by computing the posterior probability of its topics and topic similarity,and then constructs the document correlation model based on the document po...