New computation of author's topic

daniel-acuna / reviewer_assignment

Reviewer assignment and scoring system based on topic models

http:/pr.scienceofscience.org

5 stars 4 forks source link

New computation of author's topic #41

Open daniel-acuna opened 9 years ago

daniel-acuna commented 9 years ago

Transform each of the abstracts of the authors through the Tf-Idf -> truncated SVD and then average. Now, it puts all abstracts together in a big text and does the transformation.

titipata commented 9 years ago

Sure, I will add the function to train tfidf and Truncated SVD.

daniel-acuna commented 9 years ago

Awesome. In this case, we need to keep the pipeline around for when we compute an author's "mean topic vector". Before, we were creating an imaginary abstract with all abstracts concatenated. Now, we won't need which can potentially be faster.