macks22 / dblp

Parse the dblp data into a structured format for experimentation.
MIT License
74 stars 22 forks source link

Repdocs Module Documentation #21

Open psombe opened 6 years ago

psombe commented 6 years ago

Could you please add descriptions for each file in the repdocs module. I'm trying to use this parser for my projects and am unclear what all the files contain and how they relate to each other. For example, the dictionary created using gensim.corpora has a different number of documents than the tfidf matrix created.