fdac15 / OSSFinder

Open Source Software Recommendation Engine

Just code without any explanation #4

Closed: inthesunset closed this issue 9 years ago

inthesunset commented 9 years ago

First, I downloaded some README.md files into the folder Readme. Second, I did some language parsing to clean the data. Third, I used word2vec to train a model. Our objective is to use the word2vec model to make the README files searchable, but it seems that word2vec doesn't provide a similarity calculation on documents. So I decided to use the second approach proposed by Dr. Mockus.
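For anyone reading along, here is a rough sketch of one common way to get a document-level similarity out of word vectors: average the vectors of the words in each README and compare the averages with cosine similarity. This is not necessarily the second approach mentioned above, which this comment does not describe. The names `tokenize`, `doc_vector`, `cosine`, `rank`, `TOY_VECTORS` and `TOY_READMES` are all made up for illustration, and the toy 2-dimensional vectors stand in for whatever a trained word2vec model would return.

```python
import math
import re


def tokenize(text):
    """Lowercase a README string and keep only alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def doc_vector(tokens, word_vectors):
    """Average the vectors of the tokens that have an embedding.

    Returns None when no token is in the vocabulary.
    """
    known = [word_vectors[t] for t in tokens if t in word_vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, readmes, word_vectors):
    """Return README names ordered from most to least similar to the query."""
    q = doc_vector(tokenize(query), word_vectors)
    if q is None:
        return []
    scored = []
    for name, text in readmes.items():
        d = doc_vector(tokenize(text), word_vectors)
        if d is not None:
            scored.append((cosine(q, d), name))
    return [name for _, name in sorted(scored, reverse=True)]


# Hypothetical toy data: a real run would take vectors from the trained model
# and texts from the cleaned files in the Readme folder.
TOY_VECTORS = {
    "parser": [1.0, 0.0],
    "json": [0.9, 0.1],
    "game": [0.0, 1.0],
    "engine": [0.1, 0.9],
}
TOY_READMES = {
    "json-tool": "A fast JSON parser.",
    "fun-game": "A 2D game engine!",
}

if __name__ == "__main__":
    print(rank("json parser", TOY_READMES, TOY_VECTORS))
```

Averaging throws away word order and treats every word as equally important, so it is only a baseline; weighting words (for example by TF-IDF) before averaging is a common refinement.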