TheDataLeek / Python-LSA

Performing Latent Semantic Analysis with Python on large datasets.
Other
13 stars 7 forks source link

How to run the script ? #21

Open hangingman opened 4 years ago

hangingman commented 4 years ago

I found this project in StackOverflow post https://stackoverflow.com/a/5544583/2565527

But, I can't run the script; document_analysis.py. Because of lacking documents to prepare environments.

It seems this project need to have more documents. In my perspective, following explanation is required.

TheDataLeek commented 4 years ago

Hi! Thanks for the interest in the project, I honestly haven't touched this since college. Let me dig through and see if I can get some answers for you.

I will say that this project was pretty custom-tailored to a specific dataset/project, and as a result it's not a one-size-fits-all solution to your personal LSA problem. If you're looking for generic NLP resources I recommend gensim and nltk as more flexible libraries.

As for the original purpose of this project, it was to easily search through a list of recipes and return not-perfect matching recipes. For instance, typing in "soup" could return a recipe for "chicken noodle broth" even though soup doesn't appear anywhere in the recipe/title.

hangingman commented 4 years ago

Thank you for your explanation. I leave this issue for another people having the same wondering.