saltudelft / type4py

Type4Py: Deep Similarity Learning-Based Type Inference for Python
Apache License 2.0

Cannot preprocess ManyTypes4Py dataset #15

Closed (p-fruck closed this issue 1 year ago)

p-fruck commented 1 year ago

Hey there,

I am currently trying to get this project up and running and was following the instructions to train the model using the ManyTypes4Py dataset. Unfortunately, the preprocess command just skips the dataset (or rather, does not find any relevant information). I solved this issue by removing the files all_fns.csv and all_vars.csv and symlinking processed_projects_complete to processed_projects.

Did I miss anything during the setup? Are those steps expected, and if so, should they be added to the documentation?

mir-am commented 1 year ago

Hi @p-fruck,

Thanks for using Type4Py. Removing the mentioned files and renaming processed_projects_complete to processed_projects solves the issue. I need to push a hotfix for the file name differences.
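For anyone on a checkout that predates the hotfix, the workaround described above can be sketched roughly as follows. This is only an illustration, not part of Type4Py: the helper name `apply_workaround` is hypothetical, and it assumes that all_fns.csv, all_vars.csv and processed_projects_complete all sit directly in the dataset directory you pass in, which the thread does not state explicitly.

```python
from pathlib import Path


def apply_workaround(dataset_dir: str) -> None:
    """Hypothetical helper mirroring the manual steps from this thread.

    Assumption: the two CSV files and processed_projects_complete live
    directly under dataset_dir.
    """
    root = Path(dataset_dir)

    # Step 1: remove the two CSV files mentioned in the issue, if present.
    for name in ("all_fns.csv", "all_vars.csv"):
        (root / name).unlink(missing_ok=True)  # missing_ok needs Python 3.8+

    # Step 2: expose processed_projects_complete under the name
    # processed_projects via a relative symlink (renaming works as well).
    src = root / "processed_projects_complete"
    dst = root / "processed_projects"
    if src.is_dir() and not dst.exists() and not dst.is_symlink():
        dst.symlink_to(src.name, target_is_directory=True)
```

Calling `apply_workaround("/path/to/ManyTypes4Py")` (the path is a placeholder) before running the preprocess command should reproduce the manual steps. A symlink is used here instead of a rename so that the original directory name stays intact; the function does nothing on a second call.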

mir-am commented 1 year ago

The issue is now fixed in 47b5397.