facebookresearch / cc_net

Tools to download and cleanup Common Crawl data
MIT License
932 stars 138 forks source link

Add info about prerequisites on Ubuntu #10

Open leogao2 opened 3 years ago

facebook-github-bot commented 3 years ago

Hi @leogao2!

Thank you for your pull request and welcome to our community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file.

In order for us to review and merge your code, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

gwenzek commented 3 years ago

Hi Leo, sorry for the very late review. Could you explain (in this issue) why we need those dependencies: libboost-all-dev cmake libeigen3-dev liblzma-dev ? Is it because of getpy, or kenlm ?

Thanks.