soskek / bookcorpus

Crawl BookCorpus
MIT License
812 stars 110 forks source link

Sort by author #15

Closed bakszero closed 5 years ago

bakszero commented 5 years ago

Is it possible to sort the downloaded files author-wise here? Thanks!

soskek commented 5 years ago

Thank you for using my code. It is easy.

https://github.com/soskek/bookcorpus/blob/a1413aee7969300d278cba3080695875bb511fbe/download_files.py#L72-L85

This is the part to define the path where the text file is saved. And, we can use the author information from data['author']. Please exploit it in your path!

bakszero commented 5 years ago

Thanks for the response!