MusicConnectionMachine / UnstructuredData

In this project we will be scanning unstructured online resources such as the common crawl data set
GNU General Public License v3.0
3 stars 1 forks source link

Timeout during download kills process #223

Open felixschorer opened 7 years ago

felixschorer commented 7 years ago

When a timeout occurs during the download, the process dies unexpectedly. Some uncatched error kills the process, probably originating from zlib or the warc parser, as a consequence of the timeout.