Closed misialq closed 2 years ago
I encountered the same error while running two large get-metadata
fetches and they exited with the RuntimeError
at the 375th and 280th batch, respectively. So at around 90k run IDs no new threads could be started.
I tried a quick fix by simply deleting the efetcher object that launches the threads in each batch - it did not reduce the amount of threads that are launched.
When I'm trying to fetch metadata for ~40k run IDs, at some point during the fetch I get the following error:
As I ran this in a Jupyter notebook, that same cell was executed maybe twice (there was some network issue the first time) so to reproduce this it would probably be required to use a much higher number of IDs (for the same reason I cannot provide an exact instruction as to how to reproduce this behaviour...).
It would seem though that on every iteration (through retstart in EFetch) a new thread is being spawned and not cleaned up properly - if there are enough batches, the process reaches the maximum allowed count of threads (here it was 4096) and exits with the above-mentioned error.