deanmalmgren / textract

extract text from any document. no muss. no fuss.
http://textract.readthedocs.io
MIT License
3.89k stars 599 forks source link

unsafe for multiprocessing? #440

Open chapmanjacobd opened 2 years ago

chapmanjacobd commented 2 years ago

Describe the bug I don't think the library is safe for use with multiprocessing? Something is creating temporary pict001.png, pict001.wmf, and other similarly named files in the CWD. Sure it cleans up after itself but if two of the same program runs then they might overwrite the temp files (depending on how the dependency implemented temp file naming, but most likely, if the program is doing this, I would assume it is not going to safely run)

Expected behavior I'm not sure what dependency is doing it but whichever one is doing it should probably run in a temporary directory as the working directory.

Desktop (please complete the following information):