Request to process only new PDFs

cutright / IMRT-QA-Data-Miner

Scans a directory for IMRT QA results

MIT License


Request to process only new PDFs #17

Open cutright opened 4 years ago

cutright commented 4 years ago

Feature request to ignore previously processed PDFs

cutright commented 4 years ago

main.process_files() in branch issue_17 now skips previously processed files. Collecting the previously processed files is fast, but the bottleneck appears to be iterating through the OS directory rather than parsing the data. Alternatively, the time may be spent checking whether each file name exists among the previously processed files.

Needs investigation.
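
One way to investigate is to time the directory walk and the membership checks separately. The sketch below is only an illustration, not the code in main.process_files(): the function names find_new_pdfs and time_stages are hypothetical, and it assumes the previously processed files are stored as full paths. It converts that collection to a set so each lookup is constant time on average, which should rule out the lookup as the slow step if a list was used before.

```python
import os
import time


def find_new_pdfs(root_dir, processed):
    """Return PDF paths under root_dir that are not in `processed`.

    Hypothetical helper: `processed` is assumed to hold full file paths.
    It is converted to a set so each membership check is O(1) on average,
    instead of O(n) for a list.
    """
    processed_set = set(processed)
    new_files = []
    for dirpath, _dirnames, filenames in os.walk(root_dir):
        for name in filenames:
            if not name.lower().endswith(".pdf"):
                continue
            path = os.path.join(dirpath, name)
            if path not in processed_set:
                new_files.append(path)
    return new_files


def time_stages(root_dir, processed):
    """Time the directory walk and the membership checks separately.

    Returns (walk_seconds, lookup_seconds, new_files).
    """
    # Stage 1: walk the directory tree and collect every PDF path.
    start = time.perf_counter()
    all_pdfs = [
        os.path.join(dirpath, name)
        for dirpath, _dirnames, filenames in os.walk(root_dir)
        for name in filenames
        if name.lower().endswith(".pdf")
    ]
    walk_seconds = time.perf_counter() - start

    # Stage 2: filter out paths that were already processed.
    processed_set = set(processed)
    start = time.perf_counter()
    new_files = [p for p in all_pdfs if p not in processed_set]
    lookup_seconds = time.perf_counter() - start

    return walk_seconds, lookup_seconds, new_files
```

Calling time_stages() on the QA directory with the stored list of processed paths would show which of the two stages dominates. If the walk itself is the slow part, the set lookup will not help much and the directory traversal needs a closer look.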