nextstrain / augur

Pipeline components for real-time phylodynamic analysis
https://docs.nextstrain.org/projects/augur/
GNU Affero General Public License v3.0
268 stars 128 forks source link

merge: Import metadata files concurrently? #1617

Open tsibley opened 2 months ago

tsibley commented 2 months ago

General idea is to concurrently import metadata files into separate SQLite dbs, then ATTACH them to export merged metadata.

I whipped up a working prototype and pushed it to trs/merge/concurrent-import. On a trivial two file, three record input set, this alone cuts off 1.5s of wall clock time, because of the slow slow slow augur start up time being paid concurrently instead of serially for the two input files passed thru augur read-file.