invinst / invisible-flow

[#84][Kyle] Improve loading strategy #91

Closed KyleDDolezal closed 4 years ago

KyleDDolezal commented 4 years ago

Increase efficiency of database loading

Context: The COPA data being scraped contains a large number of rows, and every scrape must check the incoming rows against the rows already in our database to avoid inserting duplicates. As the database grows, the cost of this check grows with it, so the efficiency of the loading strategy becomes increasingly important.

Problem: The current loader design does not take efficiency into account.

Solution: Use benchmark tests to compare candidate loading strategies. See commits 3fc1106 and fba8524.
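
For illustration only, here is a minimal sketch of what such a benchmark could look like. It is not the code from the commits above. It assumes rows are plain dicts identified by a hypothetical `log_no` key and compares two in-memory duplicate checks, one that scans a list per row and one that uses a set. All function names here are made up for the example, and a real benchmark would run against the actual database.

```python
import timeit


def load_naive(existing_ids, scraped_rows):
    """Keep rows whose ID is not in a list of existing IDs (linear scan per row)."""
    new_rows = []
    for row in scraped_rows:
        if row["log_no"] not in existing_ids:  # "log_no" is a hypothetical key
            new_rows.append(row)
    return new_rows


def load_with_set(existing_ids, scraped_rows):
    """Same result, but build a set once so each membership check is O(1) on average."""
    existing = set(existing_ids)
    return [row for row in scraped_rows if row["log_no"] not in existing]


def make_data(n_existing, n_scraped):
    """Build synthetic data where half of the scraped rows already exist."""
    existing_ids = list(range(n_existing))
    start = n_existing - n_scraped // 2
    scraped_rows = [{"log_no": i} for i in range(start, start + n_scraped)]
    return existing_ids, scraped_rows


def benchmark(n_existing=5000, n_scraped=1000, repeat=3):
    """Return the best wall-clock time in seconds for each strategy."""
    existing_ids, scraped_rows = make_data(n_existing, n_scraped)
    results = {}
    for name, fn in (("naive", load_naive), ("set", load_with_set)):
        timings = timeit.repeat(
            lambda fn=fn: fn(existing_ids, scraped_rows),
            number=1,
            repeat=repeat,
        )
        results[name] = min(timings)
    return results


if __name__ == "__main__":
    for name, seconds in benchmark().items():
        print(f"{name}: {seconds:.4f}s")
```

Taking the minimum over several repeats reduces noise from other processes. The absolute numbers depend on the machine, so only the relative difference between strategies is meaningful.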