nextstrain / fauna

RethinkDB database to support real-time virus analysis
GNU Affero General Public License v3.0
33 stars 13 forks source link

Allow filtering of tdb downloads #47

Closed trvrb closed 7 years ago

trvrb commented 7 years ago

In vdb, we have a select command line argument:

https://github.com/nextstrain/fauna/tree/master/vdb#commands-1

that subsets download to just specific fields at a certain values eg. --select field1:value1 field2:value1,value2. We definitely want the ability to filter tdb downloads to

  1. Subset on data source:
    • just CDC titers
  2. Subset on assay type:
    • just HI assays
    • just FRA assays

Ie. something like: python tdb/download.py -db tdb -v flu --subtype h3n2 --select assay_type:HI source:CDC. Make it generic, rather than specifically tailored to source and assay_type.