nextstrain / fauna

RethinkDB database to support real-time virus analysis
GNU Affero General Public License v3.0
33 stars 13 forks source link

adding syntax to check for cattle sequences and add cattle as a disti… #154

Closed lmoncla closed 7 months ago

lmoncla commented 7 months ago

Description of proposed changes

This is a small pull request to make cattle a separate host category during fauna upload. This was motivated by the recent cattle outbreaks of H5N1 in the US. Currently, host species are grouped into avian, human, nonhuman mammal, environment, and other. cattle will now be its own category.

The changes include adding a list of cattle-related species, and adding a small test to query entries with nonhuman mammal annotations to find those with strain names that indicate that they are derived from cattle.

Checklist

I tested this with the original dairy cattle sequences from Texas, which were annotated with strain names including dairy cattle, but with the host field annotated as Other Mammals. The changes worked as expected.