dashboard: start breaking out per-bioproject counts

As we add more data, the dashboard is getting slower to load. I want to fix this by showing only a subset of the data by default. To make it possible to load data for specific bioprojects only, this PR breaks comparison_sample_counts and human_virus_sample_counts out by bioproject.
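For reviewers who want a mental model of the split, here is a minimal sketch, not the code in dashboard/prepare-dashboard-data.py. It assumes the counts are a nested mapping of `{key: {sample: count}}` and that a `sample_to_bioproject` lookup is available. Those shapes, the function names, and the `<name>.<bioproject>.json` file naming are all hypothetical and chosen only for illustration.

```python
import json
import os
from collections import defaultdict


def split_by_bioproject(counts, sample_to_bioproject):
    """Regroup {key: {sample: count}} as {bioproject: {key: {sample: count}}}.

    Both input shapes are assumptions made for this sketch.
    """
    per_bioproject = defaultdict(lambda: defaultdict(dict))
    for key, sample_counts in counts.items():
        for sample, count in sample_counts.items():
            bioproject = sample_to_bioproject[sample]
            per_bioproject[bioproject][key][sample] = count
    # Convert the nested defaultdicts to plain dicts before returning.
    return {bp: dict(by_key) for bp, by_key in per_bioproject.items()}


def write_per_bioproject(name, counts, sample_to_bioproject, out_dir):
    """Write one JSON file per bioproject and return the paths written.

    The "<name>.<bioproject>.json" naming is hypothetical.
    """
    paths = []
    split = split_by_bioproject(counts, sample_to_bioproject)
    for bioproject, subset in sorted(split.items()):
        path = os.path.join(out_dir, f"{name}.{bioproject}.json")
        with open(path, "w") as outf:
            json.dump(subset, outf, sort_keys=True)
        paths.append(path)
    return paths
```

With files split this way, a client could fetch only the files for the bioprojects it displays by default instead of one combined file.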

Note that this PR doesn't include updating the dashboard to read these files; that's coming later.

Only the changes in dashboard/prepare-dashboard-data.py need manual review; the other changes are the data output it produces.

naobservatory / mgs-pipeline

dashboard: start breaking out per-bioproject counts #40