naobservatory / mgs-pipeline

MIT License
4 stars 2 forks source link

dashboard: start breaking out per-bioproject counts #40

Closed jeffkaufman closed 7 months ago

jeffkaufman commented 7 months ago

As we add more and more data, the dashboard is getting slow to load. I want to fix this by showing only a subset of the data by default. To enable loading the data only for specific bioprojects, break comparison_sample_counts and human_virus_sample_counts out by bioproject.

Note that this PR doesn't include updating the dashboard to read these files; that's coming later.

Only the changes in dashboard/prepare-dashboard-data.py need manual review: the other changes are the data output.