virusseq / portal-ui

Canadian VirusSeq Data Portal
https://virusseq-dataportal.ca/
GNU Affero General Public License v3.0

possible duplicates in arranger downloads #392

Closed scottcain closed 11 months ago

scottcain commented 1 year ago

From Chanchal:

When the metadata is downloaded for all the samples belonging to a Study ID (e.g. EH-NL), the downloaded file contains many duplicates, resulting in a much higher sample count than what's on the portal. Also, Nithu just noticed that when we try to download both the metadata and sequences for a specific Study ID, the download includes all the samples in the portal, not just those for the selected Study ID.

nithujohn commented 1 year ago

Hi @scottcain, adding a comment: I downloaded samples for another study ID, from Nova Scotia, and found the same error with duplicate samples. It looks like the issue is not limited to one study ID but affects all the samples.

scottcain commented 1 year ago

@nithujohn and Chanchal, I think it may be fixed in production; there was a problem with the software that powers searching and filtering, but we just updated it with a change that I think will fix it. Please let me know.

justincorrigible commented 1 year ago

This issue is still present in prod, and we're looking into it, expecting to have a fix before EOD.

The one resolved yesterday was for the study selection not filtering the downloads (i.e. selecting a study would still produce a bundle with all the records from all studies).
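
For anyone who wants to check both symptoms against a downloaded metadata file, here is a minimal sketch. It assumes the download is tab-separated and has one column identifying the sample and one identifying the study. The column names `sample_id` and `study_id`, the function name `summarize_download`, and the sample rows are all hypothetical and should be adjusted to match the real file.

```python
import csv
import io
from collections import Counter


def summarize_download(tsv_text, id_column="sample_id", study_column="study_id"):
    """Summarize a metadata download: row count, unique samples,
    duplicated sample IDs, and the set of studies present.

    The column names are assumptions, not the portal's actual headers.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    id_counts = Counter()
    studies = set()
    for row in reader:
        id_counts[row[id_column]] += 1
        studies.add(row[study_column])
    # Any sample ID seen more than once is a duplicate row.
    duplicates = {sid: n for sid, n in id_counts.items() if n > 1}
    return {
        "rows": sum(id_counts.values()),
        "unique_samples": len(id_counts),
        "duplicates": duplicates,
        "studies": sorted(studies),
    }


# Made-up rows for illustration only: sample "A" appears twice,
# and two different studies are present in the file.
example = "sample_id\tstudy_id\nA\tS1\nA\tS1\nB\tS2\n"
print(summarize_download(example))
```

If `rows` is larger than `unique_samples`, the file contains duplicates. If `studies` lists more than the study selected in the portal, the study filter was not applied to the download.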

justincorrigible commented 1 year ago

Changes deployed to prod. The issue has been resolved as far as we can tell from our end, but we'll rely on your approval before closing this ticket.

leoraba commented 11 months ago

Closing ticket due to inactivity.