openproblems-bio / openproblems

Formalizing and benchmarking open problems in single-cell genomics
MIT License

add new dataloaders #792

Closed danielStrobl closed 1 year ago

danielStrobl commented 1 year ago

Submission type

Testing

Submission guidelines

PR review checklist

This PR will be evaluated on the basis of the following checks:

scottgigante-immunai commented 1 year ago

Not yet passing tests, converting back to draft

scottgigante-immunai commented 1 year ago

Tests passing: https://tower.nf/orgs/openproblems-bio/workspaces/openproblems-bio/watch/247vkt93iKumfu

LuckyMD commented 1 year ago

@danielStrobl why is it called "Lung (by batch)"? Also, the other data loader is still missing a note that the batch variable is already correctly named.

danielStrobl commented 1 year ago

That name came from another dataloader that I copied. It is already changed in the most recent commit, as is the comment for the batch column.

danielStrobl commented 1 year ago

@scottgigante-immunai scprep can't download files >3 GB. I think this could be fixed by updating to urllib3>=1.26.11, see here
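
A minimal sketch for checking whether the installed urllib3 meets the suggested minimum of 1.26.11. The helper names (`parse_version`, `meets_minimum`, `installed_urllib3_version`) are hypothetical and not part of scprep or openproblems. The check only reports the version; it does not establish that the upgrade resolves the download failure.

```python
from importlib import metadata

# Minimum version suggested in the comment above; an assumption, not a confirmed fix.
MINIMUM = "1.26.11"


def parse_version(text):
    """Turn '1.26.11' into (1, 26, 11), dropping suffixes such as 'rc1'."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum=MINIMUM):
    """Compare numerically, so that 1.26.9 sorts below 1.26.11."""
    return parse_version(installed) >= parse_version(minimum)


def installed_urllib3_version():
    """Return the installed urllib3 version string, or None if it is absent."""
    try:
        return metadata.version("urllib3")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_urllib3_version()
    if found is None:
        print("urllib3 is not installed")
    elif meets_minimum(found):
        print(f"urllib3 {found} satisfies >= {MINIMUM}")
    else:
        print(f"urllib3 {found} is older than {MINIMUM}")
```

The comparison uses integer tuples rather than strings because a plain string comparison would rank "1.26.9" above "1.26.11".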

scottgigante-immunai commented 1 year ago

Working on this now

scottgigante-immunai commented 1 year ago

@danielStrobl doesn't look like that worked?

danielStrobl commented 1 year ago

Still the same error. Do you think it makes sense to split up this PR and leave out the immune cell human mouse dataset for now?

scottgigante-immunai commented 1 year ago

Works for me

scottgigante-immunai commented 1 year ago

Tests at https://tower.nf/orgs/openproblems-bio/workspaces/openproblems-bio/watch/2NnhA8I7V2Xqbi