xingjianleng / DBGA

The repository for the genome sequence alignment research project
BSD 3-Clause "New" or "Revised" License

CRASSPhage data set #26

Open GavinHuttley opened 1 year ago

GavinHuttley commented 1 year ago

Get a whole-genome data set with thousands of sequences. Can we align them all?

Post a link to a tarball containing all the data and metadata.

Vini2 commented 1 year ago

Paper link: https://www.nature.com/articles/s41467-021-21350-w

Data link: https://zenodo.org/record/4437596

I will go through the data and pull out the crAssphage genomes.
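One possible way to pull out the crAssphage genomes is sketched below. It assumes the sequences are available as FASTA text and that the relevant records mention "crAssphage" in their header lines. Neither assumption has been checked against the Zenodo archive, and the function names `read_fasta` and `select_by_keyword` are hypothetical helpers made up for this illustration.

```python
from typing import Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def select_by_keyword(
    records: Iterable[Tuple[str, str]], keyword: str = "crassphage"
) -> List[Tuple[str, str]]:
    """Keep records whose header contains the keyword (case-insensitive).

    Assumption: the header text identifies crAssphage genomes. The real
    data set may need a metadata table lookup instead.
    """
    key = keyword.lower()
    return [(h, s) for h, s in records if key in h.lower()]


# Small made-up example, not real data from the archive.
demo = [
    ">seq1 crAssphage isolate A",
    "ACGT",
    "GGCC",
    ">seq2 other phage",
    "TTTT",
    ">seq3 uncultured CrAssphage",
    "AAAA",
]
hits = select_by_keyword(read_fasta(demo))
```

For a real file, the same functions could be applied to an open file handle, for example `select_by_keyword(read_fasta(open(path)))`, where `path` points at whichever FASTA file the archive turns out to contain.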

Vini2 commented 1 year ago

Found more datasets for Ebola, HCV, HIV-1 and SARS-CoV-2.

Data link: https://github.com/niemasd/ViralMSA-Paper/tree/master/data

Paper link: https://academic.oup.com/bioinformatics/article/37/5/714/5894544