CRC-FONDA / A2-metagenome-snakemake

Collection of metagenomic read mapping workflows that have been tailored for different computational architectures.

Bovine data is protein sequence (MG-R) #2

Open eaasna opened 3 years ago

eaasna commented 3 years ago

The bovine data set contains protein sequences, but DREAM-Yara only maps DNA.

The bovine data set is unique and useful because its reference set is very small (200 MB) compared to other real data sets.
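
One way to catch this kind of mismatch before a mapper is run is to guess the alphabet of the input up front. The following is only a minimal sketch, not code from this repository: the function names `read_fasta_sequences` and `guess_alphabet`, the nucleotide letter set, and the 0.9 threshold are all assumptions made for illustration.

```python
from typing import Iterable, Iterator

# Letters treated as nucleotides (assumption: IUPAC ambiguity codes other than N are ignored).
NUCLEOTIDE_LETTERS = set("ACGTUN")


def read_fasta_sequences(lines: Iterable[str]) -> Iterator[str]:
    """Yield one concatenated sequence per FASTA record, skipping header lines."""
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if chunks:
                yield "".join(chunks)
                chunks = []
        else:
            chunks.append(line)
    if chunks:
        yield "".join(chunks)


def guess_alphabet(sequences: Iterable[str], threshold: float = 0.9) -> str:
    """Return 'dna' if at least `threshold` of all letters are nucleotide letters, else 'protein'."""
    total = 0
    nucleotide = 0
    for seq in sequences:
        for ch in seq.upper():
            if ch.isalpha():
                total += 1
                if ch in NUCLEOTIDE_LETTERS:
                    nucleotide += 1
    if total == 0:
        raise ValueError("no residues to inspect")
    return "dna" if nucleotide / total >= threshold else "protein"
```

A workflow could call `guess_alphabet(read_fasta_sequences(open(path)))` on a sample of the reference and fail early, or pick a different mapper, when the result is `"protein"`. The heuristic is deliberately crude: short or low-complexity protein sequences made up mostly of A, C, G, T and N would be misclassified.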

eaasna commented 3 years ago

Lambda is specifically tailored for protein sequences.

eaasna commented 3 years ago

I need to finish implementing my sliding window filter for protein sequences and combine it with Lambda. The unfinished protein sequence workflow lives in a separate branch, in the MG-R folder.