A two-way classifier to characterize metagenomes based on short and long read technologies
1
stars
1
forks
source link
De novo metagenomic marker pipeline
Pipeline
- Human Infant Microbiome Dataset ("Babybiome")
- /src/query.py
- Based on magicBLAST, a new RNAseq BLAST mapper
- /src/coverager.py & /scripts/test_coverager.sh
- Generation of BAMs with magicBLAST mapping to long reads (direct streaming from SRA)
- Building a histogram of read coverage
- Thresholding for uniform deep and broad coverage of long reads with short reads (indicator contigs)
- Using chi-squared test to check for uniformity
- Generating probability of long read in short read set
- Gen. Classifier
- Separation by physiological features
- Male-Female
- Delivery mode
- Probability of gene co-occurrence?