sebhtml / ray

Ray -- Parallel genome assemblies for parallel DNA sequencing
http://denovoassembler.sf.net
Other
65 stars 12 forks source link

Independent Kmer total in Taxons.xml #146

Open fredericraymond opened 11 years ago

fredericraymond commented 11 years ago

In addition to kmerOccurences, add kmerUnits or something like that in Taxons.xml.

fredericraymond commented 11 years ago

Also add the mean kmer depth of kmers or mode kmer depth.

fredericraymond commented 11 years ago

If it is possible, the total number of possible kmer within a taxon would be useful.

sebhtml commented 11 years ago

The current entry format:

Streptococcusgenus1418 Bacteriakingdom4018 Firmicutesphylum4045 Bacilliclass4659 Lactobacillalesorder55 Streptococcaceaefamily4364 Streptococcusgenus1418 501105710.8316420.831642 61125672 1.014451.014451

* numberOfKmers * modeKmerObservations * meanKmerObservations * maximumNumberOfKmers

* numberOfKmers * modeKmerObservations \ meanKmerObservations

sebhtml commented 11 years ago

I need to read the code to check what is sent and synchronize -- it may be necessary to change the code to synchronize distributions instead of raw counts. If it is the case, this will move out of the v2.2.0 milestone and into the next.

sebhtml commented 11 years ago

TaxonomyViewer/TaxonomyViewer.cpp

line 834

            m_taxonObservations[taxon]+=kmerCoverage; // case 2.

this will be replaced by

            m_taxonObservations[taxon][kmerCoverage] ++; // case 2.

Moving this to Ray-Future.

sebhtml commented 11 years ago

Evaluation: 30 human-hours