Open taylorreiter opened 4 years ago
is this for the documentation, or for a paper, or "just" for understanding? I'm happy to help take it in any of those directions :)
I read through it ✅
I went to start writing up what charcoal does, and realized that I didn't know exactly how it was doing what it does. So I read the code and wrote down what each step does. I don't think this necessarily belongs in the documentation, but it did help me understand how charcoal works, so I thought it could be helpful to put it somewhere others can see it, in case they don't want to take the time to read the code.
Got it! Random thoughts --
Still needs work, esp around talking about what sourmash is actually doing. But first pass!
Charcoal identifies and removes contamination in metagenome-assembled genomes using k-mer based methods.
(`f_ident`); e.g. the number of hashes from the entire genome that matched to any genome in the database.
(`f_major`).
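To make the `f_ident` description concrete, here is a minimal sketch of how such a value could be computed from plain sets of hashes. It is not charcoal's or sourmash's actual code: the function name `fraction_identified`, the dictionary layout of the database, and the choice to report the matched count as a fraction of the genome's hashes are all assumptions made for illustration.

```python
# Hypothetical sketch: hashes are modeled as plain Python sets of integers.
# Neither the function name nor the data layout comes from charcoal itself.

def fraction_identified(genome_hashes, database_hashes_by_genome):
    """Return the fraction of the genome's hashes found in any database genome."""
    genome_hashes = set(genome_hashes)
    if not genome_hashes:
        return 0.0

    matched = set()
    for ref_hashes in database_hashes_by_genome.values():
        # Collect every genome hash that also appears in this reference genome.
        matched |= genome_hashes & set(ref_hashes)

    # Assumption: f_ident is the matched count divided by the total hash count.
    return len(matched) / len(genome_hashes)


# Toy data: 8 genome hashes, 4 of which appear somewhere in the database.
genome = {1, 2, 3, 4, 5, 6, 7, 8}
database = {"ref_a": {1, 2, 3, 99}, "ref_b": {3, 4, 100}}

f_ident = fraction_identified(genome, database)
print(f_ident)  # 0.5
```

Hash 3 appears in both toy references but is only counted once, since the matches are pooled into a single set before dividing.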