KatyBrown / CIAlign

MIT License
117 stars 9 forks source link

More cleaning/filtration steps #43

Open fanninpm opened 2 years ago

fanninpm commented 2 years ago

I'm in the middle of making an IRMA module for Adenoviruses. I came across your repo today and thought it would be useful for that purpose (I'm definitely thinking of using it to generate consensus sequences.) The IRMA paper mentions a few filtration steps that I thought would be a natural fit (in the "Methods" section, in the "Datasets" sub-section, in the "Influenza alignment dataset" sub-sub-section, second paragraph). In particular, they mentioned:

KatyBrown commented 2 years ago

I'm sorry it's taken such a long time to reply! We will look at incorporating these features. All except the frameshift seems reasonably straightforward - I'll look into it and get back to you.