airr-community / ogrdb

Website and associated database for managing submissions of inferred alleles

Identify duplicated sequences in genotype_statistics.R #45

Closed (williamdlees closed this issue 5 years ago)

williamdlees commented 5 years ago

In the human library, for example: IGHV3-30-3*03 has the same V sequence as IGHV3-30*04; IGHV3-30-5*01 has the same V sequence as IGHV3-30*18; and IGHV3-30-5*02 has the same V sequence as IGHV3-30*02.

We wouldn't expect any duplicates to appear after genotyping, because today's tools have no way to distinguish alleles whose sequences are identical. genotype_statistics.R should warn if it sees any.
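As a rough illustration of the check being requested, here is a minimal sketch in Python. It is not the code in genotype_statistics.R: the function names (`find_duplicate_sequences`, `warn_on_duplicates`) are hypothetical, the sequences are short toy placeholders rather than real germline data, and ignoring case and gap characters (`.` and `-`) when comparing is an assumption about how sequences might be normalised.

```python
import warnings
from collections import defaultdict


def find_duplicate_sequences(alleles):
    """Return lists of allele names that share an identical sequence.

    `alleles` maps allele name -> nucleotide sequence (hypothetical input shape).
    """
    by_seq = defaultdict(list)
    for name, seq in alleles.items():
        # Assumption: compare sequences case-insensitively and without
        # alignment gap characters, so formatting does not hide duplicates.
        key = seq.upper().replace(".", "").replace("-", "")
        by_seq[key].append(name)
    # Keep only sequences that are carried by more than one allele name.
    return [sorted(names) for names in by_seq.values() if len(names) > 1]


def warn_on_duplicates(alleles):
    """Emit one warning per group of identically sequenced alleles."""
    groups = find_duplicate_sequences(alleles)
    for names in groups:
        warnings.warn("Identical sequences found: " + ", ".join(names))
    return groups


# Toy placeholder sequences, NOT real germline data.
toy_library = {
    "IGHV3-30-3*03": "caggtgcagc",
    "IGHV3-30*04": "CAGGTGCAGC",
    "IGHV3-30*01": "CAGGTGCAGT",
}

if __name__ == "__main__":
    warn_on_duplicates(toy_library)
```

Grouping by sequence instead of comparing every pair keeps the check to a single pass over the library and reports each set of identical alleles once, however many names share the sequence.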

williamdlees commented 5 years ago

A warning is now issued.