repseqio / library-imgt

IMGT segment library converted to RepSeq.IO JSON format
12 stars 5 forks source link

Sequences dropped - is this normal? #9

Open fabio-t opened 4 years ago

fabio-t commented 4 years ago
Sequence dropped because contain wildcards: K02153|IGHV1S20*01|Mus musculus_A/J|F|V-REGION|1..251|251 nt|1| | || |251+67=318|partial in 5'| |
Sequence dropped because contain wildcards: K02154|IGHV1S21*01|Mus musculus_A/J|F|V-REGION|1..230|230 nt|1| | || |230+67=297|partial in 5'| |
Sequence dropped because contain wildcards: M34981|IGHV1S51*01|Mus musculus_A/J|P|V-REGION|261..554|294 nt|1| | || |294+24=318| | |
Downloading: http://www.imgt.org/genedb/GENElect?query=7.14+IGHD&species=Mus+musculus
Downloading: http://www.imgt.org/genedb/GENElect?query=7.14+IGHJ&species=Mus+musculus
Downloading: http://www.imgt.org/genedb/GENElect?query=7.14+IGKV&species=Mus+musculus
Sequence dropped because contain wildcards: M28134|IGKV1-117*02|Mus musculus_CE/J|F|V-REGION|787..1088|302 nt|1| | || |302+33=335| | |

I'm submitting this to your attention just for safety - I don't know if it's a mistake in the script that builds the IMGT database, or an error upstream. That is just an extract - there's quite a few more dropped sequences.