sr320 / paper-pano-go

Draft manuscript describing Panopea gonad transcriptome

Duplicates #18

Closed mdelrio1 closed 7 years ago

mdelrio1 commented 8 years ago

@sr320 Hi Steven. I was checking the file Panopea_Dheilly_gameto-matches.xlsx to find other interesting genes and found 11 repeated contigs (for instance comp142462_c0_seq1, which, by the way, is also similar to comp142462_c1_seq1 = Sperm-associated antigen involved in sperm motility). Removing them would leave 161 instead of 172 sequences. Should I look for more repeated sequences, or leave the file as you uploaded it?
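A minimal sketch of how repeated contig IDs could be counted once the ID column has been exported from the spreadsheet to a plain list. The function name `find_repeated` and the sample list are hypothetical and only illustrate the check; they are not the actual contents of the file.

```python
from collections import Counter


def find_repeated(ids):
    """Return {contig_id: count} for every ID that appears more than once."""
    counts = Counter(ids)
    return {cid: n for cid, n in counts.items() if n > 1}


# Illustrative IDs only (assumption): not the real spreadsheet contents.
contig_ids = [
    "comp142462_c0_seq1",
    "comp142462_c1_seq1",
    "comp142462_c0_seq1",
]

repeated = find_repeated(contig_ids)
unique_count = len(set(contig_ids))  # rows left after dropping repeats

print(repeated)
print(unique_count)
```

Comparing `len(contig_ids)` with `unique_count` gives the same kind of before/after tally as the 172 versus 161 mentioned above.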

sr320 commented 8 years ago

Given how this was generated (blasting all Panopea contigs against Dheilly), this would be expected. I think it is fine to leave them in, as these could be different isoforms or homologs. Relatedly, it would be worth determining how Trinity labels contigs: comp versus c versus seq.
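As a starting point for looking at the labels, here is a rough sketch that splits IDs of the form `comp<N>_c<N>_seq<N>` into their three numeric parts and groups contigs that share the same `comp` number. What each part means biologically (component, sub-component, isoform) is an assumption here and should be checked against the Trinity documentation; the helper names `parse_trinity_id` and `group_by_component` are hypothetical.

```python
import re
from collections import defaultdict

# Pattern for IDs shaped like comp142462_c0_seq1 (assumed format).
TRINITY_ID = re.compile(r"^comp(\d+)_c(\d+)_seq(\d+)$")


def parse_trinity_id(contig_id):
    """Split an ID into (comp, c, seq) integers; raise if it does not match."""
    match = TRINITY_ID.match(contig_id)
    if match is None:
        raise ValueError(f"unexpected contig ID format: {contig_id!r}")
    comp, sub, seq = (int(g) for g in match.groups())
    return comp, sub, seq


def group_by_component(ids):
    """Group contig IDs by their comp number, keeping input order."""
    groups = defaultdict(list)
    for cid in ids:
        comp, _sub, _seq = parse_trinity_id(cid)
        groups[comp].append(cid)
    return dict(groups)


# The two IDs mentioned in this thread.
ids = ["comp142462_c0_seq1", "comp142462_c1_seq1"]
print(parse_trinity_id(ids[0]))
print(group_by_component(ids))
```

Grouping this way makes it easy to see which matches differ only in the `c` or `seq` part, which is where isoforms or close homologs would be expected to show up.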

mdelrio1 commented 8 years ago

@sr320 Thanks, I'll check the Trinity labelling.