WGLab / PennCNV

Copy number vaiation detection from SNP arrays
http://penncnv.openbioinformatics.org
Other
89 stars 53 forks source link

custom pfb - duplicate values #43

Closed andrisvAria closed 5 years ago

andrisvAria commented 5 years ago

Hey,

I am trying to create my own pfb for Illumina GSA. I compiled the pfb using 500 samples. The output is quite big and I noticed that there are a lot of duplicates, with different baf values. How should I deal with the duplicates? Do they somehow affect the next step detect_cnv? Can I remove them?

image

kaichop commented 5 years ago

This should not occur. Are they the exact same SNP? Did you check that these duplicates are in the original input files (same location, but different SNP identifiers?) -Kai

On Tue, May 28, 2019 at 7:32 AM andri90 notifications@github.com wrote:

Hey,

I am trying to create my own pfb for Illumina GSA. I compiled the pfb using 500 samples. The output is quite big and I noticed that there are a lot of duplicates, with different baf values. How should I deal with the duplicates? Do they somehow affect the next step detect_cnv? Can I remove them?

[image: image] https://user-images.githubusercontent.com/51119653/58474950-dc001000-814c-11e9-95b3-ac34a20fd87b.png

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/WGLab/PennCNV/issues/43?email_source=notifications&email_token=ABNG3OF7MOCP47JKTHSWTQDPXUJ3ZA5CNFSM4HQCOS7KYY3PNVWWK3TUL52HS4DFUVEXG43VMWVGG33NNVSW45C7NFSM4GWGQM7A, or mute the thread https://github.com/notifications/unsubscribe-auth/ABNG3OETI3KKIHQHNZDD4CDPXUJ3ZANCNFSM4HQCOS7A .

andrisvAria commented 5 years ago

I doublechecked my procedure and found out it was due to a problem with splitting the original Illumina report that contained the samples. I now have a pfb file with unique values. Thank you

-Andri

kaichop commented 5 years ago

Okay thank you for the update.