NBISweden / IgDiscover-legacy

Analyze antibody repertoires and discover new V genes from high-throughput sequencing reads
https://www.igdiscover.se
MIT License

What is CDR3_shared_ratio #98

Closed carollia closed 10 months ago

carollia commented 5 years ago

Hi,

I was going through the results of my assigned_V_pregermline.tab and assigned V_germline.tab and a number of candidates were filtered out because of a high CDR3_shared_ratio. I've tried to figure out what this means and how it's calculated but I cannot. Can you tell me what it means and how it is calculated?

Thanks, Hannah

MartinMatthewC commented 5 years ago

Hi Carollia,

The cdr3_shared_ratio filter is an indirect way of identifying chimeric sequences and filtering them from the output. It looks at the unique CDR3 sequences associated with one rearrangement and checks whether the same CDR3 sequences also appear with another allele, which indicates that some kind of chimerism event may have occurred during library PCR.

In general, chimeric events occur at a low rate and are usually removed by the allelic ratio filter, but the cdr3_shared_ratio filter can be useful in some circumstances.

However, while this filter can identify and remove chimeric sequences, we have recently noticed that it also removes some sequences that are real germlines. To avoid this, we tend to use a very high threshold (for example, cdr3_shared_ratio: 0.99).
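To make the idea concrete, here is a minimal sketch of how such a ratio could be computed. It is not IgDiscover's actual implementation: the function name `cdr3_shared_ratios`, the input format, the toy sequences, and the exact definition used here (the fraction of a candidate's unique CDR3s that also occur with at least one other candidate) are all assumptions made for illustration.

```python
from collections import Counter


def cdr3_shared_ratios(cdr3s_by_candidate):
    """Hypothetical helper: for each candidate, return the fraction of its
    unique CDR3s that are also observed with at least one other candidate."""
    # Count in how many candidates each unique CDR3 occurs
    occurrences = Counter()
    for cdr3s in cdr3s_by_candidate.values():
        occurrences.update(set(cdr3s))

    ratios = {}
    for name, cdr3s in cdr3s_by_candidate.items():
        unique = set(cdr3s)
        shared = sum(1 for cdr3 in unique if occurrences[cdr3] > 1)
        ratios[name] = shared / len(unique) if unique else 0.0
    return ratios


# Toy data (invented): the second candidate shares all of its CDR3s
# with the first one, as a chimeric artifact might.
example = {
    "IGHV1-2*01": ["ARDYW", "ARGGW", "ARSSW", "ARTTW"],
    "IGHV1-2*01_S1234": ["ARDYW", "ARGGW"],
}
ratios = cdr3_shared_ratios(example)

# Assumed filter rule: discard candidates whose ratio exceeds the threshold
threshold = 0.99
kept = [name for name, ratio in ratios.items() if ratio <= threshold]
```

Under these assumptions, a threshold close to 1 discards only candidates for which essentially every CDR3 is also seen elsewhere, which is why a high setting removes fewer real germline sequences.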

Martin




marcelm commented 10 months ago

This repository is outdated and is going to be archived. Please see the new repository at https://gitlab.com/gkhlab/igdiscover22/ or the homepage at https://www.igdiscover.se/ for the most recent and maintained IgDiscover version.