zanmer / Mesothelioma_evolution_deciphering_drugable_somatic_alterations

Analysis scripts and relative data used in the paper "Clonal Architecture in Mesothelioma is prognostic and shapes the Tumour Microenvironment" by Zhang et al.
3 stars 2 forks source link

Adding Copy Number changes in the REVOLVER input #3

Open sunandinisharma opened 2 years ago

sunandinisharma commented 2 years ago

Hi,

I wanted to ask if you could explain a little how were the Copy number aberrations added as an input to REVOLVER? It will be great if I can get some more clarification on a method to follow to assign CCF and cluster to copy number changes.

Thanks

zanmer commented 2 years ago

First, you should join the overlapped copy number segments among multiple samples. I used GenomicRanges R package or HTSeq python package to do this. And you will generate a matrix of copy number segments where the value means present or absent of the copy number aberrations. And then you should annotate each segments with cytoband name and known driver genes according to the chromosome coordination. SCNAs were manually assigned to CCF clusters based on p/a pattern of each segments in samples.

sunandinisharma commented 2 years ago

Hi Kevin, Thanks for a quick response. As you said, I have a matrix with SCNA: 17p, 1p, 6q with presence and absence of these aberrations across all patient samples. Now, I am just stuck with how to assign these SCNA to the CCF cluster? I ran pyclone sample by sample meaning one at a time since these samples are not series samples but rather diagnostic. Pyclone gave ccf clusters with different mutations for each .tsv that I ran through pyclone. I need a hint how will I assign the cluster and determine the ccf for SCNA?

I will really appreciate your help. This will be very helpful for the completion of my phd deader station project.

Thanks, Sunandini Sharma

Get Outlook for iOShttps://aka.ms/o0ukef


From: Kevin Zhang @.> Sent: Monday, August 29, 2022 11:33:58 PM To: zanmer/Mesothelioma_evolution_deciphering_drugable_somatic_alterations @.> Cc: Sharma, Sunandini @.>; Author @.> Subject: Re: [zanmer/Mesothelioma_evolution_deciphering_drugable_somatic_alterations] Adding Copy Number changes in the REVOLVER input (Issue #3)

Non-UNMC email First, you should join the overlapped copy number segments among multiple samples. I used GenomicRanges R package or HTSeq python package to do this. And you will generate a matrix of copy number segments where the value means present or absent

First, you should join the overlapped copy number segments among multiple samples. I used GenomicRanges R package or HTSeq python package to do this. And you will generate a matrix of copy number segments where the value means present or absent of the copy number aberrations. And then you should annotate each segments with cytoband name and known driver genes according to the chromosome coordination. SCNAs were manually assigned to CCF clusters based on p/a pattern of each segments in samples.

— Reply to this email directly, view it on GitHub [github.com]https://urldefense.com/v3/__https://github.com/zanmer/Mesothelioma_evolution_deciphering_drugable_somatic_alterations/issues/3*issuecomment-1231128805__;Iw!!JkUDQA!N6kpjY9BL5KNg7TbATiqRwp6757E3vC6vhBy4CVnkZfzKZ-JVnCN1VWPDtKKmrScdvwaB7vrUaHTg1rlvG4gSGYnzDICVg$, or unsubscribe [github.com]https://urldefense.com/v3/__https://github.com/notifications/unsubscribe-auth/ANCRZG6QNVKSTTUV6AGEHKLV3WFLNANCNFSM577RL7YA__;!!JkUDQA!N6kpjY9BL5KNg7TbATiqRwp6757E3vC6vhBy4CVnkZfzKZ-JVnCN1VWPDtKKmrScdvwaB7vrUaHTg1rlvG4gSGaQMDGaSw$. You are receiving this because you authored the thread.Message ID: @.***>

The information in this e-mail may be privileged and confidential, intended only for the use of the addressee(s) above. Any unauthorized use or disclosure of this information is prohibited. If you have received this e-mail by mistake, please delete it and immediately contact the sender.

zanmer commented 2 years ago

You should run pyclone patient by patient, not sample by sample, means multiple samples from one patient are incorporated together to do pyclone analysis. And you should do "force calling" procedure, usually using bam-readcount, to obtain the read counts supporting ref and alt alleles for the mutations not detected in individual sample. Also accurate ploidies and purities of tumor samples are necessary. When finish pyclone, you can construct a CCF matrix which row represent clusters and column for samples. cluster s1 s2 s3 s4 1 0.97 0.93 0.96 0.92 2 0 0.70 0 0 3 0.48 0 0 0

According to the pattern of CCF values, you can manually assign SCNA to the corresponding clusters.

sunandinisharma commented 2 years ago

Thank you so much Kevin. I ran Pyclone yesterday on the patient samples. This explanation given by you makes sense. Of note, the design of the experiment is such, I don't have multiple samples for a patient meaning this is not a multi region sequencing neither we have relapse biopsy. So, let's say Cluster 1 in patient 1 is not same as Cluster 1 in patient 2? Is that correct? Please let me know if I am thinking in the right direction or I can share the CCF matrix that I will construct from the results. Maybe it will make more sense talking about this.

You have been a great help. I sincerely appreciate it a lot.

Thanks and kindest regards, Sunandini

Get Outlook for iOShttps://aka.ms/o0ukef


From: Kevin Zhang @.> Sent: Wednesday, August 31, 2022 3:27:11 AM To: zanmer/Mesothelioma_evolution_deciphering_drugable_somatic_alterations @.> Cc: Sharma, Sunandini @.>; Author @.> Subject: Re: [zanmer/Mesothelioma_evolution_deciphering_drugable_somatic_alterations] Adding Copy Number changes in the REVOLVER input (Issue #3)

Non-UNMC email You should run pyclone patient by patient, not sample by sample, means multiple samples from one patient are incorporated together to do pyclone analysis. And you should do "force calling" procedure, usually using bam-readcount [github. com]

You should run pyclone patient by patient, not sample by sample, means multiple samples from one patient are incorporated together to do pyclone analysis. And you should do "force calling" procedure, usually using bam-readcount [github.com]https://urldefense.com/v3/__https://github.com/genome/bam-readcount__;!!JkUDQA!L2F0e3khbLgdS8YpCuZmiVHNMzvtx8V3ct_hPPuV0DPLJAc5L5eHUy2MqoMXinymRfMSbSxpuOws_PSS4j9EcEdMmCmrHw$, to obtain the read counts supporting ref and alt alleles for the mutations not detected in individual sample. Also accurate ploidies and purities of tumor samples are necessary. When finish pyclone, you can construct a CCF matrix which row represent clusters and column for samples. cluster s1 s2 s3 s4 1 0.97 0.93 0.96 0.92 2 0 0.70 0 0 3 0.48 0 0 0

According to the pattern of CCF values, you can manually assign SCNA to the corresponding clusters.

— Reply to this email directly, view it on GitHub [github.com]https://urldefense.com/v3/__https://github.com/zanmer/Mesothelioma_evolution_deciphering_drugable_somatic_alterations/issues/3*issuecomment-1232627193__;Iw!!JkUDQA!L2F0e3khbLgdS8YpCuZmiVHNMzvtx8V3ct_hPPuV0DPLJAc5L5eHUy2MqoMXinymRfMSbSxpuOws_PSS4j9EcEeOvuiyGA$, or unsubscribe [github.com]https://urldefense.com/v3/__https://github.com/notifications/unsubscribe-auth/ANCRZG7CW7UO3L662FQBL7TV34JN7ANCNFSM577RL7YA__;!!JkUDQA!L2F0e3khbLgdS8YpCuZmiVHNMzvtx8V3ct_hPPuV0DPLJAc5L5eHUy2MqoMXinymRfMSbSxpuOws_PSS4j9EcEeq3v2i-g$. You are receiving this because you authored the thread.Message ID: @.***>

The information in this e-mail may be privileged and confidential, intended only for the use of the addressee(s) above. Any unauthorized use or disclosure of this information is prohibited. If you have received this e-mail by mistake, please delete it and immediately contact the sender.

zanmer commented 2 years ago

Yes, you are right. Clusters from different patients may share the same somatic events, but they are truly different tumor clones or cells. Pyclone clusters are only comparable among samples from the same patient . Besides, REVOVLER was designed to infer repeated evolutionary trajectories based on multi-region sequencing, not relapse datasets.