umccr / RNAsum

Pipeline for generating RNAseq-based cancer patient reports
https://umccr.github.io/RNAsum/
Other
7 stars 4 forks source link

Handle new PURPLE CNV file #174

Closed pdiakumis closed 1 month ago

pdiakumis commented 1 month ago

At some point the PURPLE purple.cnv.gene.tsv file had some changes made to its columns. We need to handle the newer structure:

  chromosome  start    end gene        minCopyNumber maxCopyNumber unused somaticRegions germlineHomDeletionRegions germlineHetToHomDeletionRegions transcriptId    transcriptVersion chromosomeBand minRegions minRegionStart minRegionEnd minRegionStartSupport minRegionEndSupport minRegionMethod minMinorAlleleCopyNumber
   <chr>       <int>  <int> <chr>               <dbl>         <dbl> <chr>           <dbl>                      <dbl>                           <dbl> <chr>           <chr>             <chr>               <dbl>          <int>        <int> <chr>                 <chr>               <chr>                              <dbl>
 1 chr1        11869  14409 DDX11L1              1.98          1.98 0                   1                          0                               0 ENST00000456328 2                 p36.33                  1              1      2652000 TELOMERE              NONE                BAF_WEIGHTED                           0
 1 chromosome                     
 2 start                          
 3 end                            
 4 gene                           
 5 minCopyNumber                  
 6 maxCopyNumber                  
 7 unused                         
 8 somaticRegions                 
 9 germlineHomDeletionRegions     
10 germlineHetToHomDeletionRegions
11 transcriptId                   
12 transcriptVersion              
13 chromosomeBand                 
14 minRegions                     
15 minRegionStart                 
16 minRegionEnd                   
17 minRegionStartSupport          
18 minRegionEndSupport            
19 minRegionMethod                
20 minMinorAlleleCopyNumber       
   chromosome  start    end gene        minCopyNumber maxCopyNumber somaticRegions transcriptId    isCanonical chromosomeBand minRegions minRegionStart minRegionEnd minRegionStartSupport minRegionEndSupport minRegionMethod minMinorAlleleCopyNumber depthWindowCount
   <chr>       <dbl>  <dbl> <chr>               <dbl>         <dbl>          <dbl> <chr>           <lgl>       <chr>               <dbl>          <dbl>        <dbl> <chr>                 <chr>               <chr>                              <dbl>            <dbl>
 1 chr1        11869  14409 DDX11L1              1.93          1.93              1 ENST00000450305 TRUE        p36.33                  1              1      1117830 TELOMERE              BND                 BAF_WEIGHTED                           0               78
 1 chromosome              
 2 start                   
 3 end                     
 4 gene                    
 5 minCopyNumber           
 6 maxCopyNumber           
 7 somaticRegions          
 8 transcriptId            
 9 isCanonical             
10 chromosomeBand          
11 minRegions              
12 minRegionStart          
13 minRegionEnd            
14 minRegionStartSupport   
15 minRegionEndSupport     
16 minRegionMethod         
17 minMinorAlleleCopyNumber
18 depthWindowCount