Closed quiquemedina closed 6 months ago
Thanks, I will take a look
@lishensuo 关于 .opt_pancan
的设置、调用和更新你还需要优化下。
当前代码还有多处 cnv_gistic2 的记录,但你对应的 query_value
函数以及没有了对应的处理。
@quiquemedina Set data_type = "cnv"
should be fine. We only use the GISTIC2 data in the pan-can analysis (including previous versions). At default (the current version), the thresholded GISTIC2 results are used.
I will inspect and remove "cnv_gistic2" properly. In addition, it might be better to set use_thresholded_data=FALSE
as default which will be modified in next PR.
Got it.
Dear @ShixiangWang and @lishensuo,
I appreciate your clarification regarding the "cnv" feature now encompassing the "cnv_gistic2" data in the latest version of UCSCXenaShiny. However, I would like to emphasize a critical aspect of our research that necessitates distinguishing between these two data types.
In our priviosu analyses, we have observed that exploring gene correlations using "cnv" and "cnv_gistic2" separately in R yields distinct gene signatures, implying unique biological insights. This is because the underlying datasets for these two types are inherently different. Hence, maintaining them as separate options in the new version, as was the case in previous iterations, would be highly beneficial for detailed genomic analysis.
To illustrate, employing an algorithm to discover genes with high correlation cutoffs (e.g., rho > 0.4 or <-0.4) in UCS cancer type for mRNA expression vs stemness attributes, we identified:
Clearly, the gene signatures vary significantly between these data types.
In previous versions, the data_type
argument allowed for a range of gene profile types, including "mRNA", "transcript", "protein", "mutation", "cnv" (-2, -1, 0, 1, 2), "cnv_gistic2", "methylation", "miRNA". The ability to select either "cnv" or "cnv_gistic2" explicitly was invaluable for our analyses.
Therefore, I kindly request that you consider reinstating these as separate data_type options in the new version. This change would greatly enhance the tool's utility and accuracy for genomic research.
Furthermore, the radar ploting for "cnv" argument in the new version is faling: Erro: 'arg' should be one of “mRNA”, “transcript”, “protein”, “mutation”, “cnv”, “methylation”, “miRNA”, “fusion”, “promoter”, “APOBEC”
'''
Thank you for considering this request. Your support is crucial to the advancement of our research.
Best regards,
Enrique
Helping to make my case above, let me point to Pros and Cons of Using CNV vs. CNV GISTIC2:
CNV (Copy Number Variation) Pros:
Cons:
CNV GISTIC2 (Genomic Identification of Significant Targets in Cancer 2)
Pros:
Cons:
In summary, while CNV provides a broad perspective on genomic variations, CNV GISTIC2 offers a more targeted and statistically relevant approach, especially valuable in the context of cancer research. The choice between them should be guided by the specific research objectives, the disease of interest, and the level of detail required.
Thank you for your question. We have discussed you request. Please wait for one or two days for the optimization.
@quiquemedina Hi, thanks for your comments and insights :). Previously, I only included the gistic2 copy number data. Based on your suggestions, we would like to reunify the data options.
i.e., for the following datasets:
cnv_gistic
cnv_gistic (thresholded)
cnv
Hi, @lishensuo Please make sure the three options are available to the users (in all exported functions and Shiny UI).
Should we consider that option 2 be merged into option 1 in the internal code, and support option 2 with your designed .opt_pancan
? Find the easier way to implement the feature. Please discuss with me if you have any problems.
OK. I will pull the request ASAP.
From my perspective, I think it is better to show one choice for each molecular profile. Within one profile, we can provide further setting like DNA methylation which supports two arrays and personalized limitation.
Based on above 3 types of CNV data, I plan to use the thresholded gistic2 as default choice and provide two further settings
TRUE
);FALSE
)@lishensuo and @ShixiangWang,
Absolutely, I wholeheartedly agree with your approach! Focusing on one choice per molecular profile, with the added flexibility of further settings like the dual-array support in DNA methylation, is a strategic and user-friendly way to present these options. Your plan to use the thresholded GISTIC2 as the default for CNV data is particularly insightful. Providing users with the option to choose GISTIC2 data (defaulting to TRUE) and the additional choice to use thresholded data (defaulting to FALSE) when GISTIC2 is enabled adds a valuable layer of customization. This approach not only enhances the utility of your tool but also caters to diverse user needs in a comprehensive and efficient manner. Great development!
@quiquemedina Thanks. Let's wait for a new PR from shensuo, I will review and merge it to the master branch.
Thanks all.
Dear Support Team,
I am writing to address a concern regarding the recent update of UCSCXenaShiny (UCSCXenaShiny v2.0.0 based on UCSCXenaTools v1.4.8 ). It appears that the data-type "cnv_gistic2" is no longer accessible in R in this latest version. To illustrate, in previous versions of UCSCXenaShiny (e.g., v1.1.10 based on UCSCXenaTools v1.4.8), the following R snippet was operational without issues:
However, executing the same code in the latest version fails to plot resulting in an error message:
This suggests that "cnv_gistic2" as a data_type option is absent from the updated version.
I kindly request your assistance in investigating this issue. The availability of "cnv_gistic2" data-type is crucial for our analyses, and its absence significantly impacts our research workflow.
Thank you for your attention to this matter. I look forward to your prompt response.
Sincerely,
Enrique