Closed ZWael closed 1 year ago
Hi @ZWael
Please use version >= 2.13.3
of cBioPortalData
.
Best regards,
Marcel
Hi @LiNk-NY I test this with a more recent version of R and i have this message advising me manually download the data any idea why ?
Our testing shows that 'pog570_bcgsc_2020' is not currently building. Use 'downloadStudy()' to manually obtain the data. Proceed anyway? [y/n]:
package.version("cBioPortalData")
[1] "2.13.6"
data=cBioPortalData(api = cbio,
by = "hugoGeneSymbol",
genes=hugo_id,
studyId = "pog570_bcgsc_2020",
molecularProfileIds ="pog570_bcgsc_2020_rna_seq_mrna" )
Our testing shows that 'pog570_bcgsc_2020' is not currently building.
Use 'downloadStudy()' to manually obtain the data.
Proceed anyway? [y/n]:
y
Error in split.default(x = seq_len(nrow(x)), f = f, drop = drop, ...) :
group length is 0 but data length > 0
Hi @ZWael
Can you provide a reproducible example? Particularly the value of hugo_id
.
I have tried this with one of the gene panels and it works ok.
suppressPackageStartupMessages(library(cBioPortalData))
cbio <- cBioPortal()
cBioPortalData(
api = cbio,
genePanelId = "IMPACT341",
studyId = "pog570_bcgsc_2020",
molecularProfileIds ="pog570_bcgsc_2020_rna_seq_mrna"
)
#> A MultiAssayExperiment object of 1 listed
#> experiment with a user-defined name and respective class.
#> Containing an ExperimentList class object of length 1:
#> [1] pog570_bcgsc_2020_rna_seq_mrna: SummarizedExperiment with 341 rows and 570 columns
#> Functionality:
#> experiments() - obtain the ExperimentList instance
#> colData() - the primary/phenotype DataFrame
#> sampleMap() - the sample coordination DataFrame
#> `$`, `[`, `[[` - extract colData columns, subset, or experiment
#> *Format() - convert into a long or wide DataFrame
#> assays() - convert ExperimentList to a SimpleList of matrices
#> exportClass() - save data to flat files
Created on 2023-08-25 with reprex v2.0.2
Note. There may be some data that was not able to be imported in the metadata()
.
@LiNk-NY Thank you,
interesting,
I tested with same arguments as you, and i verified the version (2.13.6 installation with BiocManager).
for hugo_id
it is a vector with hugo gene names from the gencode.v27 (40563 names) but i tested with "ALB" also
(https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_27/gencode.v27.annotation.gtf.gz)
data=cBioPortalData(api = cbio,
#by = "hugoGeneSymbol",
genePanelId = "IMPACT341",
studyId = "pog570_bcgsc_2020",
molecularProfileIds ="pog570_bcgsc_2020_rna_seq_mrna" )
Our testing shows that 'pog570_bcgsc_2020' is not currently building.
Use 'downloadStudy()' to manually obtain the data.
Proceed anyway? [y/n]:
The message you see is normal for datasets that have not been fully built. We do testing and update a small dataset that gets queried at every cBioPortalData
call. You can either download it with downloadStudy
or proceed but you may not get all the data in the MultiAssayExperiment
, see the metadata()
. I have added a Considerations
section in the vignette.
Best,
Marcel
Hello @LiNk-NY and cBioPortalData team, I have an error retrieving gene expression data fro some studies (the clinical data was retrieved with no problem with the same studyId) the error message was Error in .check_study_id_building(exargs[["studyId"]], "api_build"): 'studyId', pog570_bcgsc_2020, not found. See 'getStudies()'.
you can find a reprex below
Created on 2023-08-21 with reprex v2.0.2