Closed mt1022 closed 2 weeks ago
—gene-names is no longer really a maintained option in kb-python because gene names are already readily available in the counts_unfiltered directory that can be readily mapped to the corresponding gene IDs.
Thanks. I found the gene names file in the counts_unfiltered directory:
pbmc_1k_v3/counts_unfiltered/cells_x_genes.genes.names.txt
However, the gene names in this file are irregular. Here are some lines from this file.
...
WBP1LP7
OR4F29
ENSG00000237094
CICP7
ENSG00000250575
ENSG00000278757.1
ENSG00000293331
ENSG00000235146
MTND1P23
MTND2P28
MTCO1P12
ENSG00000278791
MTCO2P12
MTATP8P1
...
As shown above, some genes have names but were not parsed by kb:
grep 'ENSG00000278757.1' ~/psite_test/gencode46/gencode.v46.annotation.gtf
# chr1 ENSEMBL gene 516376 516479 . - . gene_id "ENSG00000278757.1"; gene_type "snRNA"; gene_name "U6"; level 3;
# chr1 ENSEMBL transcript 516376 516479 . - . gene_id "ENSG00000278757.1"; transcript_id "ENST00000614007.1"; gene_type "snRNA"; gene_name "U6"; transcript_type "snRNA"; transcript_name "U6.90-201"; level 3; transcript_support_level "NA"; tag "basic"; tag "Ensembl_canonical";
# chr1 ENSEMBL exon 516376 516479 . - . gene_id "ENSG00000278757.1"; transcript_id "ENST00000614007.1"; gene_type "snRNA"; gene_name "U6"; transcript_type "snRNA"; transcript_name "U6.90-201"; exon_number 1; exon_id "ENSE00003746310.1"; level 3; transcript_support_level "NA"; tag "basic"; tag "Ensembl_canonical";
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days
Describe the issue I am running the latest version of kb_python to reproduce the example pipeline in the BIVI paper (which was using an older version of kb_python). I generated the loom successfully but the target_name was gene IDs, not gene names. I rerun
kb count
with the--gene-names
argument but failed at loom generation step.What is the exact command that was run?
Command output (with
--verbose
flag)versions