JiekaiLab / scTE

Weird gene names in output h5ad file #67

Open XiaoyuZhan520 opened 11 months ago

XiaoyuZhan520 commented 11 months ago

Hello,

I ran scTE with the following commands:

scTE_build -g hg38
scTE -i input.bam -o out -x hg38.exclusive.idx -CB CB -UMI UB --hdf5 True -p 1

However, my output contains a large number of weird gene names (~13,000), such as:
(A)n (AAA)n (AAAAAAC)n (AAAAAAG)n (AAAAAAT)n (AAAAAC)n (AAAAACA)n

I have no idea where these genes come from. Could you give me any suggestions? Many thanks in advance!

jphe commented 11 months ago

Those are simple repeats: a name of the form (MOTIF)n denotes a tandem repeat of that motif. You can delete those features in downstream analysis if you don't need that information.
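
A minimal sketch of how such features could be filtered out by name, assuming every simple repeat follows the "(MOTIF)n" pattern shown in the question. The helper names `is_simple_repeat` and `keep_mask` and the example feature list are hypothetical and not part of scTE:

```python
import re

# Matches names such as "(A)n" or "(AAAAAAC)n": a nucleotide motif in
# parentheses followed by a literal "n". Assumption: all simple repeats in
# the output use this naming pattern.
SIMPLE_REPEAT = re.compile(r"^\([ACGT]+\)n$")


def is_simple_repeat(name):
    """Return True if a feature name looks like a simple repeat."""
    return SIMPLE_REPEAT.match(name) is not None


def keep_mask(feature_names):
    """Return one boolean per feature: True to keep, False to drop."""
    return [not is_simple_repeat(n) for n in feature_names]


# Hypothetical feature names standing in for the var names of the h5ad file.
names = ["(A)n", "(AAAAAAC)n", "GAPDH", "L1HS", "AluYa5"]
mask = keep_mask(names)
kept = [n for n, keep in zip(names, mask) if keep]
print(kept)
```

With the h5ad file loaded into an AnnData object (for example via `anndata.read_h5ad`), the same mask could be built from `adata.var_names` and applied as `adata[:, numpy.array(mask)]`. This is only one way to do it and has not been tested against scTE output.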