GangLiLab / genekitr

🧬 Gene analysis toolkit based on R
https://www.genekitr.fun
GNU General Public License v3.0
53 stars 7 forks source link

Problem about transId #26

Closed zy-fang closed 10 months ago

zy-fang commented 10 months ago

Hi,

Thank you for your fantastic work and the great convenience you've brought to us.

However, recently, when I attempted to convert a column of IDs, despite setting both the 'keepNA' and 'unique' parameters to TRUE, I noticed that the returned data length doesn't match the input. What's even more peculiar is that when I re-enter the initially missed IDs into the function, the data is then output completely, although some may be None. The package version of genekitr is 1.2.5. Details are as mentioned above. I'm looking forward to your response, and once again, thank you for your awesome work.

image image

Best wishes,

Zhaoyu

reedliu commented 10 months ago

Thanks. Could you please send me an example data of your input IDs so I could test?

zy-fang commented 10 months ago

test.csv Thanks.

reedliu commented 10 months ago

Hi, I have checked the data you uploaded. The reason is that the test.csv includes some duplicated ids while transid only gave you the unique result. For example, there are four ids of "TCR".

image
zy-fang commented 10 months ago

Oh, I got it. Thank you!

zy-fang commented 10 months ago

I have another question. I try to convert OR4F5 to Ensembl ID and found that it can be mapped to two different Ensembl IDs. When I checked on https://www.ensembl.org/, I discovered that one of the Ensembl IDs corresponds to another gene symbol. Is this due to different version naming? image image image

reedliu commented 10 months ago

The gene naming framework can be somewhat complicated. Basically, you can understand that "OR4F5" has one official name, which is also "OR4F5". However, it can also be referred to as an alias for other genes, such as "OR4F17" (source: NCBI Gene). This is why Genekitr will also return the ID "ENSG00000176695", which corresponds to "OR4F17".

image