GoekeLab / m6anet

Detection of m6A from direct RNA-Seq data
https://m6anet.readthedocs.io/
MIT License
104 stars 19 forks source link

transcript_position column header from data.site_proba.csv output file #130

Closed gsukrit closed 2 months ago

gsukrit commented 1 year ago

Hi team,

I would like to have clarity on the column header _transcriptposition in the output file of _data.site_proba.csv__ generated after the command m6anet inference. Does it indicate the position of modified base from the start of that particular transcript ID or the coordinate on the genome where that modified base is predicted. If it is the position of the base from the start of that particular transcript, does it start from the mRNA coordinates (starting from the 5' UTR region) on the genome ?

Any idea on this concern further will be highly appreciated.

Thanks,

Sukriti

kristinrma commented 1 year ago

Hi @gsukrit, The former is correct- the coordinates are relative to the specific transcript not the genome. The transcript position indicates the position of the modified base relative to the start of the 5’ end of that particular transcript. Discussion #104 gives further details on mapping between transcript and genome coordinates if you are interested.