Simon-Coetzee / motifBreakR

A Package For Predicting The Disruptiveness Of Single Nucleotide Polymorphisms On Transcription Factor Binding Sites.

Strand information #6

Closed: xtmgah closed 3 months ago

xtmgah commented 8 years ago

Hello:

I am running motifBreakR to find the effect of promoter mutations on transcription factor binding sites. I made two identical BED files that differ only in strand ("+" and "-"), and the results are different. So what does "Strand" mean in the provided BED file? All the mutations I generated are on the reference strand (as in VCF files). Which strand should I use? Thanks.

dennishazelett commented 8 years ago

If you change the strand, you change the meaning of the BED file completely. For example, if you indicate in your SNP label that the values are ":T:G", motifBreakR understands by default that there is a "T" in the reference genome at that position, and that it should substitute a "G" to calculate SNP effects. But if you put "-" in the strand field of the BED file, motifBreakR will reverse complement the SNP to make it compatible with the reference genome. I recommend sticking to the + strand to avoid confusion. Does this answer your question?
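
To make the strand point concrete, here is a minimal Python sketch of what reverse complementing a SNP's alleles means. It is an illustration only, not motifBreakR's code (motifBreakR is an R package); the function names `reverse_complement` and `alleles_on_plus_strand` are hypothetical and exist only for this example.

```python
# Illustrative sketch only; this is NOT motifBreakR's implementation.
# All names here are hypothetical helpers made up for this example.

# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def alleles_on_plus_strand(ref, alt, strand):
    """Express a ref/alt allele pair relative to the + (reference) strand.

    Alleles labelled on "+" are returned unchanged; alleles labelled on "-"
    are reverse complemented so they can be compared with the reference.
    """
    if strand == "+":
        return ref, alt
    if strand == "-":
        return reverse_complement(ref), reverse_complement(alt)
    raise ValueError("strand must be '+' or '-', got %r" % (strand,))


if __name__ == "__main__":
    # The same ":T:G" label read with each strand value.
    print(alleles_on_plus_strand("T", "G", "+"))  # ('T', 'G')
    print(alleles_on_plus_strand("T", "G", "-"))  # ('A', 'C')
```

Under this reading, the same ":T:G" label describes a T>G change on "+" but an A>C change on "-", which would explain why two otherwise identical BED files give different results.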

xtmgah commented 8 years ago

Yes. Great. Thanks for your explanation.

xtmgah commented 8 years ago

Another question: do you know which program or software can be used to extract the motifs that cover a specific genome location, and also show the core motif information (for example, ETS as "TTCC")?

Thanks.

dennishazelett commented 8 years ago

Use the MotifDb package; see the query function.