Closed fransilvionGenomica closed 2 months ago
I'm glad you like our tool! To estimate the mutation effect using the Puffin model, you can reconstruct the wild-type and mutant genomic sequences from a VCF file and compare Puffin's predictions for both sequences. This will allow you to observe how the mutation changes the transcription initiation signal. For example, you can calculate the difference in transcription initiation signal within a specific window in TSS region.
I hope this is helpful!
@KseniiaDundyk Thank you. When I use "puffin_D.py sequence fasta_file" command and I have sequences where variants are located on the forward strand, do I need to provide forward sequences to the model and collect only first 5 output channels? And vice versa, if variants are on the reverse strand, should I give the model reverse complement sequences and collect last 5 output channels? Or is the model strand-agnostic, and I can just always give it forward strand sequences?
Either way should work; you can always input forward strand sequence and use the first five output channels when the gene of interest is located on the ‘+’ strand and the last five output channels when the gene is located on the ‘-’ strand. Or you can give the model the reverse complement sequence, when the gene is located on the ‘-’ strand, but in this case, you will need to use the first five channels.
Hello,
Thank you for an amazing tool! In your README you are saying Puffin can be used for "Predict the effect of mutations on transcription initiation and study its mechanisms." If I have a VCF file with variants in TSS regions, what is the best way to use Puffin to check their effect on transcription initiation?