Open zhongguodiyidao opened 1 year ago
The test.regions_vs_motifs.rankings.feather
file and motifs-v10nr_clust-nr2.tgi-m0.01-o0.0.tbl
you can use with pySCENIC:
https://pyscenic.readthedocs.io/en/latest/installation.html#docker-podman-and-singularity-apptainer-images
Although for pySCENIC is would be better to use gene-based databases (-g
option).
First of all, thank you for your reply! I'm so sorry,I've been quite confused lately. The species we are studying is the Brazilian red eared turtle,When using Python for scenic analysis, I prepared several files.Firstly, we used bedtools getfasta to create feature files for upstream and downstream 5kb and upstream 500bp, respectively. ,
Furthermore, as per your previous suggestion, we have modified the gene name of the mouse in the motiftoTF file to that of the turtle,
,
Finally, we replaced it https://resources.aertslab.org/cistarget/tf_lists/allTFs_mm.txt The gene name of the transfer factor. May I ask if these three files are sufficient? Looking forward to your reply, thank you again!
Yes, those should be sufficient.
Hi Ghuls, I don't think that these files are actually sufficient based on my experience trying to create the database for Xenopus tropicalis.
I have generated regions_vs_motifs.rankings.feather
To do this, I used
Where my .fasta was created by
I put gene names instead of coordinate positions because I am linking the genenames to GeneList in
Soo, when I upload my regions_vs_motifs.rankings.feather it looks like this
However, This is different than your sample data which looks like this
As you can see, your demo file has "features" with encode motif id, however, my feather file does NOT have the same information. No where in any documentation do you show the format of the fasta file required to generate such an output from another species, unless I have missed something.
When I further run my .feather file it looks like this:
So I get this weird warning, and then when I look at the output it looks like this:
BUT if I run using your demo data I get:
So my question is what do I need to add to my create_cistarget_motif_databases.py syntax to be able to actually link my .tbl database (with all lowercase Xenopus tropicalis genes) to my rankings.feather. I don't understand how to go from .regions_vs_motifs.rankings.feather to .genes_vs_motifs.ranking.feather.
Please help, I really really want to be able to use your awesome software on Xenopus single cell data.
Dear author,
Looking forward to your reply, thank you!