ZealousGeneticist / cartogene

A Python program for taking chemicals and finding the genes they interact with and the mechanisms of interaction.
GNU General Public License v3.0
2 stars 0 forks source link
bioinformatics bioinformatics-tool genetics

cartogene

A Python program for taking chemicals and finding the genes they interact with and the mechanisms of interaction.

Installation Instructions

git clone https://github.com/ZealousGeneticist/cartogene.git

User Guide

Put the chemicals you are wanting to analyze in the input text file (tutorial example: ) as a MeSH® name, synonym, or accession ID (“MESH:…”), or by CAS RN. You may also limit your search to official names by using the “name:” prefix. Make sure they are return- or |-delimited!

Then you can run the program on the chemicals by running this command in the terminal (given you have python3 installed and pip installed):

python3 cartogene_standalone.py -i

Your final list should be in another text file called , unless you wish to name it something else, in which case you simply add -o to the above command and it will come out as .

If you are using a machine like a supercomputer where you do not have permissions to install packages to the python folder which are needed for this program, make sure to run this command after MANUALLY installing the required packages. A fix for this so you don't need to manually do that is coming soon for the less permissioned among your machines.

python3 cartogene_standalone.py -i -z


Quick Guide on Networks

Nodes: Think of nodes as individual points or entities. In a biological network, nodes could be genes or chemicals or proteins or all of the above.

Edges: Edges represent connections or relationships between nodes. In a biological network, an edge could represent an interaction between genes or proteins or chemicals or any of the above.

Networks: Networks are a collection of nodes and edges. When you have multiple chemicals/proteins/genes connected through interactions, you have a biological network.

Communities: Communities are groups of nodes that are more densely connected to each other than to nodes outside the group. In a biological network, a community could be a group of proteins who interact more with each other (say, to make an enzyme) than with proteins outside the group.

So, nodes are individual data points, edges are connections between them, networks are the overall structure, and communities are tightly-knit groups within that structure.


Advanced User Guide

Here are the optional commands that can be utilized for cartogene:

Extra: There should be maxium number of ~4000 chemicals that can be utilized as stated by the CTD Batch Query API.