Open dufaultc opened 2 years ago
Addressed in string_db_pull.py
Source of data: string-db.org (protein links full)
Where data is located: ./Corpuses (after running string_db_pull.py)
Description of data: JSON format. Each protein is linked to an associated protein, along with the relevant per-channel scores and a combined score.
Example:
"9606.ENSP00000000233": {
"protein2": "9606.ENSP00000361716",
"neighborhood": "0",
"neighborhood_transferred": "0",
"fusion": "0",
"cooccurence": "0",
"homology": "0",
"coexpression": "0",
"coexpression_transferred": "62",
"experiments": "0",
"experiments_transferred": "212",
"database": "0",
"database_transferred": "0",
"textmining": "0",
"textmining_transferred": "85",
"combined_score": "264"
},
This data comes from string-db.org and will be used to construct the gene-gene edges of our graph.
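As a rough sketch of how records like the example above could be turned into edges, the snippet below reads the parsed JSON and emits `(protein1, protein2, combined_score)` tuples. The function name `iter_edges` and the `min_score` cutoff are hypothetical and are not taken from string_db_pull.py. It also assumes that each top-level key maps either to a single record (as in the example) or to a list of such records. The identifiers remain STRING protein IDs; mapping them to gene names is not covered here.

```python
import json


def iter_edges(links, min_score=0):
    """Yield (protein1, protein2, combined_score) tuples from parsed JSON.

    `links` is a dict keyed by the first protein's STRING ID.
    `min_score` is a hypothetical cutoff on combined_score.
    """
    for protein1, entry in links.items():
        # Assumption: a value may be one record or a list of records.
        records = entry if isinstance(entry, list) else [entry]
        for record in records:
            # Scores are stored as strings in the JSON, so convert first.
            score = int(record["combined_score"])
            if score >= min_score:
                yield protein1, record["protein2"], score


# A trimmed copy of the example record from this issue.
sample = json.loads("""
{
  "9606.ENSP00000000233": {
    "protein2": "9606.ENSP00000361716",
    "coexpression_transferred": "62",
    "experiments_transferred": "212",
    "textmining_transferred": "85",
    "combined_score": "264"
  }
}
""")

edges = list(iter_edges(sample, min_score=200))
no_edges = list(iter_edges(sample, min_score=400))
```

With the sample record, a cutoff of 200 keeps the single edge, while a cutoff of 400 drops it. Which cutoff is appropriate for our graph, if any, is still an open question.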
Information Needed