Open kmartinez834 opened 1 year ago
From #115: This is about integrating species, disease, tissue, etc. data into our system. This task may become optional for 2.0 if we cannot finish it.
Moved to 2.1 task list - will prioritize in May 2023
@ubhuiyan
I created the following files (same content, different file types) to deal with the line breaks in the txt file. You can use these to help with mapping/instructions for Robel (the script I wrote is at /software/glygen/carbbank-mapping.py):
generated/datasets/compiled/carbbank.csv
generated/datasets/compiled/carbbank.json
I would aim to get the first 2 bullet points done this release, and try to work on tissue/cell line/disease mapping over time:
/data/projects/glygen/generated/misc/carbbank_mapping.csv, which includes all of the unique BS entries for the GlyGen organisms.
--> In some cases, you have to look at multiple subfields, for example (disease) cancer, (OT) leukemia.
It would be good to know what the workflow is. In the end we want to assign species, tissue, etc. to glycans in the file.
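To make the multiple-subfield case concrete, here is a minimal sketch of how a BS entry could be split into its subfields before mapping. It assumes each entry is a comma-separated run of "(tag) value" pairs, as in the "(disease) cancer, (OT) leukemia" example; the function names `parse_bs_entry` and `collect_terms` are hypothetical and are not taken from carbbank-mapping.py.

```python
import re

# Matches a subfield tag such as "(disease)" or "(OT)" and captures the tag name.
TAG_RE = re.compile(r"\(([^)]+)\)")


def parse_bs_entry(entry):
    """Split a BS entry into a {tag: value} dict.

    Assumes the entry looks like "(tag) value, (tag) value, ...".
    """
    # re.split with a capturing group returns:
    # [text before first tag, tag1, value1, tag2, value2, ...]
    parts = TAG_RE.split(entry)
    subfields = {}
    for tag, value in zip(parts[1::2], parts[2::2]):
        # Drop surrounding whitespace and the comma that separates subfields.
        subfields[tag.strip()] = value.strip().strip(",").strip()
    return subfields


def collect_terms(subfields, tags=("disease", "OT")):
    """Gather the values of several subfields so they can be reviewed together."""
    return [subfields[t] for t in tags if subfields.get(t)]


entry = "(disease) cancer, (OT) leukemia"
parsed = parse_bs_entry(entry)
print(parsed)                 # {'disease': 'cancer', 'OT': 'leukemia'}
print(collect_terms(parsed))  # ['cancer', 'leukemia']
```

Collecting both values side by side lets whoever does the mapping see that the specific disease term sits in the OT subfield rather than in the disease subfield. Which tags to combine for tissue or cell line would still need to be decided as part of the workflow.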