RTXteam / RTX

Software repo for Team Expander Agent (Oregon State U., Institute for Systems Biology, and Penn State U.)
https://arax.ncats.io/
MIT License
34 stars 20 forks source link

KG2.9.2c Rollout #2274

Open sundareswarpullela opened 2 months ago

sundareswarpullela commented 2 months ago

THE BRANCH FOR THIS ROLLOUT IS: KG2.9.2c THE ARAX-DATABASES.RTX.AI DIRECTORY FOR THIS ROLLOUT IS: /home/rtxconfig/KG2.9.2

Prerequisites

ssh access

To complete this workflow, you will need ssh access to:

GitHub access
AWS access

You will need:

Slack workspaces

You will also need access to the following Slack workspaces:

Example ssh config for setting up login into arax.ncats.io:

Host arax.ncats.io
    User stephenr
    ProxyCommand ssh -i ~/.ssh/id_rsa_long -W %h:%p stephenr@35.87.194.254
    IdentityFile ~/.ssh/id_rsa_long
    Hostname 172.31.53.16

1. Build and load KG2c:

2. Rebuild downstream databases:

The following databases should be rebuilt and copies of them should be put in /home/rtxconfig/KG2.X.Y on arax-databases.rtx.ai. Please use this kind of naming format: mydatabase_v1.0_KG2.X.Y.sqlite.

NOTE: As databases are rebuilt, RTX/code/config_dbs.json will need to be updated to point to their new paths! Push these changes to the branch for this KG2 version, unless the rollout of this KG2 version has already occurred, in which case you should push to master (but first follow the steps described here).

3. Update the ARAX codebase:

All code changes should go in the branch for this KG2 version!

4. Pre-upload databases:

Before rolling out, we need to pre-upload the new databases (referenced in config_dbs.json) to arax.ncats.io and the ITRB SFTP server. These steps can be done well in advance of the rollout; it doesn't hurt anything to do them early.

5. Rollout new KG2c version to arax.ncats.io development endpoints

6. Final items/clean up:

7. Roll-out to ITRB TEST

8. Roll-out to ITRB PRODUCTION

sundareswarpullela commented 1 month ago

Test synonymizer has been built finally. Ran into some issues with sufficient splitting of the full-fledged KG2pre 2.9.2 files to generate the test build set. Needing to use 1,000,000 nodes and edges to get a sizable set of nodes and edges where subject and object nodes of edges, both are present in the nodes set.

amykglen commented 3 weeks ago

current status: plover pytest suite is fully passing for KG2.9.2c; NGD build is underway on ngdbuild.rtx.ai.

amykglen commented 3 weeks ago

ok, the NGD build is done and the database is uploaded and everything, so I think we can proceed with ARAX testing!

amykglen commented 3 weeks ago

so I've updated the Expand and Resultify pytests as needed and all are passing now in the KG2.9.2c branch (including --runslow tests). basically all that was required was changing usages of three predicates as follows:

treats --> treats_or_applied_or_studied_to_treat
predisposes --> predisposes_to_condition
ameliorates --> ameliorates_condition