Closed dustine32 closed 6 months ago
From @dustine32, prototype at : https://go-public.s3.amazonaws.com/pathway-like_go-cams_2024-05-07.tar.gz
@kltm I have a draft repo with a Makefile (and README!) ready to go at https://github.com/geneontology/sparql-for-pathway-go-cams. Let me know if you want me to change this at all to fit in the GO pipeline better.
@dustine32 Cheers! If there is no deus ex machina that saves the qc side of things, I'll attempt to switch those (after testing locally).
Context:
sjcarbon@moiraine:~/local/src/git/sparql-for-pathway-go-cams[main]$:) ls -AlFrt
total 39247844
drwxr-xr-x 2 sjcarbon sjcarbon 4096 May 9 17:56 sparql/
-rw-r--r-- 1 sjcarbon sjcarbon 845 May 9 17:56 README.md
-rw-r--r-- 1 sjcarbon sjcarbon 7 May 9 17:56 .gitignore
drwxr-xr-x 2 sjcarbon sjcarbon 4096 May 10 16:53 scripts/
-rw-r--r-- 1 sjcarbon sjcarbon 2627 May 10 16:53 Makefile
drwxr-xr-x 8 sjcarbon sjcarbon 4096 May 10 16:53 .git/
-rw-r--r-- 1 sjcarbon sjcarbon 40189755392 May 10 17:10 blazegraph-production.jnl
Running:
sjcarbon@moiraine:~/local/src/git/sparql-for-pathway-go-cams[main]$:) NOCTUA_MODELS_PATH=/home/sjcarbon/local/src/git/noctua-models make target/pathway-like_go-cams.tar.gz
Final product:
target/pathway-like_go-cams.tar.gz
@dustine32 Noting test inclusion in second stage snapshot: http://skyhook.berkeleybop.org/snapshot/products/ttl/pathway-like_go-cams.tar.gz This look right to you?
Noting: ~17m of runtime.
This look right to you?
@kltm Yup! (mostly) A minor thing is it untars retaining the target/pathway_like_go_cams/
directory structure instead of just pathway_like_go_cams/
. I’ll see if this happens on my local end but I’m fine with it if you are!
@dustine32 Hm, I don't think I'm doing anything different than the make
above (https://github.com/geneontology/pipeline/issues/376#issuecomment-2105407963). Does this happen when you're running locally?
@kltm Dang, it is on my end (in the Makefile
)! I'll fix this quick.
Commit https://github.com/geneontology/sparql-for-pathway-go-cams/commit/83f3e3cc7583295adca20c54e6a739c54379602c should fix the target/
issue. Sorry, I guess I never tested extracting the tar.gz
product.
No worries--I'll give it another run now.
@dustine32 Run through. Howzit look now?
Cool! It untars as intended now:
$ tar -zxvf pathway-like_go-cams.tar.gz
x pathway_like_go_cams/
x pathway_like_go_cams/641ce4dc00000214.ttl
x pathway_like_go_cams/65bc474400000788.ttl
x pathway_like_go_cams/63f809ec00000347.ttl
x ...
Thanks @kltm!
@pgaudet This will go out with the snapshots
starting next week. If you can think of a better place than
/products/ttl/pathway-like_go-cams.tar.gz
let me know and we can move it.
For each GO release, run SPARQL queries against
blazegraph-production.jnl
using blazegraph-runner to find "pathway-like" GO-CAM models and save their TTL files as requested by @thomaspd.The "pathway-like" criteria differs a bit from the usual GO-CAM website "get all causal models" query:
has_output
andhas_input
edges.I have a start on these queries here (can transfer repo to GO org or individual queries to appropriate GO repo).