ExposuresProvider / cam-pipeline

Data loading pipeline for CAM database
https://exposuresprovider.github.io/cam-pipeline/
MIT License
2 stars 4 forks source link

SPARQL reports are running out of memory with larger dataset #73

Closed balhoff closed 1 month ago

balhoff commented 2 years ago

The queries are failing after the addition of CTD data.

gaurav commented 8 months ago

I think this was fixed as part of https://github.com/ExposuresProvider/cam-pipeline/pull/108?

balhoff commented 8 months ago

The new pipeline doesn't run SPARQL reports. If we reenable them, we should just implement them inside the datalog script.

gaurav commented 1 month ago

As per Jim's previous comment, this shouldn't be an issue any longer, and we can get some high-level stats from the Automat CAM-KP metadata endpoint (https://automat.renci.org/cam-kp/metadata). Closing.