ExposuresProvider / cam-pipeline

Data loading pipeline for CAM database
https://exposuresprovider.github.io/cam-pipeline/
MIT License

Group results by provenances #110

Status: Open. gaurav opened this issue 9 months ago

gaurav commented 9 months ago

We currently produce many distinct edges that share the same provenance (e.g. 3000 different edges saying "X is found in humans [source]"). We should modify the kg_edges Souffle script so that it groups edges that have the same provenance. This should also make our outputs much smaller and make downstream processing by Evan's tools easier.
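
To make the proposal concrete, here is a minimal Python sketch of the grouping idea. It is not the kg_edges Souffle script, and it only illustrates one possible output shape: a single record per provenance that lists every edge attributed to it. The tuple layout `(subject, predicate, object, provenance)`, the function name `group_by_provenance`, and the identifiers in the sample data are all hypothetical and not taken from the pipeline.

```python
from collections import defaultdict


def group_by_provenance(edges):
    """Collect edges under their shared provenance.

    `edges` is an iterable of (subject, predicate, object, provenance)
    tuples. This layout is an assumption made for illustration only.
    """
    grouped = defaultdict(list)
    for subject, predicate, obj, provenance in edges:
        grouped[provenance].append((subject, predicate, obj))
    return dict(grouped)


# Hypothetical sample data: two edges share "source-A", one uses "source-B".
edges = [
    ("GENE:1", "found_in", "NCBITaxon:9606", "source-A"),
    ("GENE:2", "found_in", "NCBITaxon:9606", "source-A"),
    ("CHEBI:3", "found_in", "NCBITaxon:9606", "source-B"),
]

grouped = group_by_provenance(edges)
for provenance, members in grouped.items():
    print(provenance, len(members))
```

With this shape the provenance appears once per group rather than once per edge, which is where the size reduction would come from. Whether downstream tools would prefer this layout or some other grouping is an open question for this issue.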