Knowledge-Graph-Hub / knowledge-graph-hub-support

Issues, support, and discussion for KG-Hub. Covers tools, infrastructure, and graph projects.
BSD 3-Clause "New" or "Revised" License
5 stars 2 forks source link

Distribute graph TSV enabling remote querying (e.g, with DuckDB) as per Monarch KG #20

Open caufieldjh opened 6 months ago

caufieldjh commented 6 months ago

From @cmungall on Slack:

I would be great if KG-Hub was distributed in the same way as the Monarch KG facilitating remote queries; e.g.


$ duckdb

select predicate, count(*) from 'https://data.monarchinitiative.org/monarch-kg/2024-03-18/monarch-kg-denormalized-edges.tsv.gz' group by predicate;

┌────────────────────────────────────────────────────┬──────────────┐ │ predicate │ count_star() │ │ varchar │ int64 │ ├────────────────────────────────────────────────────┼──────────────┤ │ biolink:orthologous_to │ 551239 │ │ biolink:related_to │ 179842 │ │ biolink:part_of │ 94177 │ │ biolink:actively_involved_in │ 799846 │ │ biolink:acts_upstream_of_or_within │ 181004 │ │ biolink:acts_upstream_of_negative_effect │ 174 │ │ biolink:expressed_in │ 2307867 │ │ biolink:located_in │ 504697 │ │ biolink:acts_upstream_of_or_within_negative_effect │ 181 │ │ biolink:treats_or_applied_or_studied_to_treat │ 5640 │ │ biolink:interacts_with │ 2913608 │ │ biolink:colocalizes_with │ 4051 │ │ biolink:acts_upstream_of_positive_effect │ 509 │ │ biolink:acts_upstream_of_or_within_positive_effect │ 507 │ │ biolink:gene_associated_with_condition │ 8004 │ │ biolink:has_phenotype │ 1171787 │ │ biolink:subclass_of │ 493276 │ │ biolink:acts_upstream_of │ 9343 │ │ biolink:has_mode_of_inheritance │ 8492 │ │ biolink:enables │ 845645 │ │ biolink:active_in │ 145898 │ │ biolink:contributes_to │ 7418 │ │ biolink:participates_in │ 271247 │ │ biolink:causes │ 6670 │ ├────────────────────────────────────────────────────┴──────────────┤ │ 24 rows 2 columns │ └───────────────────────────────────────────────────────────────────┘