InternetHealthReport / internet-yellow-pages

A knowledge graph for Internet resources
GNU General Public License v3.0
39 stars 16 forks source link

Issue #116 Include layer information to stanford.asdb AS categories #126

Closed JustinLoye closed 7 months ago

JustinLoye commented 7 months ago

Description

Added a layer property to links -[r:CATEGORIZED {reference_name: "stanford.asdb"}]- Added a PART_OF relationship (subcategory) -> (category) Removed Other and other subcategories

Motivation and Context

Issue 116

How Has This Been Tested?

Thorough check of my file parsing (noticed new errors).

Overall number of links -[r:CATEGORIZED {reference_name: "stanford.asdb"}]-: before 394 834, now roughly twice more 592 754

Pushed and observed changes on my local iyp

Screenshots (if appropriate):

Screenshot from 2024-02-16 14-08-46

Types of changes

Checklist:

JustinLoye commented 7 months ago

As a side comment, I checked that the PART_OF links are well behaved

MATCH (n: Tag)-[r:PART_OF {reference_name: "stanford.asdb"}]-(m: Tag) return n, r, m Should return ${number_of_layer1_categories} graphs of diameter 2.

As shown in this screenshot, it's almost the case Screenshot from 2024-02-16 17-12-29

the exception (left-most graph) being caused by the error mentioned in https://github.com/InternetHealthReport/internet-yellow-pages/issues/116#issuecomment-1920688892