Graph Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, NetworkX, RAPIDS, RDFlib, pySHACL, PyVis, morph-kgc, pslpython, pyarrow, etc.
Just exploring a little and noticed that load_parquet seems to hang when loading from a saved Parquet file. At the very least, reading the KG back from file takes far longer than creating it did: generating the KG from a CSV (kg.add(...)) takes 2 minutes, while loading the file has been running for over 15 minutes and appears to be hanging. Any ideas?
The parquet file is ~9MB, and the kg has 200k nodes and 4 Literal relations per node.
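For anyone trying to reproduce the comparison above, here is a minimal timing sketch. The `timed` helper and the "kg.parquet" path are hypothetical names introduced for illustration, not part of the library. The kglab calls appear only in comments, so the snippet runs without kglab installed.

```python
import time
from typing import Any, Callable, Tuple


def timed(fn: Callable[..., Any], *args: Any, **kwargs: Any) -> Tuple[Any, float]:
    """Call fn(*args, **kwargs) and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


if __name__ == "__main__":
    # Stand-in workload so the sketch runs on its own.
    _, seconds = timed(sum, range(1_000_000))
    print(f"stand-in workload: {seconds:.3f}s")

    # Hypothetical use against the report above
    # ("kg.parquet" is a placeholder path, not the actual file):
    #   import kglab
    #   kg = kglab.KnowledgeGraph()
    #   _, seconds = timed(kg.load_parquet, "kg.parquet")
    #   print(f"load_parquet: {seconds:.1f}s")
```

Timing the build step (the kg.add(...) loop) and the load step with the same helper would give directly comparable numbers for the two cases.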
Very nice library!
The code to load the file is: