kuzudb / kuzu

Embeddable property graph database management system built for query speed and scalability. Implements Cypher.
https://kuzudb.com/
MIT License
1.36k stars 96 forks source link

BI Benchmark bulk delete causes Segmentation Fault #2828

Open mxwli opened 8 months ago

mxwli commented 8 months ago

Currently, everything in the BI benchmark is done with a scale factor of 1.

In the BI Benchmark, running the delete queries causes a segmentation fault. Currently, there is also an issue with the insert queries that causes the benchmark to not complete. In order to skip the insert queries, replace these lines in benchmark.py:89

insert_nodes = ["Comment", "Forum", "Person", "Post"]
insert_edges = ["Comment_hasCreator_Person", "Comment_hasTag_Tag", "Comment_isLocatedIn_Country", "Comment_replyOf_Comment", "Comment_replyOf_Post", "Forum_containerOf_Post", "Forum_hasMember_Person", "Forum_hasModerator_Person", "Forum_hasTag_Tag", "Person_hasInterest_Tag", "Person_isLocatedIn_City", "Person_knows_Person", "Person_likes_Comment", "Person_likes_Post", "Person_studyAt_University", "Person_workAt_Company", "Post_hasCreator_Person", "Post_hasTag_Tag", "Post_isLocatedIn_Country"]

with

insert_nodes = ["Comment"]
insert_edges = []

and remove --queries in scripts/benchmark.sh

and then run the benchmark as specified in the readme file. Running the delete queries gets a segmentation fault in the first delete, which takes the form

LOAD WITH HEADERS (deletionDate TIMESTAMP, id INT64) FROM $csv_file (header=true, delim='|')
MATCH (:Comment {id: id})<-[:REPLY_OF*0..]-(comment:Comment)
DETACH DELETE comment
RETURN count(*)

and can be found here

mxwli commented 8 months ago

Closed because currently it cannot be reproduced. May be reopened once I find out why.

mxwli commented 8 months ago

reopened because the issue is now reproducable