ldbc / ldbc_snb_interactive_v1_impls

Reference implementations for LDBC Social Network Benchmark's Interactive workload.
https://ldbcouncil.org/benchmarks/snb-interactive
Apache License 2.0
97 stars 85 forks source link

Invalid record in validation-params-sf0.1.csv #401

Closed winattack closed 8 months ago

winattack commented 8 months ago

for this record with messageIdContent 1099511633351: {"messageIdContent":1099511633351}|{"messageContent":"About Augustine of Hippo, ns, the alleviation of sAbout Paul the Apostle, ile groups that wo","messageCreationDate":1347529349776}

I checked both datasets of social_network-csv_basic-sf0.1 and social_network-csv_composite-longdateformatter-sf0.1, no such id in comment_0_0.csv and post_0_0.csv, so

  1. where does this record generated?
  2. If the validation-params-sf0.1.csv is old, where can i get the latest?
szarnyasg commented 8 months ago

Hi @winattack,

This entry is in the forum update stream:

$ grep 1347529349776 social_network-sf0.1-CsvBasic-LongDateFormatter/updateStream_0_0_forum.csv
1347529349776|1271326709963|6|1099511633351||1347529349776|27.54.154.214|Internet Explorer|uz|About Augustine of Hippo, ns, the alleviation of sAbout Paul the Apostle, ile groups that wo|92|2199023257128|68719477057|1|6;1940;2785;11687

Note that the data sets also changed between Interactive v0.3.5 and v1.0.0 (the change is minimal and only affects timestamps around posts, and the directory names are slightly different). So make sure you use the v1.0.0 repository: https://ldbcouncil.org/data-sets-surf-repository/snb-interactive-v1-datagen-v100