Closed TchaloSon closed 1 year ago
Hi ! I would like to know,if we should merge the csv files in each folder or only make a choice between them
Example: In pregenerated files,we have CommentHasTagTag folder which have 4 csv files:
file1=parentPath+"part-00000-9b2d99a7-efc1-4c6b-bacc-d2092b8f3ed7-c000.csv" file2=parentPath+"part-00001-9b2d99a7-efc1-4c6b-bacc-d2092b8f3ed7-c000.csv" file3=parentPath+"part-00004-9b2d99a7-efc1-4c6b-bacc-d2092b8f3ed7-c000.csv" file4=parentPath+"part-00006-9b2d99a7-efc1-4c6b-bacc-d2092b8f3ed7-c000.csv"
I wonder ,if these files should be merged or just choose on of them for the benchmark. Thanks
Hi @Aboudourazakou, all CSV parts should be used for the benchmark.
Thank you so much!
Hi ! I would like to know,if we should merge the csv files in each folder or only make a choice between them
Example: In pregenerated files,we have CommentHasTagTag folder which have 4 csv files:
I wonder ,if these files should be merged or just choose on of them for the benchmark. Thanks