intellistream / Sesame

[SIGMOD'23] Data Stream Clustering: An In-depth Empirical Study [ICDM'24] MOStream: A Modular and Self-Optimizing Data Stream Clustering Algorithm
MIT License

CluStream segmentation fault when `offline_time_window != 0` #145

Closed by wzru 2 years ago

wzru commented 2 years ago

How to reproduce: change the value at https://github.com/intellistream/Sesame/blob/21afdd73e92c2d8adaec5600d7b9c95a0f8d50e8/test/SystemTest/CluStreamTest.cpp#L30 to 2, then build and run the test:

$ make -j4
$ ./google_test --gtest_filter=SystemTest.CluStreamTest
Running main() from /home/shaun/Sesame/build/_deps/googletest-src/googletest/src/gtest_main.cc
Note: Google Test filter = SystemTest.CluStreamTest
[==========] Running 1 test from 1 test suite.
[----------] Global test environment set-up.
[----------] 1 test from SystemTest
[ RUN      ] SystemTest.CluStreamTest
Read from the file...
Complete reading from the file...
Finished loading input data
data number: 3000
Algorithm: CluStream num_last_arr: 8   time_window: 300   num_offline_clusters: 7   ClusterNumber: 150   radius: 20   buf_size: 500
DataSource spawn thread=0
Engine spawn thread=1
DataSink spawn thread=2
DataSink start to grab data
Algorithm start to process data
DataSource start to emit data
KMeans++ start!!!
sourceEnd set to true
DataSource sourceEnd emit data
DataSource takes 299983 useconds to finish.
Point cluster need to be adjust, start a new iteration!
Point cluster need to be adjust, start a new iteration!
Point cluster need to be adjust, start a new iteration!
Point cluster need to be adjust, start a new iteration!
Point cluster need to be adjust, start a new iteration!
KMeans++ sourceEnd!!!
ready to process remaining data
ready to offline clustering
Segmentation fault
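
The log shows the crash right after "ready to offline clustering", and only with a nonzero `offline_time_window`. In the published CluStream algorithm, a nonzero horizon means the offline phase subtracts an older snapshot of the micro-clusters from the current one. The sketch below is not Sesame's code and does not claim to identify the actual bug. It is a hypothetical, simplified illustration (C++17) of how that lookup can be guarded so that a missing older snapshot yields a fallback instead of an invalid dereference. The names `MicroCluster`, `Snapshot`, `SnapshotAtOrBefore`, and `WithinHorizon` are all assumptions, as is treating a window of 0 as "use the full current snapshot".

```cpp
#include <cassert>
#include <cstddef>
#include <iterator>
#include <map>
#include <optional>
#include <vector>

// Hypothetical, simplified micro-cluster summary (not Sesame's type):
// an id, a point count, and a per-dimension linear sum.
struct MicroCluster {
    int id;
    long n;
    std::vector<double> ls;
};

using Snapshot = std::vector<MicroCluster>;

// Return the most recent snapshot taken at or before time `t`,
// or std::nullopt when no snapshot is old enough.
std::optional<Snapshot> SnapshotAtOrBefore(
        const std::map<long, Snapshot>& store, long t) {
    auto it = store.upper_bound(t);  // first snapshot strictly after t
    if (it == store.begin()) return std::nullopt;
    return std::prev(it)->second;
}

// Micro-clusters restricted to the last `window` time units:
// current snapshot minus the older one, matched by id.
Snapshot WithinHorizon(const std::map<long, Snapshot>& store,
                       long now, long window) {
    auto current = SnapshotAtOrBefore(store, now);
    if (!current) return {};
    if (window == 0) return *current;  // assumption: 0 disables the horizon

    auto old = SnapshotAtOrBefore(store, now - window);
    if (!old) return *current;  // no older snapshot: fall back, do not dereference

    Snapshot result;
    for (const MicroCluster& c : *current) {
        MicroCluster diff = c;
        for (const MicroCluster& o : *old) {
            if (o.id != c.id) continue;
            assert(o.ls.size() == c.ls.size());  // dimensions must agree
            diff.n -= o.n;
            for (std::size_t d = 0; d < diff.ls.size(); ++d) diff.ls[d] -= o.ls[d];
        }
        if (diff.n > 0) result.push_back(diff);  // drop clusters with no recent points
    }
    return result;
}
```

Running the failing test under a debugger or a build with AddressSanitizer would show whether the real crash is in a lookup of this kind or somewhere else in the offline phase.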