KNN baseline not reproducible and incorrect information about the HAR dataset

mims-harvard / TFC-pretraining

Self-supervised contrastive learning for time series via time-frequency consistency

MIT License


I have been unable to reproduce the results of the KNN (K=2) baseline on the provided datasets, except for Epilepsy, where I get the same score as reported in the paper. On the other datasets my results differ significantly from the reported ones. For example, on the EMG dataset I get an accuracy of 0.122, while the paper reports 0.439.
The information provided about the HAR dataset also appears to be incorrect. The paper states that the dataset contains 9 channels, but on inspection I found that the provided data contains only 3 channels. This discrepancy needs to be addressed.
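A minimal sketch of how the channel count can be checked, assuming the samples are available as an array laid out as (samples, channels, timesteps). The helper name `describe_channels` and the placeholder array are hypothetical. The array stands in for the real HAR samples, and its 128 timesteps are an arbitrary choice.

```python
import numpy as np

def describe_channels(samples):
    """Report the dimensions of an array laid out as (samples, channels, timesteps)."""
    samples = np.asarray(samples)
    # Treat a 2-D array as univariate data: (samples, timesteps) -> (samples, 1, timesteps)
    if samples.ndim == 2:
        samples = samples[:, None, :]
    n_samples, n_channels, n_timesteps = samples.shape
    return {"n_samples": n_samples, "n_channels": n_channels, "n_timesteps": n_timesteps}

# Placeholder with 3 channels, standing in for the real HAR samples (assumption)
placeholder = np.zeros((8, 3, 128))
info = describe_channels(placeholder)
print(info["n_channels"])
```

Running the same helper on the actual training array shows directly whether the data has 3 or 9 channels.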
Could you please explain how the KNN baseline was run on multivariate time series data, and correct the number of channels stated for the HAR dataset? My goal was simply to do a quick development run, so any help in reproducing the results would be greatly appreciated. Thank you.
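To make the question concrete, here is one possible way to run KNN on multivariate series: flatten each sample's channels and timesteps into a single vector and apply brute-force Euclidean nearest neighbours with K=2. This is a sketch under that assumption, not necessarily how the paper's baseline was run. The function `knn_predict` and the synthetic data are hypothetical and only illustrate the procedure.

```python
import numpy as np

def knn_predict(train_x, train_y, test_x, k=2):
    """Brute-force k-nearest-neighbour prediction with Euclidean distance."""
    # Flatten (samples, channels, timesteps) -> (samples, channels * timesteps)
    tr = train_x.reshape(len(train_x), -1).astype(np.float64)
    te = test_x.reshape(len(test_x), -1).astype(np.float64)
    # Squared distances via ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a.b
    d2 = (te ** 2).sum(axis=1)[:, None] + (tr ** 2).sum(axis=1)[None, :] - 2.0 * te @ tr.T
    # Indices of the k closest training samples for each test sample
    nearest = np.argsort(d2, axis=1, kind="stable")[:, :k]
    votes = train_y[nearest]
    n_classes = int(train_y.max()) + 1
    # Majority vote; with an even k, ties go to the smallest label
    return np.array([np.bincount(v, minlength=n_classes).argmax() for v in votes])

# Synthetic two-class data with 3 channels and 16 timesteps (illustration only)
rng = np.random.default_rng(0)
train_x = np.concatenate([rng.normal(0.0, 1.0, (20, 3, 16)), rng.normal(5.0, 1.0, (20, 3, 16))])
train_y = np.array([0] * 20 + [1] * 20)
test_x = np.concatenate([rng.normal(0.0, 1.0, (5, 3, 16)), rng.normal(5.0, 1.0, (5, 3, 16))])
test_y = np.array([0] * 5 + [1] * 5)

pred = knn_predict(train_x, train_y, test_x, k=2)
acc = float((pred == test_y).mean())
print(acc)
```

With K=2 the vote is often tied, so the tie-breaking rule (here, the smallest label wins) can noticeably affect accuracy. Knowing which rule and which distance the paper's baseline used would help explain the gap.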
