chhongs / cloud-bug-prediction-asat


remove duplicates in the csv #12

Closed chhongs closed 4 years ago

chhongs commented 4 years ago

The code produces multiple entries for the same class, some indicating a bug and some not. This happens because the metrics are computed for each commit and for each affected file in that commit.

chhongs commented 4 years ago

Couldn't find a way to do this yet. Perhaps the ML pipeline could be extended to preprocess the data accordingly.
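
A minimal sketch of what such a preprocessing step could look like, not the project's actual implementation. The column names `class_name`, `loc`, and `bug`, the sample data, and the helper `deduplicate_rows` are all hypothetical. The merge policy is also an assumption: metrics are taken from the last row seen for a class, and the class is labelled buggy if any of its rows was.

```python
import csv
import io


def deduplicate_rows(rows, key="class_name", label="bug"):
    """Collapse rows sharing `key` into one row per class.

    Assumed policy (not taken from the project): keep the metrics of the
    last row seen for each class, and set the label to 1 if any row for
    that class was labelled buggy.
    """
    merged = {}
    for row in rows:
        name = row[key]
        buggy = int(row[label])
        if name in merged:
            # Carry forward an earlier "buggy" label for this class.
            buggy = max(buggy, int(merged[name][label]))
        # Later rows overwrite the metrics; the label is the running maximum.
        merged[name] = {**row, label: str(buggy)}
    return list(merged.values())


# Hypothetical CSV with one row per (commit, affected file) pair.
SAMPLE_CSV = """class_name,loc,bug
Foo,120,0
Bar,80,0
Foo,135,1
Foo,140,0
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
deduped = deduplicate_rows(rows)

for row in deduped:
    print(row)
```

Other policies, such as averaging the metrics or keeping only the row from the most recent commit, would fit the same structure; which one is appropriate depends on how the labels are meant to be used for training.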

YasaraPeiris commented 4 years ago

The required functionality has been implemented at both the Java code level and the machine learning preprocessing level.