bupaverse / daqapo

R-package for data quality assessment of process-oriented data
https://nielsmartin.github.io/daqapo/

daqapo does not take eventlogs or data.frames but only tibbles? #12

Open mlane3 opened 3 years ago

mlane3 commented 3 years ago

I really enjoyed the last process mining conference and wanted to try daqapo for myself, but I ran into an unusual error. The team has already used tidyverse to shrink a large processing model from 5000 lines of code to about 1000 lines (now split into small 50-line chunks). I am hoping we can use daqapo to replace some of the custom cleaning functions we wrote.

However, we ran into a weird issue.

R version = 4.0.2 or 4.0.3 (we found the error in both)
daqapo = 0.3.1

It seems that daqapo will only clean 4.0 Tibbles and activity logs, but will not clean eventlogs, data.frames, or special data.frames (i.e. tidyr legacy Tibbles). This means we have to recode the event log at every data cleaning step, which makes no sense.

patients %>% detect_missing_values()

Output:

Error in UseMethod("detect_missing_values") :
  no applicable method for 'detect_missing_values' applied to an object of class "c('eventlog', 'tbl_df', 'tbl', 'data.frame')"

gertjanssenswillen commented 2 years ago

Indeed, daqapo currently supports only activitylogs. Support for eventlogs is expected in the future. The activity log is a new data format that daqapo introduced, and we are first rolling it out across the other bupaR packages.
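
For readers who hit the same message, here is a minimal base R sketch of why the dispatch fails. It is not daqapo's code: `count_missing` is a hypothetical stand-in generic, and the two objects are plain data frames with hand-assigned class vectors. It only assumes that a generic has a method for the `activitylog` class and no default method.

```r
# Toy S3 generic standing in for a daqapo-style check (hypothetical name).
count_missing <- function(log, ...) {
  UseMethod("count_missing")
}

# A method is registered only for the "activitylog" class.
count_missing.activitylog <- function(log, ...) {
  sum(is.na(log))
}

df <- data.frame(case = c("a", "b"), activity = c("triage", NA))

# Object carrying the class vector reported in the error message.
as_eventlog <- structure(df, class = c("eventlog", "tbl_df", "tbl", "data.frame"))

# Same data, tagged with the class the method is written for.
as_activitylog <- structure(df, class = c("activitylog", "data.frame"))

# Dispatches to count_missing.activitylog.
n_missing <- count_missing(as_activitylog)

# None of the event log's classes has a method and there is no default,
# so dispatch fails; the error message is captured instead of stopping.
err <- tryCatch(count_missing(as_eventlog), error = function(e) conditionMessage(e))
print(err)
```

The practical consequence is that the log has to carry the `activitylog` class before it reaches the function. Assuming the installed bupaR is recent enough to provide `to_activitylog()`, converting the event log once up front may avoid recoding at every step; whether that class is accepted by a given daqapo version is not confirmed here.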

mlane3 commented 2 years ago

Thank you for the update. Please keep me posted.