Closed LeonBein closed 1 year ago
Hi @LeonBein, thanks for reporting the issue. We're looking into it. However, it seems we are at the edge of what Split Miner can do here. Simod uses Split Miner first for basic model discovery and then optimises it by tuning different parameters. But if the model cannot be mined in any of the iterations, it fails.
Here's the Split Miner paper, https://link.springer.com/article/10.1007/s10115-018-1214-x#Sec17. The authors actually claim that they used BPIC 15 in the evaluation but applied filtering (https://ieeexplore.ieee.org/document/7579568/references#algorithm1) beforehand:
… we applied the filtering method in [11] to remove infrequent behavior prior to applying each of the discovery methods. Without this filtering step, all the method generated models with an F-score of close to zero due to the complexity of these logs …
I'll leave this issue open for now as a feature request in case we can extract the filtering implementation from the “Infrequent Behavior Filter” plugin for the ProM framework.
BTW, this is how the model looks mined with another tool:
CC: @david-chapela, @marlondumas
I confirm the open-source Split Miner implementation might not be able to handle BPIC 2015 unfiltered. To handle this log, it would require some cleaning (preprocessing) and perhaps more than 2GB of RAM. Workaround:
Yes, I agree it probably doesn't make sense to mine this log without preprocessing. However, I am unsure whether the bug only occurs due to the size: Digging a little bit more, I found the following commit https://github.com/apromore/ProMforApromore/commit/050e23849631e451dcdc265c0fef4f24f082cfb1 which seems to fix the bug for a newer version of the plugin.
Maybe the plugin version used in Simod can be updated?
Hi @LeonBein,
Thanks for the hints! For now, @iharsuvorau managed to discover a BPMN process model with Apromore to bypass the use of SplitMiner, and run Simod to obtain the rest of the simulation parameters (we have an option to provide the process model in a BPMN file and skip the control-flow discovery phase).
In case you are interested in working with the BPIC 2015, we placed the discovered process model, as well as the Simod configuration in this folder. Also, a test running this can be found in this file.
We will take a look at the SplitMiner repository, and the Apromore repository you shared, to see if we can update the version of SplitMiner we have.
@LeonBein This issues should've been resolved by now. We've replaced recently the Split Miner build with a new one with less dependencies.
When running Simod on the BPI challenge 2015 Municipality 1 log the run fails with
java.lang.IllegalArgumentException: Comparison method violates its general contract
in one of the SplitMiner plugins.Here is my Simod Config:
Here is the full console log: