ivanDonadello / Declare4Py

A Python package for declarative Process Mining with Machine Learning applications
https://declare4py.readthedocs.io/en/latest/

Declare4Py

Declare4Py is the first Python package for declarative Process Mining that provides core functionality for building Machine Learning applications on top of Process Mining. Declarative process mining defines process models through declarative behavioural rules based on Linear Temporal Logic on finite traces (LTLf). This keeps the business process model definition highly flexible without neglecting the hard constraints that must be satisfied. Moreover, declarative languages can act as a bridge between Process Mining and Machine Learning: the DECLARE encoding turns the traces in a log into a numeric format suitable as input to Machine Learning algorithms. Declare4Py implements this bridge by including standard algorithms for:

  1. declarative Process Mining with LTLf or (MP)-DECLARE templates (e.g., conformance checking, model discovery, trace generation, query checking);
  2. log encodings (e.g., complex-index, aggregate, Declare);
  3. log labelling according to filtering or declarative rules.

All the Declare4Py data formats are compatible with the main Machine Learning Python packages: scikit-learn, TensorFlow and PyTorch.
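To make the idea of a DECLARE-style encoding concrete, here is a minimal pure-Python sketch. It does not use the Declare4Py API: the helper names (`existence`, `response`, `encode`), the toy log and the binary 0/1 output are all assumptions made for illustration, and the actual encoders in the package may represent satisfaction, violation and vacuity differently.

```python
from typing import Callable, Dict, List

Trace = List[str]  # a trace is an ordered list of activity names


def existence(a: str) -> Callable[[Trace], bool]:
    """Existence(a): activity a occurs at least once in the trace."""
    return lambda trace: a in trace


def response(a: str, b: str) -> Callable[[Trace], bool]:
    """Response(a, b): every occurrence of a is eventually followed by b."""
    def check(trace: Trace) -> bool:
        pending = False  # True while some a is still waiting for a later b
        for activity in trace:
            if activity == a:
                pending = True
            elif activity == b:
                pending = False
        return not pending
    return check


def encode(log: List[Trace],
           constraints: Dict[str, Callable[[Trace], bool]]) -> List[List[int]]:
    """One row per trace, one 0/1 column per constraint (illustrative encoding)."""
    return [[int(check(trace)) for check in constraints.values()]
            for trace in log]


# Hypothetical toy log, not taken from any real dataset.
log = [
    ["register", "triage", "release"],
    ["register", "release"],
    ["triage", "register"],
]
constraints = {
    "Existence[triage]": existence("triage"),
    "Response[register, triage]": response("register", "triage"),
}
X = encode(log, constraints)
print(X)
```

The resulting list of lists can be passed to a numeric library as a feature matrix, with one row per trace and one column per constraint.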

Installation

We recommend using a virtual environment to avoid clashes between your local Python installation and the required libraries. A virtual environment can be created with Conda or with the venv Python utility. Once the virtual environment has been created, download the Declare4Py project to your local machine, activate the virtual environment and use pip or conda to install the dependencies listed in the requirements.txt file. Alternatively, you can install Declare4Py from PyPI.

In addition, the Lydia backend for LTLf conformance checking needs to be installed with Docker:

  1. Install Docker;
  2. Download the Lydia Docker image with docker pull whitemech/lydia:latest;
  3. Make the Docker image executable under the name lydia. On Linux and MacOS machines, the following commands should work:
    echo '#!/usr/bin/env sh' > lydia
    echo 'docker run -v$(pwd):/home/default whitemech/lydia lydia "$@"' >> lydia
    sudo chmod u+x lydia
    sudo mv lydia /usr/local/bin/
  4. More information can be found here.
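After completing the steps above, a quick way to confirm that the wrapper script is reachable is to look it up on the `PATH` from Python. This is only a sketch using the standard library; the helper name `find_lydia` is hypothetical, and the check says nothing about whether Docker or the image itself works.

```python
import shutil
from typing import Optional


def find_lydia(command: str = "lydia") -> Optional[str]:
    """Return the full path of the lydia wrapper if it is on PATH, else None."""
    return shutil.which(command)


path = find_lydia()
if path:
    print(f"lydia found at {path}")
else:
    print("lydia not found on PATH; re-check step 3 of the installation")
```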

Tutorials

The docs/source/tutorials/ folder contains a walk-through of Declare4Py. In order, the tutorials cover the following topics:

  1. Managing event logs: methods to manage event logs, import them, extract useful information and convert them into other formats;
  2. Managing process models: simple methods to parse and manage process models from strings and/or files and to check their satisfiability;
  3. Conformance checking of LTLf templates/formulas: check which traces in an event log satisfy a given LTLf model;
    1. Log filtering with LTLf properties: filter a log according to an LTLf model;
  4. Conformance checking of MP-DECLARE templates: check which traces in an event log satisfy a given MP-DECLARE model, along with the fulfillments/violations;
  5. Query Checking with DECLARE models: discover which activities make an input DECLARE constraint satisfied in an event log;
  6. Discovery of DECLARE models: discover which DECLARE constraints are the most satisfied in an event log;
  7. Filtering an event log: select a subset of an event log that satisfies some input properties;
  8. Log generation with an MP-DECLARE model: generate synthetic cases that satisfy an MP-DECLARE model.

The tutorials are Jupyter notebooks and use the Sepsis cases log.
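For readers new to the terms used in the list above, the following pure-Python sketch illustrates fulfillments, violations and support for a single Response constraint. It is not the Declare4Py API: the function names, the toy log and the choice to count traces with no activation as satisfied are assumptions made for illustration only.

```python
from typing import List, Tuple

Trace = List[str]  # a trace is an ordered list of activity names


def response_outcomes(trace: Trace, a: str, b: str) -> Tuple[int, int]:
    """Return (fulfillments, violations) of Response(a, b) in one trace.

    Each occurrence of a is an activation: it is fulfilled if some b
    occurs later in the trace, and violated otherwise.
    """
    fulfillments = 0
    violations = 0
    for i, activity in enumerate(trace):
        if activity == a:
            if b in trace[i + 1:]:
                fulfillments += 1
            else:
                violations += 1
    return fulfillments, violations


def support(log: List[Trace], a: str, b: str) -> float:
    """Fraction of traces with no violation of Response(a, b).

    Assumption: traces without any activation count as satisfied.
    """
    if not log:
        return 0.0
    satisfied = sum(1 for t in log if response_outcomes(t, a, b)[1] == 0)
    return satisfied / len(log)


# Hypothetical toy log with single-letter activity names.
log = [
    ["a", "b", "a", "c"],  # first a fulfilled, second a violated
    ["a", "c", "b"],       # the only a is fulfilled
    ["c"],                 # no activation at all
]
for trace in log:
    print(trace, response_outcomes(trace, "a", "b"))
print("support:", support(log, "a", "b"))
```

A discovery procedure in this spirit would compute the support for many candidate constraints and keep those above a chosen threshold.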

Repository Structure

Citing Declare4Py

If you use Declare4Py in your research, please use the following BibTeX entry.

@inproceedings{DonadelloRMS22,
  author    = {Ivan Donadello and
               Francesco Riva and
               Fabrizio Maria Maggi and
               Aladdin Shikhizada},
  title     = {Declare4Py: {A} Python Library for Declarative Process Mining},
  booktitle = {{BPM} (PhD/Demos)},
  series    = {{CEUR} Workshop Proceedings},
  volume    = {3216},
  pages     = {117--121},
  publisher = {CEUR-WS.org},
  year      = {2022}
}