AutomatedProcessImprovement / log-distance-measures

Python package with event log distance and similarity metrics
Apache License 2.0
5 stars 1 forks source link

Consider refactoring inter-arrival metric to measure the arrival distribution #8

Closed david-chapela closed 1 year ago

david-chapela commented 1 year ago

Current implementation measure the distribution of inter-arrival times (times in between two consecutive case arrival) assuming a continuous distribution.

Maybe it's better to implement the distance between the distribution of arrivals (instead of inter-arrival times):

  1. Get all arrival timestamps (first timestamp of each trace).
  2. Bin them, by default, by absolute hour (e.g. from 01/01/2023 08.00 am to 01/01/2023 09.00 am).
  3. Measure distance between the two distributions.

In this way, the metric captures the temporal dimension in a better way. For example: