These days, businesses keep track of more and more data in their information systems. Moreover, this data becomes more fine-grained than ever, tracking clicks and mutations in databases at the lowest level possible. Faced with such data, process discovery often struggles with producing comprehensible models, as they instead return spaghetti-like models. Such finely granulated models do not fit the business user's mental model of the process under investigation. To tackle this, event log abstraction (ELA) techniques can transform the underlying event log to a higher granularity level. However, insights into the performance of these techniques are lacking in literature as results are only based on small-scale experiments and are often inconclusive. Against this background, this paper evaluates state-of-the-art abstraction techniques on 400 event logs. Results show that ELA sacrifices fitness for precision, but complexity reductions heavily depend on the ELA technique used. This study also illustrates the importance of a larger-scale experiment, as sub-sampling of results leads to contradictory conclusions.

An empirical evaluation of unsupervised event log abstraction techniques in process mining

de Leoni, Massimiliano
Writing – Review & Editing
;
2024

Abstract

These days, businesses keep track of more and more data in their information systems. Moreover, this data becomes more fine-grained than ever, tracking clicks and mutations in databases at the lowest level possible. Faced with such data, process discovery often struggles with producing comprehensible models, as they instead return spaghetti-like models. Such finely granulated models do not fit the business user's mental model of the process under investigation. To tackle this, event log abstraction (ELA) techniques can transform the underlying event log to a higher granularity level. However, insights into the performance of these techniques are lacking in literature as results are only based on small-scale experiments and are often inconclusive. Against this background, this paper evaluates state-of-the-art abstraction techniques on 400 event logs. Results show that ELA sacrifices fitness for precision, but complexity reductions heavily depend on the ELA technique used. This study also illustrates the importance of a larger-scale experiment, as sub-sampling of results leads to contradictory conclusions.
2024
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3504192
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact