Process data properties matter: Introducing gated convolutional neural networks (GCNN) and key-value-predict attention networks (KVP) for next event prediction with deep learning

Please always quote using this URN: urn:nbn:de:bvb:20-opus-352751
  • Predicting next events in predictive process monitoring enables companies to manage and control processes at an early stage and reduce their action distance. In recent years, approaches have steadily moved from classical statistical methods towards the application of deep neural network architectures, which outperform the former and enable analysis without explicit knowledge of the underlying process model. While the focus of prior research was on the long short-term memory network architecture, more deep learning architectures offer promisingPredicting next events in predictive process monitoring enables companies to manage and control processes at an early stage and reduce their action distance. In recent years, approaches have steadily moved from classical statistical methods towards the application of deep neural network architectures, which outperform the former and enable analysis without explicit knowledge of the underlying process model. While the focus of prior research was on the long short-term memory network architecture, more deep learning architectures offer promising extensions that have proven useful for other applications of sequential data. In our work, we introduce a gated convolutional neural network and a key-value-predict attention network to the task of next event prediction. In a comprehensive evaluation study on 11 real-life benchmark datasets, we show that these two novel architectures surpass prior work in 34 out of 44 metric-dataset combinations. For our evaluation, we consider the effects of process data properties, such as sparsity, variation, and repetitiveness, and discuss their impact on the prediction quality of the different deep learning architectures. Similarly, we evaluate their classification properties in terms of generalization and handling class imbalance. Our results provide guidance for researchers and practitioners alike on how to select, validate, and comprehensively benchmark (novel) predictive process monitoring models. In particular, we highlight the importance of sufficiently diverse process data properties in event logs and the comprehensive reporting of multiple performance indicators to achieve meaningful results.show moreshow less

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar Statistics
Metadaten
Author: Kai Heinrich, Patrick Zschech, Christian Janiesch, Markus Bonin
URN:urn:nbn:de:bvb:20-opus-352751
Document Type:Journal article
Faculties:Wirtschaftswissenschaftliche Fakultät / Betriebswirtschaftliches Institut
Language:English
Parent Title (English):Decision Support Systems
Year of Completion:2021
Volume:143
Article Number:113494
Source:Decision Support Systems (2021) 143:113494. https://doi.org/10.1016/j.dss.2021.113494
DOI:https://doi.org/10.1016/j.dss.2021.113494
Dewey Decimal Classification:3 Sozialwissenschaften / 33 Wirtschaft / 330 Wirtschaft
Tag:deep learning; gated convolutional neural network; key-value-predict attention network; machine learning; predictive process monitoring; process mining
Release Date:2024/11/28
Licence (German):License LogoCC BY: Creative-Commons-Lizenz: Namensnennung 4.0 International