How to cite this WIREs title:
WIREs Data Mining Knowl Discov
Impact Factor: 7.250

On the application of sequential pattern mining primitives to process discovery: Overview, outlook and opportunity identification

Abstract

Sequential pattern mining (SPM) is a well‐studied theme in data mining, in which one aims to discover common sequences of itemsets in a large corpus of temporal itemset data. Due to the sequential nature of data streams, supporting SPM in streaming environments is commonly studied in the area of data stream mining as well. On the other hand, stream‐based process discovery (PD), originating from the field of process mining, focusses on learning process models on the basis of online event data. In particular, the main goal of the discovered models is to describe the underlying generating process in an end‐to‐end fashion. As both SPM and PD use data that are comparable in nature, that is, both involve time‐stamped instances, one expects that techniques from the SPM domain are (partly) transferable to the PD domain. However, thus far, little work has been done at the intersection of the two fields. In this focus article, we therefore study the possible application of SPM techniques in the context of PD. We provide an overview of the two fields, covering their commonalities and differences, highlight the challenges of applying SPM techniques to PD, and present an outlook and several avenues for future work.

This article is categorized under:
Algorithmic Development > Spatial and Temporal Data Mining
Fundamental Concepts of Data and Knowledge > Key Design Issues in Data Mining
Fundamental Concepts of Data and Knowledge > Big Data Mining
Typical process discovery result based on the example data of Table
An abstract overview of the online–offline stream process discovery model
Example local process model (BPMN) based on a corpus of event data containing a large number of traces of the form 〈…, a, b, c, d, …〉 and 〈…, a, c, b, d, …〉. BPMN, Business Process Modeling Notation
Mining models employed in stream SPM. Both (a) batch‐based and (b) sliding‐window‐based approaches forward their intermediate output to an offline SPM approach. SPM, sequential pattern mining
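
The sliding‐window variant described in this caption can be illustrated with a minimal Python sketch. It is not the article's method: the function names, the window size, the support threshold, and the toy "offline miner" (which only counts ordered pairs of single activities rather than full itemset sequences) are all assumptions made for illustration. The stream reuses the two trace shapes from the local process model caption.

```python
from collections import Counter, deque


def frequent_pairs(sequences, min_support):
    """Toy offline miner (illustrative stand-in for a real SPM algorithm).

    Counts ordered pairs (x, y) where x occurs somewhere before y,
    at most once per sequence, and keeps pairs with enough support.
    """
    support = Counter()
    for seq in sequences:
        pairs_in_seq = set()
        for i, x in enumerate(seq):
            for y in seq[i + 1:]:
                pairs_in_seq.add((x, y))
        support.update(pairs_in_seq)
    return {pair: n for pair, n in support.items() if n >= min_support}


def sliding_window_mine(stream, window_size, min_support):
    """Keep the most recent sequences and hand them to the offline miner."""
    window = deque(maxlen=window_size)  # oldest sequence is dropped automatically
    for seq in stream:
        window.append(seq)
        yield frequent_pairs(window, min_support)


# Hypothetical stream of traces.
stream = [
    ("a", "b", "c", "d"),
    ("a", "c", "b", "d"),
    ("a", "b", "c", "d"),
]
results = list(sliding_window_mine(stream, window_size=2, min_support=2))
```

With a window of two traces, the pairs (b, c) and (c, b) never reach the support threshold because b and c occur in both orders, whereas pairs such as (a, d) do. A batch‐based variant would instead collect a fixed number of sequences, mine them once, and then start a fresh batch.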
