Skip to main content

Desire Lines in Big Data

  • Reference work entry
  • First Online:
Encyclopedia of Social Network Analysis and Mining
  • 74 Accesses

Synonyms

Business process intelligence; Distributed process mining; Process discovery; Process mining

Glossary

Conformance checking:

Monitoring deviations by comparing model and log

Event log:

Multiset of traces

Event:

Occurrence of some discrete incident (e.g., completion of an activity)

Process discovery:

Extracting process models from an event log

Process mining:

Collection of techniques to discover, monitor, and improve real processes by extracting knowledge from event data

Trace:

Sequence of events

Definition

Processes leave footprints in information systems just like people leave footprints in grassy spaces. Desire lines, i.e., the tracks formed by erosion showing where people really walk, may be very different from the formal pathways. When people deviate from the official path there is often a good reason and room for improvement. The goal of process mining is to extract desire lines from event logs, e.g., to automatically infer a process model from raw events recorded by some...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 2,500.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Adriansyah A, van Dongen B, van der Aalst WMP (2011) Conformance checking using cost-based fitness analysis. In: Chi CH, Johnson P (eds) IEEE international Enterprise computing conference (EDOC 2011). Los Alamitos: IEEE Computer Society, pp 55–64

    Google Scholar 

  • Adriansyah A, Munoz-Gama J, Carmona J, van Dongen BF, van der Aalst WMP (2015) Measuring precision of modeled behavior. IseB 13(1):37–67

    Article  Google Scholar 

  • Aggarwal C (2007) Data streams: models and algorithms, volume 31 of advances in database systems. Springer, Berlin

    Book  Google Scholar 

  • Alves de Medeiros AK, Weijters AJMM, van der Aalst WMP (2007) Genetic process mining: an experimental evaluation. Data Min Knowl Disc 14(2):245–304

    Article  MathSciNet  Google Scholar 

  • Barlow Rogers E (1987) Rebuilding central park: a management and restoration plan. MIT Press, Cambridge, MA

    Google Scholar 

  • Bergenthum R, Desel J, Lorenz R, Mauser S (2007) Process mining based on regions of languages. In: Alonso G, Dadam P, Rosemann M (eds) International conference on Business Process Management (BPM 2007), volume 4714 of lecture notes in computer science. Springer, Berlin, pp 375–383

    Google Scholar 

  • Burattin A, Sperduti A, van der Aalst WMP (2014) Control-flow discovery from event streams. In: IEEE Congress on Evolutionary Computation (CEC 2014). Los Alamitos: IEEE Computer Society, pp 2420–2427

    Google Scholar 

  • Carmona J, Gavalda R (2012) Online techniques for dealing with concept drift in process mining. In: Advances in intelligent data analysis XI, vol 172. Springer, Berlin, pp 90–102

    Chapter  Google Scholar 

  • Carmona J, Cortadella J, Kishinevsky M (2009) Genet: a tool for the synthesis and mining of Petri nets. In: Application of Concurrency to System Design (ACSD 2009). Los Alamitos: IEEE Computer Society, pp 181–185

    Google Scholar 

  • Greco G, Guzzo A, Pontieri L, Saccà D (2006) Discovering expressive process models by clustering log traces. IEEE Trans Knowl Data Eng 18(8):1010–1027

    Article  Google Scholar 

  • Günther CW, van der Aalst WMP (2007) Fuzzy mining: adaptive process simplification based on multi-perspective metrics. In: Alonso G, Dadam P, Rosemann M (eds) International conference on Business Process Management (BPM 2007), volume 4714 of lecture notes in computer science. Springer, Berlin, pp 328–343

    Google Scholar 

  • Hilbert M, Lopez P (2011) The world’s technological capacity to store, communicate, and compute information. Science 332(6025):60–65

    Article  Google Scholar 

  • Jagadeesh Chandra Bose RP, van der Aalst WMP (2010) Trace clustering based on conserved patterns: towards achieving better process models. In: Rinderle-Ma S, Sadiq S, Leymann F (eds) BPM 2009 workshops, proceedings of the fifth workshop on Business Process Intelligence (BPI’09), volume 43 of lecture notes in business information processing. Springer, Berlin, pp 170–181

    Google Scholar 

  • Jagadeesh Chandra Bose RP, van der Aalst WMP, Zliobaite I, Pechenizkiy M (2014) Dealing with concept drifts in process mining. IEEE Trans Neural Netw Learn Syst 25(1):154–171

    Article  Google Scholar 

  • Leemans SJJ, Fahland D, van der Aalst WMP (2014) Discovering block-structured process models from event logs containing infrequent behaviour. In: Lohmann N, Song M, Wohed P (eds) Business Process Management Workshops, international workshop on Business Process Intelligence (BPI 2013), volume 171 of lecture notes in business information processing. Springer, Berlin, pp 66–78

    Google Scholar 

  • Leemans SJJ, Fahland D, van der Aalst WMP (2015) Scalable process discovery with guarantees. In: Gaaloul K, Schmidt R, Nurcan S, Guerreiro S, Ma Q (eds) Enterprise, Business-Process and Information Systems Modeling (BPMDS 2015), volume 214 of lecture notes in business information processing. Springer, Berlin, pp 85–101

    Google Scholar 

  • Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C (2011) In: Byers A (ed) Big data: the next frontier for innovation, competition, and productivity. McKinsey Global Institute, Washington, DC

    Google Scholar 

  • Mendling J (2009) Metrics for process models: empirical foundations of verification, error prediction, and guidelines for correctness, volume 6 of lecture notes in business information processing. Springer, Berlin

    Google Scholar 

  • Mendling J, Neumann G, van der Aalst WMP (2007) Understanding the occurrence of errors in process models based on metrics. In: Curbera F, Leymann F, Weske M (eds) Proceedings of the OTM conference on cooperative information systems (CoopIS 2007), volume 4803 of lecture notes in computer science. Springer, Berlin, pp 113–130

    Google Scholar 

  • Myhill C (2004) Commercial success by looking for desire lines. In: Computer human interaction, volume 3101 of lecture notes in computer science. Springer, Berlin, pp 293–304

    Google Scholar 

  • Reisig W, Rozenberg G (eds) (1998) Lectures on Petri nets I: basic models, volume 1491 of lecture notes in computer science. Springer, Berlin

    Google Scholar 

  • Rozinat A, van der Aalst WMP (2008) Conformance checking of processes based on monitoring real behavior. Inf Syst 33(1):64–95

    Article  Google Scholar 

  • Weijters AJMM, van der Aalst WMP (2003) Rediscovering workflow models from event-based data using little thumb. Integr Comput Aided Eng 10(2):151–162

    Google Scholar 

  • van der Aalst WMP (2013) Decomposing Petri nets for process mining: a generic approach. Distrib Parallel Databases 31(4):471–507

    Article  Google Scholar 

  • van der Aalst WMP (2016) Process mining: data science in action. Springer, Berlin

    Book  Google Scholar 

  • van der Aalst WMP, Weijters AJMM, Maruster L (2004) Workflow mining: discovering process models from event logs. IEEE Trans Knowl Data Eng 16(9):1128–1142

    Article  Google Scholar 

  • van der Aalst WMP, van Hee KM, van der Werf JM, Verdonk M (2010) Auditing 2.0: using process mining to support tomorrow’s auditor. IEEE Comput 43(3):90–93

    Article  Google Scholar 

  • van der Aalst WMP, Schonenberg MH, Song M (2011) Time prediction based on process mining. Inf Syst 36(2):450–475

    Article  Google Scholar 

  • van der Aalst WMP, Adriansyah A, van Dongen B (2012) Replaying history on process models for conformance checking and performance analysis. WIREs Data Min Knowl Discovery 2(2):182–192

    Article  Google Scholar 

  • van der Werf JMEM, van Dongen BF, Hurkens CAJ, Serebrenik A (2010) Process discovery using integer linear programming. Fundam Inform 94:387–412

    MathSciNet  MATH  Google Scholar 

  • van Leeuwen M, Siebes A (2008) StreamKrimp: detecting change in data streams. In: Machine learning and knowledge discovery in databases, volume 5211 of lecture notes in computer science. Springer, Berlin, pp 672–687

    Google Scholar 

  • Widmer G, Kubat M (1996) Learning in the presence of concept drift and hidden contexts. Mach Learn 23:69–101

    Google Scholar 

Recommended Reading

  • To get started with process mining, the reader is advised to read the book “Process mining: data science in action” (van der Aalst 2016)

    Google Scholar 

Download references

Acknowledgments

The author would like to thank all involved in the development of the process mining tool ProM and related techniques (processmining.org) and all members of the IEEE Task Force on Process Mining (www.win.tue.nl/ieeetfpm/).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wil M. P. van der Aalst .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media LLC, part of Springer Nature

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

van der Aalst, W.M.P. (2018). Desire Lines in Big Data. In: Alhajj, R., Rokne, J. (eds) Encyclopedia of Social Network Analysis and Mining. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-7131-2_396

Download citation

Publish with us

Policies and ethics