Working paper

Adaptations Of Data Mining Methodologies: A Systematic Literature Review

Year:

2020

Published in:

University of Tartu
Data Mining
Data Mining Methodologies
CRISP-DM
Literature Review

The use of end-to-end data mining methodologies such as CRISP-DM, KDD process, and SEMMA, has grown substantially over the past decade. However, little is known as to how these methodologies are used in practice. In particular, the question of whether data mining methodologies are used ’as-is’ or adapted for specific purposes, has not been thoroughly investigated. This paper addresses this gap via a systematic literature review focused on the context in which data mining methodologies are used and the adaptations they undergo. The literature review covers 207 peer-reviewed and ’grey’ publications. We find that data mining methodologies are primarily applied ’as-is’. At the same time, we also identify various adaptations of data mining methodologies and we note that their number is growing rapidly. The dominant adaptations pattern is related to methodology adjustments at a granular level (modifications) followed by extensions of existing methodologies with additional elements. Further, we identify two recurrent purposes for adaptation: (1) adaptations to handle Big Data technologies, tools and environments (technological adaptations); and (2) adaptations for context-awareness and for integrating data mining solutions into business processes and IT systems (organizational adaptations). The study suggests that standard data mining methodologies do not pay sufficient attention to deployment issues, which play a prominent role when turning data mining models into software products that are integrated into the IT architectures and business processes of organizations. We conclude that refinements of existing methodologies aimed at combining data, technological, and organizational aspects, could help to mitigate these gaps.

Other publications by

8 publications found

2022
Journal article

Applying The CRISP‑DM Data Mining Process In The Financial Services Industry: Elicitation Of Adaptation Requirements

Publisher: Data & Knowledge Engineering

Authors: Veronika Plotnikova, Fredrik Milani, Marlon Dumas

2022
Journal article

Designing A Data Mining Process For The Financial Services Domain

Publisher: Journal of Business Analytics

Authors: Veronika Plotnikova, Marlon Dumas, Alexander Nolte, Fredrik Milani

2019
Conference proceedings

Data Mining Methodologies in the Banking Domain: A Systematic Literature Review

Publisher: Perspectives in Business Informatics Research

Authors: Veronika Plotnikova, Fredrik Milani, Marlon Dumas

2019
Working paper

Systematic Literature Review Protocol: Adaptations of Data Mining Methodologies

Publisher: University of Tartu

Authors: Veronika Plotnikova, Fredrik Milani, Marlon Dumas

2020
Working paper

Adaptations Of Data Mining Methodologies: A Systematic Literature Review

Publisher: University of Tartu

Authors: Veronika Plotnikova, Fredrik Milani, Marlon Dumas

2019
Working paper

Towards a Data Mining Methodology for the Banking Domain

Publisher: University of Tartu

Authors: Veronika Plotnikova

2018
Conference proceedings

Building Payment Classification Models from Rules and Crowdsourced Labels: A Case Study

Publisher: Advanced Information Systems Engineering Workshops

Authors: Veronika Plotnikova, Artem Mateush, Rajesh Sharma , Marlon Dumas, Ivan Slobozhan, Jaan Übi

2021
Conference proceedings

Adapting the CRISP‑DM Data Mining Process: A Case Study in the Financial Services Domain

Publisher: Research Challenges in Information Science

Authors: Veronika Plotnikova, Fredrik Milani, Marlon Dumas

You have reached the end of the list