The blind men and the elephant: on meeting the problem of multiple truths in data from clustering and pattern mining perspectives

Verfasser / Beitragende:
[Arthur Zimek, Jilles Vreeken]
Ort, Verlag, Jahr:
2015
Enthalten in:
Machine Learning, 98/1-2(2015-01-01), 121-155
Format:
Artikel (online)
ID: 605478031
LEADER caa a22 4500
001 605478031
003 CHVBK
005 20210128100403.0
007 cr unu---uuuuu
008 210128e20150101xx s 000 0 eng
024 7 0 |a 10.1007/s10994-013-5334-y  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s10994-013-5334-y 
245 0 4 |a The blind men and the elephant: on meeting the problem of multiple truths in data from clustering and pattern mining perspectives  |h [Elektronische Daten]  |c [Arthur Zimek, Jilles Vreeken] 
520 3 |a In this position paper, we discuss how different branches of research on clustering and pattern mining, while rather different at first glance, in fact have a lot in common and can learn a lot from each other's solutions and approaches. We give brief introductions to the fundamental problems of different sub-fields of clustering, especially focusing on subspace clustering, ensemble clustering, alternative (as a variant of constraint) clustering, and multiview clustering (as a variant of alternative clustering). Second, we relate a representative of these areas, subspace clustering, to pattern mining. We show that, while these areas use different vocabularies and intuitions, they share common roots and they are exposed to essentially the same fundamental problems; in particular, we detail how certain problems currently faced by the one field, have been solved by the other field, and vice versa. The purpose of our survey is to take first steps towards bridging the linguistic gap between different (sub-) communities and to make researchers from different fields aware of the existence of similar problems (and, partly, of similar solutions or of solutions that could be transferred) in the literature on the other research topic. 
540 |a The Author(s), 2013 
690 7 |a Subspace clustering  |2 nationallicence 
690 7 |a Pattern mining  |2 nationallicence 
690 7 |a Ensemble clustering  |2 nationallicence 
690 7 |a Alternative clustering  |2 nationallicence 
690 7 |a Constraint clustering  |2 nationallicence 
690 7 |a Multiview clustering  |2 nationallicence 
700 1 |a Zimek  |D Arthur  |u Department of Computing Science, University of Alberta, Edmonton, AB, Canada  |4 aut 
700 1 |a Vreeken  |D Jilles  |u Advanced Database Research and Modelling, Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium  |4 aut 
773 0 |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 98/1-2(2015-01-01), 121-155  |x 0885-6125  |q 98:1-2<121  |1 2015  |2 98  |o 10994 
856 4 0 |u https://doi.org/10.1007/s10994-013-5334-y  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s10994-013-5334-y  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Zimek  |D Arthur  |u Department of Computing Science, University of Alberta, Edmonton, AB, Canada  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Vreeken  |D Jilles  |u Advanced Database Research and Modelling, Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 98/1-2(2015-01-01), 121-155  |x 0885-6125  |q 98:1-2<121  |1 2015  |2 98  |o 10994