Enriched spatial comparison of clusterings through discovery of deviating subspaces

Verfasser / Beitragende:
[Eric Bae, James Bailey]
Ort, Verlag, Jahr:
2015
Enthalten in:
Machine Learning, 98/1-2(2015-01-01), 93-120
Format:
Artikel (online)
ID: 605478090
LEADER caa a22 4500
001 605478090
003 CHVBK
005 20210128100404.0
007 cr unu---uuuuu
008 210128e20150101xx s 000 0 eng
024 7 0 |a 10.1007/s10994-013-5332-0  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s10994-013-5332-0 
245 0 0 |a Enriched spatial comparison of clusterings through discovery of deviating subspaces  |h [Elektronische Daten]  |c [Eric Bae, James Bailey] 
520 3 |a Generation and analysis of multiple clusterings is a growing and important research field. A fundamental challenge underpinning this area is how to develop principled methods for assessing and explaining the similarity between two clusterings. A range of clustering similarity indices exist and an important subclass consists of measures for assessing spatial clustering similarity. These provide the advantage of being able to take into account properties of the feature space when assessing the similarity of clusterings. However, the output of spatially aware clustering comparison is limited to a single similarity value, which lacks detail for a user. Instead, a user may also wish to understand the degree to which the assessment of clustering similarity is dependent on the choice of feature space. To this end, we propose a technique for deeper exploration of the spatial similarity between two clusterings. Using as a reference a measure that assesses the spatial similarity of two clusterings in the full feature space, our method discovers deviating subspaces in which the spatial similarity of the two clusterings becomes substantially larger or smaller. Such information provides a starting point for the user to understand the circumstances in which the distance functions associated with each of the two clusterings are behaving similarly or dissimilarly. The core of our method employs a range of pruning techniques to help efficiently enumerate and explore the search space of deviating subspaces. We experimentally assess the effectiveness of our approach using an evaluation with synthetic and real world datasets and demonstrate the potential of our technique for highlighting novel information about spatial similarity between clusterings. 
540 |a The Author(s), 2013 
690 7 |a Clustering similarity  |2 nationallicence 
690 7 |a Clustering comparison  |2 nationallicence 
690 7 |a Multiple clusterings  |2 nationallicence 
700 1 |a Bae  |D Eric  |u Department of Computing and Information Systems, The University of Melbourne, Melbourne, Australia  |4 aut 
700 1 |a Bailey  |D James  |u Department of Computing and Information Systems, The University of Melbourne, Melbourne, Australia  |4 aut 
773 0 |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 98/1-2(2015-01-01), 93-120  |x 0885-6125  |q 98:1-2<93  |1 2015  |2 98  |o 10994 
856 4 0 |u https://doi.org/10.1007/s10994-013-5332-0  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s10994-013-5332-0  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Bae  |D Eric  |u Department of Computing and Information Systems, The University of Melbourne, Melbourne, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Bailey  |D James  |u Department of Computing and Information Systems, The University of Melbourne, Melbourne, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 98/1-2(2015-01-01), 93-120  |x 0885-6125  |q 98:1-2<93  |1 2015  |2 98  |o 10994