Convex relaxations of penalties for sparse correlated variables with bounded total variation

Verfasser / Beitragende:
[Eugene Belilovsky, Andreas Argyriou, Gaël Varoquaux, Matthew Blaschko]
Ort, Verlag, Jahr:
2015
Enthalten in:
Machine Learning, 100/2-3(2015-09-01), 533-553
Format:
Artikel (online)
ID: 605478384
LEADER caa a22 4500
001 605478384
003 CHVBK
005 20210128100405.0
007 cr unu---uuuuu
008 210128e20150901xx s 000 0 eng
024 7 0 |a 10.1007/s10994-015-5511-2  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s10994-015-5511-2 
245 0 0 |a Convex relaxations of penalties for sparse correlated variables with bounded total variation  |h [Elektronische Daten]  |c [Eugene Belilovsky, Andreas Argyriou, Gaël Varoquaux, Matthew Blaschko] 
520 3 |a We study the problem of statistical estimation with a signal known to be sparse, spatially contiguous, and containing many highly correlated variables. We take inspiration from the recently introduced k-support norm, which has been successfully applied to sparse prediction problems with correlated features, but lacks any explicit structural constraints commonly found in machine learning and image processing. We address this problem by incorporating a total variation penalty in the k-support framework. We introduce the (k,s) support total variation norm as the tightest convex relaxation of the intersection of a set of sparsity and total variation constraints. We show that this norm leads to an intractable combinatorial graph optimization problem, which we prove to be NP-hard. We then introduce a tractable relaxation with approximation guarantees that scale well for grid structured graphs. We devise several first-order optimization strategies for statistical parameter estimation with the described penalty. We demonstrate the effectiveness of this penalty on classification in the low-sample regime, classification with M/EEG neuroimaging data, and image recovery with synthetic and real data background subtracted image recovery tasks. We extensively analyse the application of our penalty on the complex task of identifying predictive regions from low-sample high-dimensional fMRI brain data, we show that our method is particularly useful compared to existing methods in terms of accuracy, interpretability, and stability. 
540 |a The Author(s), 2015 
690 7 |a Structured sparsity  |2 nationallicence 
690 7 |a Feature selection  |2 nationallicence 
690 7 |a Brain decoding  |2 nationallicence 
690 7 |a k -Support  |2 nationallicence 
690 7 |a Total variation  |2 nationallicence 
700 1 |a Belilovsky  |D Eugene  |u Inria Saclay, Palaiseau, France  |4 aut 
700 1 |a Argyriou  |D Andreas  |u Inria Saclay, Palaiseau, France  |4 aut 
700 1 |a Varoquaux  |D Gaël  |u Inria Saclay, Palaiseau, France  |4 aut 
700 1 |a Blaschko  |D Matthew  |u Inria Saclay, Palaiseau, France  |4 aut 
773 0 |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 100/2-3(2015-09-01), 533-553  |x 0885-6125  |q 100:2-3<533  |1 2015  |2 100  |o 10994 
856 4 0 |u https://doi.org/10.1007/s10994-015-5511-2  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s10994-015-5511-2  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Belilovsky  |D Eugene  |u Inria Saclay, Palaiseau, France  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Argyriou  |D Andreas  |u Inria Saclay, Palaiseau, France  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Varoquaux  |D Gaël  |u Inria Saclay, Palaiseau, France  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Blaschko  |D Matthew  |u Inria Saclay, Palaiseau, France  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 100/2-3(2015-09-01), 533-553  |x 0885-6125  |q 100:2-3<533  |1 2015  |2 100  |o 10994