Deletion/Substitution/Addition Algorithm in Learning with Applications in Genomics

Verfasser / Beitragende:
[Sandra E Sinisi, Mark J. van der Laan]
Ort, Verlag, Jahr:
2004
Enthalten in:
Statistical Applications in Genetics and Molecular Biology, 3/1(2004-08-12), 1-38
Format:
Artikel (online)
ID: 378925849
LEADER caa a22 4500
001 378925849
003 CHVBK
005 20180305123617.0
007 cr unu---uuuuu
008 161128e20040812xx s 000 0 eng
024 7 0 |a 10.2202/1544-6115.1069  |2 doi 
035 |a (NATIONALLICENCE)gruyter-10.2202/1544-6115.1069 
245 0 0 |a Deletion/Substitution/Addition Algorithm in Learning with Applications in Genomics  |h [Elektronische Daten]  |c [Sandra E Sinisi, Mark J. van der Laan] 
520 3 |a van der Laan and Dudoit (2003) provide a road map for estimation and performance assessment where a parameter of interest is defined as the risk minimizer for a suitable loss function and candidate estimators are generated using a loss function. After briefly reviewing this approach, this article proposes a general deletion/substitution/addition algorithm for minimizing, over subsets of variables (e.g., basis functions), the empirical risk of subset-specific estimators of the parameter of interest. This algorithm provides us with a new class of loss-based cross-validated algorithms in prediction of univariate outcomes, which can be extended to handle multivariate outcomes, conditional density and hazard estimation, and censored outcomes such as survival. In the context of regression, using polynomial basis functions, we study the properties of the deletion/substitution/addition algorithm in simulations and apply the method to detect transcription factor binding sites in yeast gene expression experiments. 
540 |a ©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston 
690 7 |a Statistical Models  |2 nationallicence 
690 7 |a Cross-validation  |2 nationallicence 
690 7 |a estimation  |2 nationallicence 
690 7 |a loss function  |2 nationallicence 
690 7 |a measures of importance  |2 nationallicence 
690 7 |a model selection  |2 nationallicence 
690 7 |a polynomial regression  |2 nationallicence 
690 7 |a prediction  |2 nationallicence 
690 7 |a risk  |2 nationallicence 
690 7 |a variable selection  |2 nationallicence 
700 1 |a Sinisi  |D Sandra E.  |u University of California, Berkeley  |4 aut 
700 1 |a van der Laan  |D Mark J.  |u University of California, Berkeley  |4 aut 
773 0 |t Statistical Applications in Genetics and Molecular Biology  |d De Gruyter  |g 3/1(2004-08-12), 1-38  |q 3:1<1  |1 2004  |2 3  |o sagmb 
856 4 0 |u https://doi.org/10.2202/1544-6115.1069  |q text/html  |z Onlinezugriff via DOI 
908 |D 1  |a research article  |2 jats 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.2202/1544-6115.1069  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Sinisi  |D Sandra E.  |u University of California, Berkeley  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a van der Laan  |D Mark J.  |u University of California, Berkeley  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Statistical Applications in Genetics and Molecular Biology  |d De Gruyter  |g 3/1(2004-08-12), 1-38  |q 3:1<1  |1 2004  |2 3  |o sagmb 
900 7 |b CC0  |u http://creativecommons.org/publicdomain/zero/1.0  |2 nationallicence 
898 |a BK010053  |b XK010053  |c XK010000 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-gruyter