Classifying Gene Expression Profiles from Pairwise mRNA Comparisons

Verfasser / Beitragende:
[Donald Geman, Christian d'Avignon, Daniel Q. Naiman, Raimond L. Winslow]
Ort, Verlag, Jahr:
2004
Enthalten in:
Statistical Applications in Genetics and Molecular Biology, 3/1(2004-08-30), 1-19
Format:
Artikel (online)
ID: 37892608X
LEADER caa a22 4500
001 37892608X
003 CHVBK
005 20180305123617.0
007 cr unu---uuuuu
008 161128e20040830xx s 000 0 eng
024 7 0 |a 10.2202/1544-6115.1071  |2 doi 
035 |a (NATIONALLICENCE)gruyter-10.2202/1544-6115.1071 
245 0 0 |a Classifying Gene Expression Profiles from Pairwise mRNA Comparisons  |h [Elektronische Daten]  |c [Donald Geman, Christian d'Avignon, Daniel Q. Naiman, Raimond L. Winslow] 
520 3 |a We present a new approach to molecular classification based on mRNA comparisons. Our method, referred to as the top-scoring pair(s) (TSP) classifier, is motivated by current technical and practical limitations in using gene expression microarray data for class prediction, for example to detect disease, identify tumors or predict treatment response. Accurate statistical inference from such data is difficult due to the small number of observations, typically tens, relative to the large number of genes, typically thousands. Moreover, conventional methods from machine learning lead to decisions which are usually very difficult to interpret in simple or biologically meaningful terms. In contrast, the TSP classifier provides decision rules which i) involve very few genes and only relative expression values (e.g., comparing the mRNA counts within a single pair of genes); ii) are both accurate and transparent; and iii) provide specific hypotheses for follow-up studies. In particular, the TSP classifier achieves prediction rates with standard cancer data that are as high as those of previous studies which use considerably more genes and complex procedures. Finally, the TSP classifier is parameter-free, thus avoiding the type of over-fitting and inflated estimates of performance that result when all aspects of learning a predictor are not properly cross-validated. 
540 |a ©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston 
690 7 |a Computational Biology/Bioinformatics  |2 nationallicence 
690 7 |a Microarrays  |2 nationallicence 
690 7 |a microarray data  |2 nationallicence 
690 7 |a class prediction  |2 nationallicence 
690 7 |a mRNA comparisons  |2 nationallicence 
700 1 |a Geman  |D Donald  |u Johns Hopkins University  |4 aut 
700 1 |a d'Avignon  |D Christian  |u Johns Hopkins University  |4 aut 
700 1 |a Naiman  |D Daniel Q.  |u Johns Hopkins University  |4 aut 
700 1 |a Winslow  |D Raimond L.  |u Johns Hopkins University  |4 aut 
773 0 |t Statistical Applications in Genetics and Molecular Biology  |d De Gruyter  |g 3/1(2004-08-30), 1-19  |q 3:1<1  |1 2004  |2 3  |o sagmb 
856 4 0 |u https://doi.org/10.2202/1544-6115.1071  |q text/html  |z Onlinezugriff via DOI 
908 |D 1  |a research article  |2 jats 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.2202/1544-6115.1071  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Geman  |D Donald  |u Johns Hopkins University  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a d'Avignon  |D Christian  |u Johns Hopkins University  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Naiman  |D Daniel Q.  |u Johns Hopkins University  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Winslow  |D Raimond L.  |u Johns Hopkins University  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Statistical Applications in Genetics and Molecular Biology  |d De Gruyter  |g 3/1(2004-08-30), 1-19  |q 3:1<1  |1 2004  |2 3  |o sagmb 
900 7 |b CC0  |u http://creativecommons.org/publicdomain/zero/1.0  |2 nationallicence 
898 |a BK010053  |b XK010053  |c XK010000 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-gruyter