A novel approach for predicting DNA splice junctions using hybrid machine learning algorithms

Verfasser / Beitragende:
[Indrajit Mandal]
Ort, Verlag, Jahr:
2015
Enthalten in:
Soft Computing, 19/12(2015-12-01), 3431-3444
Format:
Artikel (online)
ID: 605469172
LEADER caa a22 4500
001 605469172
003 CHVBK
005 20210128100321.0
007 cr unu---uuuuu
008 210128e20151201xx s 000 0 eng
024 7 0 |a 10.1007/s00500-014-1550-z  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s00500-014-1550-z 
100 1 |a Mandal  |D Indrajit  |u Computer Science and Engineering Department, Rajiv Gandhi Institute of Technology, 560032, Bangalore, India  |4 aut 
245 1 2 |a A novel approach for predicting DNA splice junctions using hybrid machine learning algorithms  |h [Elektronische Daten]  |c [Indrajit Mandal] 
520 3 |a Accurate identification of splice junctions in a DNA sequence is an active area of research. The knowledge of splice junction's occurrence provides valuable information about its internal genomic structure and aids in its deeper analysis and interpretation. The major problems faced during gene analysis are diversity, complexity and the uncertainty nature of DNA sequences. The application of computational techniques using machine learning algorithms in this direction has attracted enormous attention in the last few decades. In this study, the development of hybrid machine learning ensembles approaches is presented that address the splice junction problem more effectively. Multiple classifier systems consisting of random subspace, rotation forest and boosting methods are implemented and are validated over the real genome sequence dataset. A novel feature selection technique based on attribute's correlation estimation using Best first strategy is proposed. The average prediction accuracy achieved is more than 98% in identifying the splice junctions. All the computations are performed with 95% confidence interval. The results presented in this study are superior as compared to the state-of-the-art approaches in the literature. This work strengthens the viability of expanding and using machine learning models to similar problems. 
540 |a Springer-Verlag Berlin Heidelberg, 2014 
690 7 |a Splice junctions  |2 nationallicence 
690 7 |a DNA  |2 nationallicence 
690 7 |a Machine learning  |2 nationallicence 
690 7 |a Multiple classifier system  |2 nationallicence 
690 7 |a Correlation  |2 nationallicence 
773 0 |t Soft Computing  |d Springer Berlin Heidelberg  |g 19/12(2015-12-01), 3431-3444  |x 1432-7643  |q 19:12<3431  |1 2015  |2 19  |o 500 
856 4 0 |u https://doi.org/10.1007/s00500-014-1550-z  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s00500-014-1550-z  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 100  |E 1-  |a Mandal  |D Indrajit  |u Computer Science and Engineering Department, Rajiv Gandhi Institute of Technology, 560032, Bangalore, India  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Soft Computing  |d Springer Berlin Heidelberg  |g 19/12(2015-12-01), 3431-3444  |x 1432-7643  |q 19:12<3431  |1 2015  |2 19  |o 500