To combat multi-class imbalanced problems by means of over-sampling and boosting techniques
Gespeichert in:
Verfasser / Beitragende:
[Lida Abdi, Sattar Hashemi]
Ort, Verlag, Jahr:
2015
Enthalten in:
Soft Computing, 19/12(2015-12-01), 3369-3385
Format:
Artikel (online)
Online Zugang:
| LEADER | caa a22 4500 | ||
|---|---|---|---|
| 001 | 60546930X | ||
| 003 | CHVBK | ||
| 005 | 20210128100321.0 | ||
| 007 | cr unu---uuuuu | ||
| 008 | 210128e20151201xx s 000 0 eng | ||
| 024 | 7 | 0 | |a 10.1007/s00500-014-1291-z |2 doi |
| 035 | |a (NATIONALLICENCE)springer-10.1007/s00500-014-1291-z | ||
| 245 | 0 | 0 | |a To combat multi-class imbalanced problems by means of over-sampling and boosting techniques |h [Elektronische Daten] |c [Lida Abdi, Sattar Hashemi] |
| 520 | 3 | |a Imbalanced problems are quite pervasive in many real-world applications. In imbalanced distributions, a class or some classes of data, called minority class(es), is/are under-represented compared to other classes. This skewness in the data underlying distribution causes many difficulties for typical machine learning algorithms. The notion becomes even more complicated when machine learning algorithms are to combat multi-class imbalanced problems. The presented solutions for tackling the issues arising from imbalanced distributions, generally fall into two main categories: data-oriented methods and model-based algorithms. Focusing on the latter, this paper suggests an elegant blend of boosting and over-sampling paradigms, which is called MDOBoost, to bring considerable benefits to the learning ability of multi-class imbalanced data sets. The over-sampling technique introduced and adopted in this paper, Mahalanobis distance-based over-sampling technique (MDO in short), is delicately incorporated into boosting algorithm. In fact, the minority classes are over-sampled via MDO technique in such a way that they almost preserve the original minority class characteristics. MDO, in comparison with the popular method in this field, SMOTE, generates more similar minority class examples to original class samples. Moreover, the broader representation of minority class examples is provided via MDO, and this, in turn, causes the classifier to build larger decision regions. MDOBoost increases the generalization ability of a classifier, since it indicates better results with pruned version of C4.5 classifier; unlike other over-sampling/boosting procedures, which have difficulties with pruned version of C4.5. MDOBoost is applied to real-world multi-class imbalanced benchmarks and its performance is then compared with several data-level and model-based algorithms. The empirical results and theoretical analyses reveal that MDOBoost offers superior advantages compared to popular class decomposition and over-sampling techniques in terms of MAUC, G-mean, and minority class recall. | |
| 540 | |a Springer-Verlag Berlin Heidelberg, 2014 | ||
| 690 | 7 | |a Multi-class imbalance |2 nationallicence | |
| 690 | 7 | |a Over-sampling |2 nationallicence | |
| 690 | 7 | |a Mahalanobis distance |2 nationallicence | |
| 690 | 7 | |a Boosting algorithm |2 nationallicence | |
| 690 | 7 | |a Class decomposition techniques |2 nationallicence | |
| 700 | 1 | |a Abdi |D Lida |u CSE and IT Department, Shiraz University, Engineering Campus Number 2, Mollasadra Ave., Shiraz, Iran |4 aut | |
| 700 | 1 | |a Hashemi |D Sattar |u CSE and IT Department, Shiraz University, Engineering Campus Number 2, Mollasadra Ave., Shiraz, Iran |4 aut | |
| 773 | 0 | |t Soft Computing |d Springer Berlin Heidelberg |g 19/12(2015-12-01), 3369-3385 |x 1432-7643 |q 19:12<3369 |1 2015 |2 19 |o 500 | |
| 856 | 4 | 0 | |u https://doi.org/10.1007/s00500-014-1291-z |q text/html |z Onlinezugriff via DOI |
| 898 | |a BK010053 |b XK010053 |c XK010000 | ||
| 900 | 7 | |a Metadata rights reserved |b Springer special CC-BY-NC licence |2 nationallicence | |
| 908 | |D 1 |a research-article |2 jats | ||
| 949 | |B NATIONALLICENCE |F NATIONALLICENCE |b NL-springer | ||
| 950 | |B NATIONALLICENCE |P 856 |E 40 |u https://doi.org/10.1007/s00500-014-1291-z |q text/html |z Onlinezugriff via DOI | ||
| 950 | |B NATIONALLICENCE |P 700 |E 1- |a Abdi |D Lida |u CSE and IT Department, Shiraz University, Engineering Campus Number 2, Mollasadra Ave., Shiraz, Iran |4 aut | ||
| 950 | |B NATIONALLICENCE |P 700 |E 1- |a Hashemi |D Sattar |u CSE and IT Department, Shiraz University, Engineering Campus Number 2, Mollasadra Ave., Shiraz, Iran |4 aut | ||
| 950 | |B NATIONALLICENCE |P 773 |E 0- |t Soft Computing |d Springer Berlin Heidelberg |g 19/12(2015-12-01), 3369-3385 |x 1432-7643 |q 19:12<3369 |1 2015 |2 19 |o 500 | ||