Data-based prediction of sentiments using heterogeneous model ensembles

Verfasser / Beitragende:
[Stephan Winkler, Susanne Schaller, Viktoria Dorfer, Michael Affenzeller, Gerald Petz, Michał Karpowicz]
Ort, Verlag, Jahr:
2015
Enthalten in:
Soft Computing, 19/12(2015-12-01), 3401-3412
Format:
Artikel (online)
ID: 605469253
LEADER caa a22 4500
001 605469253
003 CHVBK
005 20210128100321.0
007 cr unu---uuuuu
008 210128e20151201xx s 000 0 eng
024 7 0 |a 10.1007/s00500-014-1325-6  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s00500-014-1325-6 
245 0 0 |a Data-based prediction of sentiments using heterogeneous model ensembles  |h [Elektronische Daten]  |c [Stephan Winkler, Susanne Schaller, Viktoria Dorfer, Michael Affenzeller, Gerald Petz, Michał Karpowicz] 
520 3 |a In this paper, we present an ensemble modeling approach for sentiment analysis using machine learning algorithms. The main goal of sentiment analysis is to develop estimators that are able to identify the sentiment orientation (positive, negative, or neutral) of sentences found in any arbitrary source. The novel approach presented here relies on the analysis of the words found in sentences and the formation of large sets of heterogeneous models, i.e., binary as well as multi-class classification models that are calculated by various different machine learning methods; these models shall represent the relationship between the presence of given words (or combination of words) and sentiments. All models trained during the learning phase are applied during the test phase and the final sentiment assessment is annotated with a confidence value that specifies, how reliable the models are regarding the presented decision. In the empirical part of this paper, we show results achieved using a German corpus of Amazon recensions and a set of machine learning methods (decision trees and adaptive boosting, Gaussian processes, random forests, k-nearest neighbor classification, support vector machines and artificial neural networks with evolutionary feature and parameter optimization, and genetic programming). Using a heterogeneous model ensemble learning approach that combines multi-class classifiers as well as binary classifiers, the classification accuracy can be increased significantly and the ratio of totally wrongly classified samples (i.e., those that are assigned to the completely opposite sentiment orientation) can be decreased significantly. 
540 |a Springer-Verlag Berlin Heidelberg, 2014 
690 7 |a Sentiment analysis  |2 nationallicence 
690 7 |a Machine learning  |2 nationallicence 
690 7 |a Heterogeneous model ensembles  |2 nationallicence 
690 7 |a Evolutionary computation  |2 nationallicence 
700 1 |a Winkler  |D Stephan  |u Heuristic and Evolutionary Algorithms Laboratory, Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria  |4 aut 
700 1 |a Schaller  |D Susanne  |u Heuristic and Evolutionary Algorithms Laboratory, Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria  |4 aut 
700 1 |a Dorfer  |D Viktoria  |u Heuristic and Evolutionary Algorithms Laboratory, Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria  |4 aut 
700 1 |a Affenzeller  |D Michael  |u Heuristic and Evolutionary Algorithms Laboratory, Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria  |4 aut 
700 1 |a Petz  |D Gerald  |u Department of Marketing and Electronic Business, University of Applied Sciences Upper Austria, Steyr, Austria  |4 aut 
700 1 |a Karpowicz  |D Michał  |u Department of Marketing and Electronic Business, University of Applied Sciences Upper Austria, Steyr, Austria  |4 aut 
773 0 |t Soft Computing  |d Springer Berlin Heidelberg  |g 19/12(2015-12-01), 3401-3412  |x 1432-7643  |q 19:12<3401  |1 2015  |2 19  |o 500 
856 4 0 |u https://doi.org/10.1007/s00500-014-1325-6  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s00500-014-1325-6  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Winkler  |D Stephan  |u Heuristic and Evolutionary Algorithms Laboratory, Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Schaller  |D Susanne  |u Heuristic and Evolutionary Algorithms Laboratory, Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Dorfer  |D Viktoria  |u Heuristic and Evolutionary Algorithms Laboratory, Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Affenzeller  |D Michael  |u Heuristic and Evolutionary Algorithms Laboratory, Bioinformatics Research Group, University of Applied Sciences Upper Austria, Hagenberg, Austria  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Petz  |D Gerald  |u Department of Marketing and Electronic Business, University of Applied Sciences Upper Austria, Steyr, Austria  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Karpowicz  |D Michał  |u Department of Marketing and Electronic Business, University of Applied Sciences Upper Austria, Steyr, Austria  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Soft Computing  |d Springer Berlin Heidelberg  |g 19/12(2015-12-01), 3401-3412  |x 1432-7643  |q 19:12<3401  |1 2015  |2 19  |o 500