Using causal discovery for feature selection in multivariate numerical time series

Verfasser / Beitragende:
[Youqiang Sun, Jiuyong Li, Jixue Liu, Christopher Chow, Bingyu Sun, Rujing Wang]
Ort, Verlag, Jahr:
2015
Enthalten in:
Machine Learning, 101/1-3(2015-10-01), 377-395
Format:
Artikel (online)
ID: 605477795
LEADER caa a22 4500
001 605477795
003 CHVBK
005 20210128100402.0
007 cr unu---uuuuu
008 210128e20151001xx s 000 0 eng
024 7 0 |a 10.1007/s10994-014-5460-1  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s10994-014-5460-1 
245 0 0 |a Using causal discovery for feature selection in multivariate numerical time series  |h [Elektronische Daten]  |c [Youqiang Sun, Jiuyong Li, Jixue Liu, Christopher Chow, Bingyu Sun, Rujing Wang] 
520 3 |a Time series data contains temporal ordering, which makes its feature selection different from the normal feature selection. Feature selection in multivariate time series has two tasks: identifying the relevant features and finding their effective window sizes of lagged values. The methods extended from normal feature selection methods do not solve this two-dimensional feature selection problem since they do not take lagged observations of features into consideration. In this paper, we present a method using the Granger causality discovery to identify causal features with effective sliding window sizes in multivariate numerical time series. The proposed method considers the influence of lagged observations of features on the target time series. We compare our proposed feature selection method with several normal feature selection methods on multivariate time series data using three well-known modeling methods. Our method outperforms other methods for predicting future values of target time series. In a real world case study on water quality monitoring data, we show that the features selected by our method contain four out of five features used by domain experts, and prediction performance on our features is better than that on features of domain experts using three modeling methods. 
540 |a The Author(s), 2014 
690 7 |a Feature selection  |2 nationallicence 
690 7 |a Multivariate time series  |2 nationallicence 
690 7 |a Causal discovery  |2 nationallicence 
690 7 |a Prediction and regression  |2 nationallicence 
690 7 |a Granger causality  |2 nationallicence 
700 1 |a Sun  |D Youqiang  |u School of Information Science and Technology, University of Science and Technology of China, Hefei, Anhui, China  |4 aut 
700 1 |a Li  |D Jiuyong  |u School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, SA, Australia  |4 aut 
700 1 |a Liu  |D Jixue  |u School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, SA, Australia  |4 aut 
700 1 |a Chow  |D Christopher  |u Australian Water Quality Centre, SA Water, Adelaide, SA, Australia  |4 aut 
700 1 |a Sun  |D Bingyu  |u School of Information Science and Technology, University of Science and Technology of China, Hefei, Anhui, China  |4 aut 
700 1 |a Wang  |D Rujing  |u School of Information Science and Technology, University of Science and Technology of China, Hefei, Anhui, China  |4 aut 
773 0 |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 101/1-3(2015-10-01), 377-395  |x 0885-6125  |q 101:1-3<377  |1 2015  |2 101  |o 10994 
856 4 0 |u https://doi.org/10.1007/s10994-014-5460-1  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s10994-014-5460-1  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Sun  |D Youqiang  |u School of Information Science and Technology, University of Science and Technology of China, Hefei, Anhui, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Li  |D Jiuyong  |u School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, SA, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Liu  |D Jixue  |u School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, SA, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Chow  |D Christopher  |u Australian Water Quality Centre, SA Water, Adelaide, SA, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Sun  |D Bingyu  |u School of Information Science and Technology, University of Science and Technology of China, Hefei, Anhui, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Wang  |D Rujing  |u School of Information Science and Technology, University of Science and Technology of China, Hefei, Anhui, China  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 101/1-3(2015-10-01), 377-395  |x 0885-6125  |q 101:1-3<377  |1 2015  |2 101  |o 10994