Evaluation of semi-supervised learning method on action recognition

Verfasser / Beitragende:
[Haoquan Shen, Yan Yan, Shicheng Xu, Nicolas Ballas, Wenzhi Chen]
Ort, Verlag, Jahr:
2015
Enthalten in:
Multimedia Tools and Applications, 74/2(2015-01-01), 523-542
Format:
Artikel (online)
ID: 605446849
LEADER caa a22 4500
001 605446849
003 CHVBK
005 20210128100129.0
007 cr unu---uuuuu
008 210128e20150101xx s 000 0 eng
024 7 0 |a 10.1007/s11042-014-1936-z  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s11042-014-1936-z 
245 0 0 |a Evaluation of semi-supervised learning method on action recognition  |h [Elektronische Daten]  |c [Haoquan Shen, Yan Yan, Shicheng Xu, Nicolas Ballas, Wenzhi Chen] 
520 3 |a Action recognition is one of the most difficult problems in computer vision and multimedia areas, since both spatial information and spatiotemporal semantic meaning should be taken into consideration. Moreover, the noisy and weakly annotated information make this task even harder. Nowadays, instead of the traditional features and classifiers, a lot of new attempts have made the task of action recognition promising. Noticing that there is no work on comparison of different combination of pooling and semi-supervised learning method under the same experiment setting, it would be interesting to apply different combination of pooling and semi-supervised learning method on both the synthetic and realistic action recognition datasets to see which combination or method performs better. In summary, we can obtain the following conclusions based on our experiments. Firstly, Second Order Pooling (Carreira et al.2012) is worse than the traditional Bag of Words (Schmid and Mohr1997; Dance et al.2004) regarding to the overall performance in some dataset, but is a good way to speed up the coding stage of video classification with little sacrifice of performance. Secondly, Semi-supervised Hierarchical Regression Algorithm (MLHR) and Manifold Regularized Least Square Regression (MRLS) (Belkin et al. J Mach Learn Res 12:2399-2434,2006) is better than some of the supervised learning methods (χ 2-SVM, SVM-2K (Farquhar et al.2006)) in the real world action recognition problems which shares little available annotated information. Thirdly, for KTH, UCF50 and HMDB dataset, late fusion doesn't necessarily improve the performance. In comparison, MLHR, SVM-2K and Multi-kernel Learning is a more natural way to deal with multi-feature problems. 
540 |a Springer Science+Business Media New York, 2014 
690 7 |a Semi-supervised learning  |2 nationallicence 
690 7 |a Multi-feature fusion  |2 nationallicence 
690 7 |a Second order pooling  |2 nationallicence 
690 7 |a Video concept annotation  |2 nationallicence 
690 7 |a Action recognition  |2 nationallicence 
700 1 |a Shen  |D Haoquan  |u Department of Computer Science, Zhejiang University, Zhejiang, China  |4 aut 
700 1 |a Yan  |D Yan  |u Department of Information Engineering and Computer Science, University of Trento, Trento, Italy  |4 aut 
700 1 |a Xu  |D Shicheng  |u Department of Computer Science, Zhejiang University, Zhejiang, China  |4 aut 
700 1 |a Ballas  |D Nicolas  |u CEA and Mines-ParisTech, Paris, France  |4 aut 
700 1 |a Chen  |D Wenzhi  |u Department of Computer Science, Zhejiang University, Zhejiang, China  |4 aut 
773 0 |t Multimedia Tools and Applications  |d Springer US; http://www.springer-ny.com  |g 74/2(2015-01-01), 523-542  |x 1380-7501  |q 74:2<523  |1 2015  |2 74  |o 11042 
856 4 0 |u https://doi.org/10.1007/s11042-014-1936-z  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s11042-014-1936-z  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Shen  |D Haoquan  |u Department of Computer Science, Zhejiang University, Zhejiang, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Yan  |D Yan  |u Department of Information Engineering and Computer Science, University of Trento, Trento, Italy  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Xu  |D Shicheng  |u Department of Computer Science, Zhejiang University, Zhejiang, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Ballas  |D Nicolas  |u CEA and Mines-ParisTech, Paris, France  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Chen  |D Wenzhi  |u Department of Computer Science, Zhejiang University, Zhejiang, China  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Multimedia Tools and Applications  |d Springer US; http://www.springer-ny.com  |g 74/2(2015-01-01), 523-542  |x 1380-7501  |q 74:2<523  |1 2015  |2 74  |o 11042