Augmenting bag-of-words: a robust contextual representation of spatiotemporal interest points for action recognition

Authors / Contributors:
[Yang Li, Junyong Ye, Tongqing Wang, Shijian Huang]
Place, publisher, year:
2015
Contained in:
The Visual Computer, 31/10(2015-10-01), 1383-1394
Format:
Article (online)
ID: 605540551
LEADER caa a22 4500
001 605540551
003 CHVBK
005 20210128100912.0
007 cr unu---uuuuu
008 210128e20151001xx s 000 0 eng
024 7 0 |a 10.1007/s00371-014-1020-8  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s00371-014-1020-8 
245 0 0 |a Augmenting bag-of-words: a robust contextual representation of spatiotemporal interest points for action recognition  |h [Electronic data]  |c [Yang Li, Junyong Ye, Tongqing Wang, Shijian Huang] 
520 3 |a Although the traditional bag-of-words model, together with local spatiotemporal features, has shown promising results for human action recognition, it ignores all structural information of features, which carries important information about motion structures in videos. Recent methods usually characterize the relationship of quantized spatiotemporal features to overcome this drawback. However, the propagation of quantization error leads to an unreliable representation. To alleviate the propagation of quantization error, we present a coding method that considers not only the spatial similarity but also the reconstruction ability of visual words after giving a probabilistic interpretation of coding coefficients. Based on our coding method, a new type of feature called the cumulative probability histogram is proposed to robustly characterize contextual structural information around interest points, which are extracted from multi-layered contexts and assumed to be complementary to local spatiotemporal features. The proposed method is verified on four benchmark datasets. Experimental results show that our method achieves better performance than previous methods in action recognition. 
540 |a Springer-Verlag Berlin Heidelberg, 2014 
690 7 |a Action recognition  |2 nationallicence 
690 7 |a Contextual features  |2 nationallicence 
690 7 |a Cumulative probability histogram  |2 nationallicence 
690 7 |a Sparse coding  |2 nationallicence 
700 1 |a Li  |D Yang  |u Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education, Chongqing University, Chongqing, China  |4 aut 
700 1 |a Ye  |D Junyong  |u Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education, Chongqing University, Chongqing, China  |4 aut 
700 1 |a Wang  |D Tongqing  |u Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education, Chongqing University, Chongqing, China  |4 aut 
700 1 |a Huang  |D Shijian  |u Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education, Chongqing University, Chongqing, China  |4 aut 
773 0 |t The Visual Computer  |d Springer Berlin Heidelberg  |g 31/10(2015-10-01), 1383-1394  |x 0178-2789  |q 31:10<1383  |1 2015  |2 31  |o 371 
856 4 0 |u https://doi.org/10.1007/s00371-014-1020-8  |q text/html  |z Online access via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s00371-014-1020-8  |q text/html  |z Online access via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Li  |D Yang  |u Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education, Chongqing University, Chongqing, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Ye  |D Junyong  |u Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education, Chongqing University, Chongqing, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Wang  |D Tongqing  |u Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education, Chongqing University, Chongqing, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Huang  |D Shijian  |u Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education, Chongqing University, Chongqing, China  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t The Visual Computer  |d Springer Berlin Heidelberg  |g 31/10(2015-10-01), 1383-1394  |x 0178-2789  |q 31:10<1383  |1 2015  |2 31  |o 371