An iteratively reweighting algorithm for dynamic video summarization

Verfasser / Beitragende:
[Pei Dong, Yong Xia, Shanshan Wang, Li Zhuo, David Feng]
Ort, Verlag, Jahr:
2015
Enthalten in:
Multimedia Tools and Applications, 74/21(2015-11-01), 9449-9473
Format:
Artikel (online)
ID: 605447063
LEADER caa a22 4500
001 605447063
003 CHVBK
005 20210128100130.0
007 cr unu---uuuuu
008 210128e20151101xx s 000 0 eng
024 7 0 |a 10.1007/s11042-014-2126-8  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s11042-014-2126-8 
245 0 3 |a An iteratively reweighting algorithm for dynamic video summarization  |h [Elektronische Daten]  |c [Pei Dong, Yong Xia, Shanshan Wang, Li Zhuo, David Feng] 
520 3 |a Information explosion has imposed unprecedented challenges on the conventional ways of video data consumption. Hence providing condensed and meaningful video summary to viewers has been recognized as a beneficial and attractive research in the multimedia community in recent years. Analyzing both the visual and textual modalities proves essential for an automatic video summarizer to pick up important contents from a video. However, most established studies in this direction either use heuristic rules or rely on simple ways of text analysis. This paper proposes an iteratively reweighting dynamic video summarization (IRDVS) algorithm based on the joint and adaptive use of the visual modality and accompanying subtitles. The proposed algorithm takes advantage of our developed SEmantic inDicator of videO seGment (SEDOG) feature for exploring the most representative concepts for describing the video. Meanwhile, the iteratively reweighting scheme effectively updates the dynamic surrogate of the original video by combining the high-level features in an adaptive manner. The proposed algorithm has been compared to four state-of-the-art video summarization approaches, namely the speech transcript-based (STVS) algorithm, attention model-based (AMVS) algorithm, sparse dictionary selection-based (DSVS) algorithm and heterogeneity image patch index-based (HIPVS) algorithm, on different video genres, including documentary, movie and TV news. Our results show that the proposed IRDVS algorithm can produce summarized videos with better quality. 
540 |a Springer Science+Business Media New York, 2014 
690 7 |a Video summarization  |2 nationallicence 
690 7 |a Semantic indicator of video segment (SEDOG)  |2 nationallicence 
690 7 |a Iterative weight estimation  |2 nationallicence 
690 7 |a Multimodal features  |2 nationallicence 
690 7 |a Saliency ranking  |2 nationallicence 
700 1 |a Dong  |D Pei  |u Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, NSW 2006, Sydney, Australia  |4 aut 
700 1 |a Xia  |D Yong  |u Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, NSW 2006, Sydney, Australia  |4 aut 
700 1 |a Wang  |D Shanshan  |u Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, NSW 2006, Sydney, Australia  |4 aut 
700 1 |a Zhuo  |D Li  |u Signal and Information Processing Laboratory, Beijing University of Technology, 100124, Beijing, China  |4 aut 
700 1 |a Feng  |D David  |u Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, NSW 2006, Sydney, Australia  |4 aut 
773 0 |t Multimedia Tools and Applications  |d Springer US; http://www.springer-ny.com  |g 74/21(2015-11-01), 9449-9473  |x 1380-7501  |q 74:21<9449  |1 2015  |2 74  |o 11042 
856 4 0 |u https://doi.org/10.1007/s11042-014-2126-8  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s11042-014-2126-8  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Dong  |D Pei  |u Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, NSW 2006, Sydney, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Xia  |D Yong  |u Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, NSW 2006, Sydney, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Wang  |D Shanshan  |u Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, NSW 2006, Sydney, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Zhuo  |D Li  |u Signal and Information Processing Laboratory, Beijing University of Technology, 100124, Beijing, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Feng  |D David  |u Biomedical and Multimedia Information Technology (BMIT) Research Group, School of Information Technologies, The University of Sydney, NSW 2006, Sydney, Australia  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Multimedia Tools and Applications  |d Springer US; http://www.springer-ny.com  |g 74/21(2015-11-01), 9449-9473  |x 1380-7501  |q 74:21<9449  |1 2015  |2 74  |o 11042