Nonnegative cross-media recoding of visual-auditory content for social media analysis

Verfasser / Beitragende:
[Hong Zhang, Xin Xu]
Ort, Verlag, Jahr:
2015
Enthalten in:
Multimedia Tools and Applications, 74/2(2015-01-01), 577-593
Format:
Artikel (online)
ID: 605446806
LEADER caa a22 4500
001 605446806
003 CHVBK
005 20210128100129.0
007 cr unu---uuuuu
008 210128e20150101xx s 000 0 eng
024 7 0 |a 10.1007/s11042-014-1970-x  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s11042-014-1970-x 
245 0 0 |a Nonnegative cross-media recoding of visual-auditory content for social media analysis  |h [Elektronische Daten]  |c [Hong Zhang, Xin Xu] 
520 3 |a Cross-media semantics understanding, which focuses on multimedia data of different modalities, is a rising hot topic in social media analysis. One of the most challenging issues for cross-media semantics understanding is how to represent multimedia data of different modalities. Most traditional multimedia semantics analysis works are based on single modality data sources, such as Flickr images or YouTube videos, leaving efficient cross-media data representation wide open. In this paper, we propose a novel nonnegative cross-media recoding approach, which learns co-occurrences of cross-media feature spaces by explicitly learning a common subset of basis vectors. Moreover, we impose the nonnegativity constraint on the decomposed matrices so that the basis vectors represent important and locally meaningful features of the cross-media data. We take two kinds of typical multimedia data, that is, image and audio, as experimental data. Our approach can be applied to a wide range of multimedia applications. The experiments are conducted on image-audio dataset for applications of cross-media retrieval and data clustering. Experiment results are encouraging and show that the performance of our approach is effective. 
540 |a Springer Science+Business Media New York, 2014 
690 7 |a Cross-media  |2 nationallicence 
690 7 |a Subspace learning  |2 nationallicence 
690 7 |a Distance metric  |2 nationallicence 
690 7 |a Data clustering  |2 nationallicence 
700 1 |a Zhang  |D Hong  |u College of Computer Science & Technology, Wuhan University of Science & Technology, 430081, Wuhan, China  |4 aut 
700 1 |a Xu  |D Xin  |u College of Computer Science & Technology, Wuhan University of Science & Technology, 430081, Wuhan, China  |4 aut 
773 0 |t Multimedia Tools and Applications  |d Springer US; http://www.springer-ny.com  |g 74/2(2015-01-01), 577-593  |x 1380-7501  |q 74:2<577  |1 2015  |2 74  |o 11042 
856 4 0 |u https://doi.org/10.1007/s11042-014-1970-x  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s11042-014-1970-x  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Zhang  |D Hong  |u College of Computer Science & Technology, Wuhan University of Science & Technology, 430081, Wuhan, China  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Xu  |D Xin  |u College of Computer Science & Technology, Wuhan University of Science & Technology, 430081, Wuhan, China  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Multimedia Tools and Applications  |d Springer US; http://www.springer-ny.com  |g 74/2(2015-01-01), 577-593  |x 1380-7501  |q 74:2<577  |1 2015  |2 74  |o 11042