Semantic Model Vectors



Description

We propose an intermediate semantic layer between low-level features and high-level concepts in order to bridge the notorious semantic gap. This representation, named Semantic Model Vectors, consists of hundreds of discriminative semantic detectors and is used as a basis for modeling and detecting complex concepts in unconstrained images and/or videos, such as those from a social media feed or YouTube. Each discriminative semantic classifier in the Semantic Model Vectors is trained from thousands of labeled images and/or videos,and organized in a visual taxonomy. Our experiments reveal that the proposed Semantic Model Vectors representation outperforms and is complementary to other low-level visual descriptors such as deep embeddings. We demonstrated the effectiveness of Semantic Model Vectors, both alone and in combination with other low-level descriptors, for multiple hihg-level recognition tasks, including:


publications

Michele Merler, Bert Huang, Lexing Xie, Gang Hua, Apostol Natsev. Semantic model vectors for complex video event recognition. IEEE Transactions on Multimedia (TMM) 2012. PDF BibTeX Project and Data


Michele Merler, Liangliang Cao, John R Smith. You are what you tweet… pic! gender prediction based on semantic analysis of social media images. IEEE International on Conference on Multimedia and Expo (ICME) 2015. PDF BibTeX Slides


Xiaolong Wang, Guodong Guo, Michele Merler, Noel CF Codella, MV Rohith, John R Smith, Chandra Kambhamettu. Leveraging multiple cues for recognizing family photos. Image and Vision Computing(IVC) 2017. PDF BibTeX


Junjie Cai, Michele Merler, Sharath Pankanti, Qi Tian. Heterogeneous semantic level features fusion for action recognition. IEEE International on Conference on Multimedia Retrieval (ICMR) 2015. PDF BibTeX


Noel Codella, Jonathan Connell, Sharath Pankanti, Michele Merler, John R Smith. Automated medical image modality recognition by fusion of visual and text information. International Conference on Medical Image Computing and Computer-Assisted Intervention(MICCAI) 2014. PDF BibTeX CLEF13 Slides