Int J Performability Eng ›› 2018, Vol. 14 ›› Issue (9): 2219-2229.doi: 10.23940/ijpe.18.09.p31.22192229

Previous Articles     Next Articles

Cross-Media Retrieval based on Pseudo-Label Learning and Semantic Consistency Algorithm

Gongwen Xua, *, Zhiqi Sangb, and Zhijun Zhangc   

  1. aSchool of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China;
    bCollege of Architecture and Urban Planning, Shandong Jianzhu University, Jinan, 250101, China;
    cSchool of Computer Science and Technology, Shandong Jianzhu University, Jinan, 250101, China
  • Revised on ; Accepted on
  • Contact: * E-mail address: xugongwen@163.com

Abstract: To retrieve heterogeneous multimodal data with the same semantics, many algorithms for retrieval over multimodal data have been suggested. The organization and analysis of heterogeneous data have become the focus of intensive research. Here, a new and efficient algorithm for cross-media retrieval is proposed based on pseudo-label learning and semantic consistency (PLSC). In this algorithm, an adaptive learning projection matrix optimization method is proposed, and in the process of learning the projection matrices, the method fully considers the semantic information of the labeled and unlabeled samples. Thus, the PLSC algorithm can utilize more useful information than other methods and can learn the more efficient projection matrices. Firstly, the class centers of labeled text are computed. We use median feature vectors as the class center vectors. Next, unlabeled images are projected onto the text space and are assigned pseudo-labels by comparing with the class center vectors of the text data. Finally, a new training dataset, which includes labeled and unlabeled data, is generated for training the projection matrix. Using the projection matrix to project image or text data onto the same feature space, the data can be compared with each other for similarity, and the distance between data points can be calculated using the Euclidean metric. Validation experiments suggest that the PLSC outperforms other state-of-the-art algorithms.

Key words: cross-media retrieval, pseudo-label, semi-supervised, semantic analysis