Int J Performability Eng ›› 2021, Vol. 17 ›› Issue (1): 50-59.doi: 10.23940/ijpe.21.01.p5.5059

• Orginal Article • Previous Articles     Next Articles

Multi-Modal Input Mode via Graph Neural Networks for Outfit Compatibility

Huaiguang Wu, Yan Li*, Baohua Jin, Wenjun Shi, and Bin Lu   

  1. School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou, 450001, China
  • Submitted on ; Accepted on
  • Contact: * Corresponding author. E-mail address: liyanjuly@163.com
  • About author:
    Huaiguang Wu received his Ph.D. in computing from Wuhan University in 2011. He is affiliated with the School of Computer and Communication Engineering at Zhengzhou University of Light Industry. He was a postdoctoral fellow at Peking University and a research visitor at the University of Edinburgh from 2017 to 2018. His research interests include formal methods, software engineering, and algorithms. (Email: hgwu@zzuli.edu.cn)
    Yan Li is a postgraduate student studying big data outfit compatibility at Zhengzhou University of Light Industry. Her research interests include artificial intelligence, big data analysis, and machine learning. (Email: liyanjuly@163.com)
    Baohua Jin received his master's degree from Huazhong University of Science and Technology in 2003. Currently, he is a professor at Zhengzhou University of Light Industry. His research interests include artificial intelligence, and machine learning. (Email: jinbh@zzuli.deu.cn)
    Wenjun Shi is affiliated with the School of Computer and Communication Engineering at Zhengzhou University of Light Industry. Her research interests include big data analysis and algorithms. (Email: swjij@sina.com)
    Bin Lu is a postgraduate student studying big data fashion trends at Zhengzhou University of Light Industry. His research interests include artificial intelligence, big data analysis, and machine learning. (Email: binlu.computer@foxmail.com)

Abstract:

Technology changes life. Recently, state-of-the-art of deep learning has inspired the development of deep-learning based approaches for outfit compatibility. Constructive and useful advice on clothing matching is provided to people by outfit compatibility technology. Nevertheless, there are still some shortcomings to be overcome. To be specific, outfit compatibility is a subjective method to evaluate the effect of cloth matching because of economic factors such as price, but price factor has not been taken into consideration in previous works. In addition, the relationships among vision, text, and price features are complex. Therefore, a multi-modal input graph neural network (MI-GNN) model is proposed to solve these shortcomings. This model can not only better capture the complex relations between various items in an outfit, but also model outfit compatibility from multiple modalities. In fill-in-the-blank task and outfit compatibility prediction, the performance of compatibility modeling improved the performance by 0.96% and 0.56%, respectively.

Key words: outfit compatibility, graph neural networks, compatibility modeling