Heterogeneous Transfer Learning with Weighted Instance-Correspondence Data

  • Yuwei He, Tsinghua University
  • Xiaoming Jin, Tsinghua University
  • Guiguang Ding, Tsinghua University
  • Yuchen Guo, Tsinghua University
  • Jungong Han, University of Warwick
  • Jiyong Zhang, Hangzhou Dianzi University
  • Sicheng Zhao, Berkeley

Abstract

Instance-correspondence (IC) data are potent resources for heterogeneous transfer learning (HeTL) because they bridge the source and target domains at the instance level. Because manually establishing IC data is expensive and tedious, practitioners tend to use machine-generated IC data. However, existing IC data generators are imperfect and often produce low-quality data, which hampers the performance of domain adaptation. In this paper, rather than improving the IC data generator, which may not be the most effective route, we accept that data quality varies and seek a better way to use the data. Specifically, we propose a novel heterogeneous transfer learning method named Transfer Learning with Weighted Correspondence (TLWC), which uses IC data to adapt the source domain to the target domain. Rather than treating all IC data pairs equally, TLWC assigns a weight to each IC data pair according to the quality of that pair. We conduct extensive experiments on HeTL datasets, and the state-of-the-art results verify the effectiveness of TLWC.

Published
2020-04-03
Section
AAAI Technical Track: Machine Learning