• Skip to main content
  • Skip to primary sidebar
AAAI

AAAI

Association for the Advancement of Artificial Intelligence

    • AAAI

      AAAI

      Association for the Advancement of Artificial Intelligence

  • About AAAIAbout AAAI
    • News
    • AAAI Officers and Committees
    • AAAI Staff
    • Bylaws of AAAI
    • AAAI Awards
      • Fellows Program
      • Classic Paper Award
      • Dissertation Award
      • Distinguished Service Award
      • Allen Newell Award
      • Outstanding Paper Award
      • Award for Artificial Intelligence for the Benefit of Humanity
      • Feigenbaum Prize
      • Patrick Henry Winston Outstanding Educator Award
      • Engelmore Award
      • AAAI ISEF Awards
      • Senior Member Status
      • Conference Awards
    • AAAI Resources
    • AAAI Mailing Lists
    • Past AAAI Presidential Addresses
    • Presidential Panel on Long-Term AI Futures
    • Past AAAI Policy Reports
      • A Report to ARPA on Twenty-First Century Intelligent Systems
      • The Role of Intelligent Systems in the National Information Infrastructure
    • AAAI Logos
  • aaai-icon_ethics-diversity-line-yellowEthics & Diversity
  • Conference talk bubbleConferences & Symposia
    • AAAI Conference
    • AIES AAAI/ACM
    • AIIDE
    • IAAI
    • ICWSM
    • HCOMP
    • Spring Symposia
    • Summer Symposia
    • Fall Symposia
    • Code of Conduct for Conferences and Events
  • PublicationsPublications
    • AAAI Press
    • AI Magazine
    • Conference Proceedings
    • AAAI Publication Policies & Guidelines
    • Request to Reproduce Copyrighted Materials
  • aaai-icon_ai-magazine-line-yellowAI Magazine
    • Issues and Articles
    • Author Guidelines
    • Editorial Focus
  • MembershipMembership
    • Member Login
    • Developing Country List
    • AAAI Chapter Program

  • Career CenterCareer Center
  • aaai-icon_ai-topics-line-yellowAITopics
  • aaai-icon_contact-line-yellowContact

  • Twitter
  • Facebook
  • LinkedIn
Home / Proceedings / Proceedings of the AAAI Conference on Artificial Intelligence, 35 /

No. 4: AAAI-21 Technical Tracks 4

AAAI Technical Track on Computer Vision III

  • Context-Guided Adaptive Network for Efficient Human Pose Estimation

    Lei Zhao, Jun Wen, Pengfei Wang, Nenggan Zheng

    3492-3499

    PDF
  • ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation

    Sicheng Zhao, Yezhen Wang, Bo Li, Bichen Wu, Yang Gao, Pengfei Xu, Trevor Darrell, Kurt Keutzer

    3500-3509

    PDF
  • Robust Lightweight Facial Expression Recognition Network with Label Distribution Training

    Zengqun Zhao, Qingshan Liu, Feng Zhou

    3510-3519

    PDF
  • Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification

    Zhiwei Zhao, Bin Liu, Qi Chu, Yan Lu, Nenghai Yu

    3520-3528

    PDF
  • Robust Multi-Modality Person Re-identification

    Aihua Zheng, Zi Wang, Zihan Chen, Chenglong Li, Jin Tang

    3529-3537

    PDF
  • Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification

    Kecheng Zheng, Cuiling Lan, Wenjun Zeng, Zhizheng Zhang, Zheng-Jun Zha

    3538-3546

    PDF
  • RESA: Recurrent Feature-Shift Aggregator for Lane Detection

    Tu Zheng, Hao Fang, Yi Zhang, Wenjian Tang, Zheng Yang, Haifeng Liu, Deng Cai

    3547-3554

    PDF
  • CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point Cloud

    Wu Zheng, Weiliang Tang, Sijin Chen, Li Jiang, Chi-Wing Fu

    3555-3562

    PDF
  • Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition

    Benjia Zhou, Yunan Li, Jun Wan

    3563-3571

    PDF
  • Deep Semantic Dictionary Learning for Multi-label Image Classification

    Fengtao Zhou, Sheng Huang, Yun Xing

    3572-3580

    PDF
  • Model Uncertainty Guides Visual Object Tracking

    Lijun Zhou, Antoine Ledent, Qintao Hu, Ting Liu, Jianlin Zhang, Marius Kloft

    3581-3589

    PDF
  • Optimizing Information Theory Based Bitwise Bottlenecks for Efficient Mixed-Precision Activation Quantization

    Xichuan Zhou, Kui Liu, Cong Shi, Haijun Liu, Ji Liu

    3590-3598

    PDF
  • Inferring Camouflaged Objects by Texture-Aware Interactive Guidance Network

    Jinchao Zhu, Xiaoyu Zhang, Shuo Zhang, Junnan Liu

    3599-3607

    PDF
  • Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps

    Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu

    3608-3615

    PDF
  • Fooling Thermal Infrared Pedestrian Detectors in Real World Using Small Bulbs

    Xiaopei Zhu, Xiao Li, Jianmin Li, Zheyao Wang, Xiaolin Hu

    3616-3624

    PDF
  • ASHF-Net: Adaptive Sampling and Hierarchical Folding Network for Robust Point Cloud Completion

    Daoming Zong, Shiliang Sun, Jing Zhao

    3625-3632

    PDF
  • Visual Tracking via Hierarchical Deep Reinforcement Learning

    Dawei Zhang, Zhonglong Zheng, Riheng Jia, Minglu Li

    3315-3323

    PDF
  • One for More: Selecting Generalizable Samples for Generalizable ReID Model

    Enwei Zhang, Xinyang Jiang, Hao Cheng, Ancong Wu, Fufu Yu, Ke Li, Xiaowei Guo, Feng Zheng, Weishi Zheng, Xing Sun

    3324-3332

    PDF
  • Ada-Segment: Automated Multi-loss Adaptation for Panoptic Segmentation

    Gengwei Zhang, Yiming Gao, Hang Xu, Hao Zhang, Zhenguo Li, Xiaodan Liang

    3333-3341

    PDF
  • SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation

    Jiabin Zhang, Zheng Zhu, Jiwen Lu, Junjie Huang, Guan Huang, Jie Zhou

    3342-3350

    PDF
  • Enhancing Audio-Visual Association with Self-Supervised Curriculum Learning

    Jingran Zhang, Xing Xu, Fumin Shen, Huimin Lu, Xin Liu, Heng Tao Shen

    3351-3359

    PDF
  • Unsupervised Domain Adaptation for Person Re-identification via Heterogeneous Graph Alignment

    Minying Zhang, Kai Liu, Yidong Li, Shihui Guo, Hongtao Duan, Yimin Long, Yi Jin

    3360-3368

    PDF
  • Proactive Privacy-preserving Learning for Retrieval

    Peng-Fei Zhang, Zi Huang, Xin-Shun Xu

    3369-3376

    PDF
  • A Novel Visual Interpretability for Deep Neural Networks by Optimizing Activation Maps with Perturbation

    Qinglong Zhang, Lu Rao, Yubin Yang

    3377-3384

    PDF
  • Point Cloud Semantic Scene Completion from RGB-D Images

    Shoulong Zhang, Shuai Li, Aimin Hao, Hong Qin

    3385-3393

    PDF
  • Consensus Graph Representation Learning for Better Grounded Image Captioning

    Wenqiao Zhang, Haochen Shi, Siliang Tang, Jun Xiao, Qiang Yu, Yueting Zhuang

    3394-3402

    PDF
  • BoW Pooling: A Plug-and-Play Unit for Feature Aggregation of Point Clouds

    Xiang Zhang, Xiao Sun, Zhouhui Lian

    3403-3411

    PDF
  • Diverse Knowledge Distillation for End-to-End Person Search

    Xinyu Zhang, Xinlong Wang, Jia-Wang Bian, Chunhua Shen, Mingyu You

    3412-3420

    PDF
  • Weakly Supervised Semantic Segmentation for Large-Scale Point Cloud

    Yachao Zhang, Zonghao Li, Yuan Xie, Yanyun Qu, Cuihua Li, Tao Mei

    3421-3429

    PDF
  • PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection

    Yanan Zhang, Di Huang, Yunhong Wang

    3430-3437

    PDF
  • Efficient License Plate Recognition via Holistic Position Attention

    Yesheng Zhang, Zilei Wang, Jiafan Zhuang

    3438-3446

    PDF
  • Bag of Tricks for Long-Tailed Visual Recognition with Deep Convolutional Neural Networks

    Yongshun Zhang, Xiu-Shen Wei, Boyan Zhou, Jianxin Wu

    3447-3455

    PDF
  • Depth Privileged Object Detection in Indoor Scenes via Deformation Hallucination

    Zhijie Zhang, Yan Liu, Junjie Chen, Li Niu, Liqing Zhang

    3456-3464

    PDF
  • Learning Flexibly Distributional Representation for Low-quality 3D Face Recognition

    Zihui Zhang, Cuican Yu, Shuang Xu, Huibin Li

    3465-3473

    PDF
  • IA-GM: A Deep Bidirectional Learning Method for Graph Matching

    Kaixuan Zhao, Shikui Tu, Lei Xu

    3474-3482

    PDF
  • Distribution Adaptive INT8 Quantization for Training CNNs

    Kang Zhao, Sida Huang, Pan Pan, Yinghan Li, Yingya Zhang, Zhenyu Gu, Yinghui Xu

    3483-3491

    PDF
  • Object Relation Attention for Image Paragraph Captioning

    Li-Chuan Yang, Chih-Yuan Yang, Jane Yung-jen Hsu

    3136-3144

    PDF
  • Adversarial Robustness through Disentangled Representations

    Shuo Yang, Tianyu Guo, Yunhe Wang, Chang Xu

    3145-3153

    PDF
  • CPCGAN: A Controllable 3D Point Cloud Generative Adversarial Network with Semantic Label Generating

    Ximing Yang, Yuan Wu, Kaiyi Zhang, Cheng Jin

    3154-3162

    PDF
  • R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object

    Xue Yang, Junchi Yan, Ziming Feng, Tao He

    3163-3171

    PDF
  • One-shot Face Reenactment Using Appearance Adaptive Normalization

    Guangming Yao, Yi Yuan, Tianjia Shao, Shuang Li, Shanqi Liu, Yong Liu, Mengmeng Wang, Kun Zhou

    3172-3180

    PDF
  • A Case Study of the Shortcut Effects in Visual Commonsense Reasoning

    Keren Ye, Adriana Kovashka

    3181-3189

    PDF
  • Instance Mining with Class Feature Banks for Weakly Supervised Object Detection

    Yufei Yin, Jiajun Deng, Wengang Zhou, Houqiang Li

    3190-3198

    PDF
  • Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition

    Bruce X.B. Yu, Yan Liu, Keith C.C. Chan

    3199-3207

    PDF
  • ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs

    Fei Yu, Jiji Tang, Weichong Yin, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

    3208-3216

    PDF
  • High-Resolution Deep Image Matting

    Haichao Yu, Ning Xu, Zilong Huang, Yuqian Zhou, Humphrey Shi

    3217-3224

    PDF
  • CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks

    Qihang Yu, Yingwei Li, Jieru Mei, Yuyin Zhou, Alan Yuille

    3225-3233

    PDF
  • Structure-Consistent Weakly Supervised Salient Object Detection with Local Saliency Coherence

    Siyue Yu, Bingfeng Zhang, Jimin Xiao, Eng Gee Lim

    3234-3242

    PDF
  • Fast and Compact Bilinear Pooling by Shifted Random Maclaurin

    Tan Yu, Xiaoyun Li, Ping Li

    3243-3251

    PDF
  • Simple and Effective Stochastic Neural Networks

    Tianyuan Yu, Yongxin Yang, Da Li, Timothy Hospedales, Tao Xiang

    3252-3260

    PDF
  • Learning Visual Context for Group Activity Recognition

    Hangjie Yuan, Dong Ni

    3261-3269

    PDF
  • StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding

    Jinshan Zeng, Qi Chen, Yunxin Liu, Mingwen Wang, Yuan Yao

    3270-3277

    PDF
  • Demodalizing Face Recognition with Synthetic Samples

    Zhonghua Zhai, Pengju Yang, Xiaofeng Zhang, Maji Huang, Haijing Cheng, Xuejun Yan, Chunmao Wang, Shiliang Pu

    3278-3286

    PDF
  • EMLight: Lighting Estimation via Spherical Distribution Approximation

    Fangneng Zhan, Changgong Zhang, Yingchen Yu, Yuan Chang, Shijian Lu, Feiying Ma, Xuansong Xie

    3287-3295

    PDF
  • Universal Adversarial Perturbations Through the Lens of Deep Steganography: Towards a Fourier Perspective

    Chaoning Zhang, Philipp Benz, Adil Karjauv, In So Kweon

    3296-3304

    PDF
  • SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition

    Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Fei Wu, Futai Zou

    3305-3314

    PDF
  • Non-Autoregressive Coarse-to-Fine Video Captioning

    Bang Yang, Yuexian Zou, Fenglin Liu, Can Zhang

    3119-3127

    PDF
  • Learning to Attack Real-World Models for Person Re-identification via Virtual-Guided Meta-Learning

    Fengxiang Yang, Zhun Zhong, Hong Liu, Zheng Wang, Zhiming Luo, Shaozi Li, Nicu Sebe, Shin'ichi Satoh

    3128-3135

    PDF
  • Binaural Audio-Visual Localization

    Xinyi Wu, Zhenyao Wu, Lili Ju, Song Wang

    2961-2968

    PDF
  • Beating Attackers At Their Own Games: Adversarial Example Detection Using Adversarial Gradient Directions

    Yuhang Wu, Sunpreet S Arora, Yanhong Wu, Hao Yang

    2969-2977

    PDF
  • Shape-Pose Ambiguity in Learning 3D Reconstruction from Images

    Yunjie Wu, Zhengxing Sun, Youcheng Song, Yunhan Sun, YiJie Zhong, Jinlong Shi

    2978-2985

    PDF
  • Boundary Proposal Network for Two-stage Natural Language Video Localization

    Shaoning Xiao, Long Chen, Songyang Zhang, Wei Ji, Jian Shao, Lu Ye, Jun Xiao

    2986-2994

    PDF
  • Amodal Segmentation Based on Visible Region Segmentation and Shape Prior

    Yuting Xiao, Yanyu Xu, Ziming Zhong, Weixin Luo, Jiawei Li, Shenghua Gao

    2995-3003

    PDF
  • Locate Globally, Segment Locally: A Progressive Architecture With Knowledge Review Network for Salient Object Detection

    Binwei Xu, Haoran Liang, Ronghua Liang, Peng Chen

    3004-3012

    PDF
  • Invariant Teacher and Equivariant Student for Unsupervised 3D Human Pose Estimation

    Chenxin Xu, Siheng Chen, Maosen Li, Ya Zhang

    3013-3021

    PDF
  • Imagine, Reason and Write: Visual Storytelling with Graph Knowledge and Relational Reasoning

    Chunpu Xu, Min Yang, Chengming Li, Ying Shen, Xiang Ao, Ruifeng Xu

    3022-3029

    PDF
  • Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation

    Hongbin Xu, Zhipeng Zhou, Yu Qiao, Wenxiong Kang, Qiuxia Wu

    3030-3038

    PDF
  • Efficient Deep Image Denoising via Class Specific Convolution

    Lu Xu, Jiawei Zhang, Xuanye Cheng, Feng Zhang, Xing Wei, Jimmy Ren

    3039-3046

    PDF
  • Investigate Indistinguishable Points in Semantic Segmentation of 3D Point Cloud

    Mingye Xu, Zhipeng Zhou, Junhao Zhang, Yu Qiao

    3047-3055

    PDF
  • Learning Geometry-Disentangled Representation for Complementary Understanding of 3D Object Point Cloud

    Mutian Xu, Junhao Zhang, Zhipeng Zhou, Mingye Xu, Xiaojuan Qi, Yu Qiao

    3056-3064

    PDF
  • Searching for Alignment in Face Recognition

    Xiaqing Xu, Qiang Meng, Yunxiao Qin, Jianzhu Guo, Chenxu Zhao, Feng Zhou, Zhen Lei

    3065-3073

    PDF
  • GIF Thumbnails: Attract More Clicks to Your Videos

    Yi Xu, Fan Bai, Yingxuan Shi, Qiuyu Chen, Longwen Gao, Kai Tian, Shuigeng Zhou, Huyang Sun

    3074-3082

    PDF
  • FaceController: Controllable Attribute Editing for Face in the Wild

    Zhiliang Xu, Xiyu Yu, Zhibin Hong, Zhen Zhu, Junyu Han, Jingtuo Liu, Errui Ding, Xiang Bai

    3083-3091

    PDF
  • AnchorFace: An Anchor-based Facial Landmark Detector Across Large Poses

    Zixuan Xu, Banghuai Li, Ye Yuan, Miao Geng

    3092-3100

    PDF
  • Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion

    Xu Yan, Jiantao Gao, Jie Li, Ruimao Zhang, Zhen Li, Rui Huang, Shuguang Cui

    3101-3109

    PDF
  • Learning Semantic Context from Normal Samples for Unsupervised Anomaly Detection

    Xudong Yan, Huaidong Zhang, Xuemiao Xu, Xiaowei Hu, Pheng-Ann Heng

    3110-3118

    PDF
  • PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

    Pengfei Wang, Chengquan Zhang, Fei Qi, Shanshan Liu, Xiaoqiang Zhang, Pengyuan Lyu, Junyu Han, Jingtuo Liu, Errui Ding, Guangming Shi

    2782-2790

    PDF
  • Dynamic Position-aware Network for Fine-grained Image Recognition

    Shijie Wang, Haojie Li, Zhihui Wang, Wanli Ouyang

    2791-2799

    PDF
  • Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection

    Tiancai Wang, Tong Yang, Jiale Cao, Xiangyu Zhang

    2800-2808

    PDF
  • Very Important Person Localization in Unconstrained Conditions: A New Benchmark

    Xiao Wang, Zheng Wang, Toshihiko Yamasaki, Wenjun Zeng

    2809-2816

    PDF
  • Teacher Guided Neural Architecture Search for Face Recognition

    Xiaobo Wang

    2817-2825

    PDF
  • Deep Multi-Task Learning for Diabetic Retinopathy Grading in Fundus Images

    Xiaofei Wang, Mai Xu, Jicong Zhang, Lai Jiang, Liu Li

    2826-2834

    PDF
  • Confidence-aware Non-repetitive Multimodal Transformers for TextCaps

    Zhaokai Wang, Renda Bao, Qi Wu, Si Liu

    2835-2843

    PDF
  • Geodesic-HOF: 3D Reconstruction Without Cutting Corners

    Ziyun Wang, Eric A. Mitchell, Volkan Isler, Daniel D. Lee

    2844-2851

    PDF
  • C2F-FWN: Coarse-to-Fine Flow Warping Network for Spatial-Temporal Consistent Motion Transfer

    Dongxu Wei, Xiaowei Xu, Haibin Shen, Kejie Huang

    2852-2860

    PDF
  • Semantic Consistency Networks for 3D Object Detection

    Wenwen Wei, Ping Wei, Nanning Zheng

    2861-2869

    PDF
  • Holistic Multi-View Building Analysis in the Wild with Projection Pooling

    Zbigniew Wojna, Krzysztof Maziarz, Łukasz Jocz, Robert Pałuba, Robert Kozikowski, Iason Kokkinos

    2870-2878

    PDF
  • Stereopagnosia: Fooling Stereo Networks with Adversarial Perturbations

    Alex Wong, Mukund Mundhra, Stefano Soatto

    2879-2888

    PDF
  • Generalising without Forgetting for Lifelong Person Re-Identification

    Guile Wu, Shaogang Gong

    2889-2897

    PDF
  • Decentralised Learning from Independent Multi-Domain Labels for Person Re-Identification

    Guile Wu, Shaogang Gong

    2898-2906

    PDF
  • Region-aware Global Context Modeling for Automatic Nerve Segmentation from Ultrasound Images

    Huisi Wu, Jiasheng Liu, Wei Wang, Zhenkun Wen, Jing Qin

    2907-2915

    PDF
  • Precise Yet Efficient Semantic Calibration and Refinement in ConvNets for Real-time Polyp Segmentation from Colonoscopy Videos

    Huisi Wu, Jiafu Zhong, Wei Wang, Zhenkun Wen, Jing Qin

    2916-2924

    PDF
  • Graph-to-Graph: Towards Accurate and Interpretable Online Handwritten Mathematical Expression Recognition

    Jin-Wen Wu, Fei Yin, Yan-Ming Zhang, Xu-Yao Zhang, Cheng-Lin Liu

    2925-2933

    PDF
  • Learning Comprehensive Motion Representation for Action Recognition

    Mingyu Wu, Boyuan Jiang, Donghao Luo, Junchi Yan, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xiaokang Yang

    2934-2942

    PDF
  • MVFNet: Multi-View Fusion Network for Efficient Video Recognition

    Wenhao Wu, Dongliang He, Tianwei Lin, Fu Li, Chuang Gan, Errui Ding

    2943-2951

    PDF
  • Anticipating Future Relations via Graph Growing for Action Prediction

    Xinxiao Wu, Jianwei Zhao, Ruiqi Wang

    2952-2960

    PDF
  • Efficient Object-Level Visual Context Modeling for Multimodal Machine Translation: Masking Irrelevant Objects Helps Grounding

    Dexin Wang, Deyi Xiong

    2720-2728

    PDF
  • Temporal Relational Modeling with Self-Supervision for Action Segmentation

    Dong Wang, Di Hu, Xingjian Li, Dejing Dou

    2729-2737

    PDF
  • Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

    Jiapeng Wang, Chongyu Liu, Lianwen Jin, Guozhi Tang, Jiaxin Zhang, Shuaitao Zhang, Qianying Wang, Yaqiang Wu, Mingxiang Cai

    2738-2745

    PDF
  • Self-Domain Adaptation for Face Anti-Spoofing

    Jingjing Wang, Jingyi Zhang, Ying Bian, Youyi Cai, Chunmao Wang, Shiliang Pu

    2746-2754

    PDF
  • Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval

    Jinpeng Wang, Bin Chen, Qiang Zhang, Zaiqiao Meng, Shangsong Liang, Shutao Xia

    2755-2763

    PDF
  • Camera-Aware Proxies for Unsupervised Person Re-Identification

    Menglin Wang, Baisheng Lai, Jianqiang Huang, Xiaojin Gong, Xian-Sheng Hua

    2764-2772

    PDF
  • Unsupervised 3D Learning for Shape Analysis via Multiresolution Instance Discrimination

    Peng-Shuai Wang, Yu-Qi Yang, Qian-Fang Zou, Zhirong Wu, Yang Liu, Xin Tong

    2773-2781

    PDF

Primary Sidebar