• Skip to main content
  • Skip to primary sidebar
AAAI

AAAI

Association for the Advancement of Artificial Intelligence

    • AAAI

      AAAI

      Association for the Advancement of Artificial Intelligence

  • About AAAIAbout AAAI
    • AAAI Officers and Committees
    • AAAI Staff
    • Bylaws of AAAI
    • AAAI Awards
      • Fellows Program
      • Classic Paper Award
      • Dissertation Award
      • Distinguished Service Award
      • Allen Newell Award
      • Outstanding Paper Award
      • Award for Artificial Intelligence for the Benefit of Humanity
      • Feigenbaum Prize
      • Patrick Henry Winston Outstanding Educator Award
      • Engelmore Award
      • AAAI ISEF Awards
      • Senior Member Status
      • Conference Awards
    • AAAI Resources
    • AAAI Mailing Lists
    • Past AAAI Presidential Addresses
    • Presidential Panel on Long-Term AI Futures
    • Past AAAI Policy Reports
      • A Report to ARPA on Twenty-First Century Intelligent Systems
      • The Role of Intelligent Systems in the National Information Infrastructure
    • AAAI Logos
    • News
  • aaai-icon_ethics-diversity-line-yellowEthics & Diversity
  • Conference talk bubbleConferences & Symposia
    • AAAI Conference
    • AIES AAAI/ACM
    • AIIDE
    • IAAI
    • ICWSM
    • HCOMP
    • Spring Symposia
    • Summer Symposia
    • Fall Symposia
    • Code of Conduct for Conferences and Events
  • PublicationsPublications
    • AAAI Press
    • AI Magazine
    • Conference Proceedings
    • AAAI Publication Policies & Guidelines
    • Request to Reproduce Copyrighted Materials
  • aaai-icon_ai-magazine-line-yellowAI Magazine
    • Issues and Articles
    • Author Guidelines
    • Editorial Focus
  • MembershipMembership
    • Member Login
    • Developing Country List
    • AAAI Chapter Program

  • Career CenterCareer Center
  • aaai-icon_ai-topics-line-yellowAITopics
  • aaai-icon_contact-line-yellowContact

Home / Proceedings / Proceedings of the AAAI Conference on Artificial Intelligence, 35 /

No. 3: AAAI-21 Technical Tracks 3

AAAI Technical Track on Computer Vision II

  • BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation

    Haisheng Su, Weihao Gan, Wei Wu, Yu Qiao, Junjie Yan

    2602-2610

    PDF
  • MangaGAN: Unpaired Photo-to-Manga Translation Based on The Methodology of Manga Drawing

    Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Jiahe Cui, Ji Wan

    2611-2619

    PDF
  • MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection

    Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

    2620-2627

    PDF
  • Deep Probabilistic Imaging: Uncertainty Quantification and Multi-modal Solution Characterization for Computational Imaging

    He Sun, Katherine L. Bouman

    2628-2637

    PDF
  • Domain General Face Forgery Detection by Learning to Weight

    Ke Sun, Hong Liu, Qixiang Ye, Yue Gao, Jianzhuang Liu, Ling Shao, Rongrong Ji

    2638-2646

    PDF
  • Object-Centric Image Generation from Layouts

    Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R Devon Hjelm, Shikhar Sharma

    2647-2655

    PDF
  • Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation

    Jilin Tang, Yi Yuan, Tianjia Shao, Yong Liu, Mengmeng Wang, Kun Zhou

    2656-2664

    PDF
  • Gradient Regularized Contrastive Learning for Continual Domain Adaptation

    Shixiang Tang, Peng Su, Dapeng Chen, Wanli Ouyang

    2665-2673

    PDF
  • Adversarial Training Reduces Information and Improves Transferability

    Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

    2674-2682

    PDF
  • Adversarial Turing Patterns from Cellular Automata

    Nurislam Tursynbek, Ilya Vilkoviskiy, Maria Sindeeva, Ivan Oseledets

    2683-2691

    PDF
  • Artificial Dummies for Urban Dataset Augmentation

    Antonín Vobecký, David Hurych, Michal Uřičář, Patrick Pérez, Josef Sivic

    2692-2700

    PDF
  • SCNet: Training Inference Sample Consistency for Instance Segmentation

    Thang Vu, Haeyong Kang, Chang D. Yoo

    2701-2709

    PDF
  • Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning

    Chaoqun Wang, Xuejin Chen, Shaobo Min, Xiaoyan Sun, Houqiang Li

    2710-2718

    PDF
  • CHEF: Cross-modal Hierarchical Embeddings for Food Domain Retrieval

    Hai X. Pham, Ricardo Guerrero, Vladimir Pavlovic, Jiatong Li

    2423-2430

    PDF
  • Explainable Models with Consistent Interpretations

    Vipin Pillai, Hamed Pirsiavash

    2431-2439

    PDF
  • Dual Adversarial Graph Neural Networks for Multi-label Cross-modal Retrieval

    Shengsheng Qian, Dizhan Xue, Huaiwen Zhang, Quan Fang, Changsheng Xu

    2440-2448

    PDF
  • KGDet: Keypoint-Guided Fashion Detection

    Shenhan Qian, Dongze Lian, Binqiang Zhao, Tong Liu, Bohui Zhu, Hai Li, Shenghua Gao

    2449-2457

    PDF
  • Learning Modulated Loss for Rotated Object Detection

    Wen Qian, Xue Yang, Silong Peng, Junchi Yan, Yue Guo

    2458-2466

    PDF
  • MANGO: A Mask Attention Guided One-Stage Scene Text Spotter

    Liang Qiao, Ying Chen, Zhanzhan Cheng, Yunlu Xu, Yi Niu, Shiliang Pu, Fei Wu

    2467-2476

    PDF
  • REFINE: Prediction Fusion Network for Panoptic Segmentation

    Jiawei Ren, Cunjun Yu, Zhongang Cai, Mingyuan Zhang, Chongsong Chen, Haiyu Zhao, Shuai Yi, Hongsheng Li

    2477-2485

    PDF
  • AutoLR: Layer-wise Pruning and Auto-tuning of Learning Rates in Fine-tuning of Deep Networks

    Youngmin Ro, Jin Young Choi

    2486-2494

    PDF
  • DPFPS: Dynamic and Progressive Filter Pruning for Compressing Convolutional Neural Networks from Scratch

    Xiaofeng Ruan, Yufan Liu, Bing Li, Chunfeng Yuan, Weiming Hu

    2495-2503

    PDF
  • Efficient Certification of Spatial Robustness

    Anian Ruoss, Maximilian Baader, Mislav Balunović, Martin Vechev

    2504-2513

    PDF
  • Semantic Grouping Network for Video Captioning

    Hobin Ryu, Sunghun Kang, Haeyong Kang, Chang D. Yoo

    2514-2522

    PDF
  • Audio-Visual Localization by Synthetic Acoustic Image Generation

    Valentina Sanguineti, Pietro Morerio, Alessio Del Bue, Vittorio Murino

    2523-2531

    PDF
  • Enhanced Regularizers for Attributional Robustness

    Anindya Sarkar, Anirban Sarkar, Vineeth N Balasubramanian

    2532-2540

    PDF
  • Progressive Network Grafting for Few-Shot Knowledge Distillation

    Chengchao Shen, Xinchao Wang, Youtan Yin, Jie Song, Sihui Luo, Mingli Song

    2541-2549

    PDF
  • Social-DPF: Socially Acceptable Distribution Prediction of Futures

    Xiaodan Shi, Xiaowei Shao, Guangming Wu, Haoran Zhang, Zhiling Guo, Renhe Jiang, Ryosuke Shibasaki

    2550-2557

    PDF
  • Robust Knowledge Transfer via Hybrid Forward on the Teacher-Student Model

    Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan

    2558-2566

    PDF
  • AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

    Qi Song, Kangfu Mei, Rui Huang

    2567-2575

    PDF
  • To Choose or to Fuse? Scale Selection for Crowd Counting

    Qingyu Song, Changan Wang, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Jian Wu, Jiayi Ma

    2576-2583

    PDF
  • Image Captioning with Context-Aware Auxiliary Guidance

    Zeliang Song, Xiaofei Zhou, Zhendong Mao, Jianlong Tan

    2584-2592

    PDF
  • Unsupervised Model Adaptation for Continual Semantic Segmentation

    Serban Stan, Mohammad Rostami

    2593-2601

    PDF
  • Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

    Ziyi Liu, Le Wang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

    2242-2250

    PDF
  • PointINet: Point Cloud Frame Interpolation Network

    Fan Lu, Guang Chen, Sanqing Qu, Zhijun Li, Yinlong Liu, Alois Knoll

    2251-2259

    PDF
  • A Global Occlusion-Aware Approach to Self-Supervised Monocular Visual Odometry

    Yao Lu, Xiaoli Xu, Mingyu Ding, Zhiwu Lu, Tao Xiang

    2260-2268

    PDF
  • PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos

    Tianyu Luan, Yali Wang, Junhao Zhang, Zhe Wang, Zhipeng Zhou, Yu Qiao

    2269-2276

    PDF
  • DeepDT: Learning Geometry From Delaunay Triangulation for Surface Reconstruction

    Yiming Luo, Zhenxing Mi, Wenbing Tao

    2277-2285

    PDF
  • Dual-level Collaborative Transformer for Image Captioning

    Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji

    2286-2293

    PDF
  • HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation

    Xiaoyang Lyu, Liang Liu, Mengmeng Wang, Xin Kong, Lina Liu, Yong Liu, Xinxin Chen, Yi Yuan

    2294-2301

    PDF
  • SMIL: Multimodal Learning with Severely Missing Modality

    Mengmeng Ma, Jian Ren, Long Zhao, Sergey Tulyakov, Cathy Wu, Xi Peng

    2302-2310

    PDF
  • Pyramidal Feature Shrinking for Salient Object Detection

    Mingcan Ma, Changqun Xia, Jia Li

    2311-2318

    PDF
  • Learning to Count via Unbalanced Optimal Transport

    Zhiheng Ma, Xing Wei, Xiaopeng Hong, Hui Lin, Yunfeng Qiu, Yihong Gong

    2319-2327

    PDF
  • Scene Graph Embeddings Using Relative Similarity Supervision

    Paridhi Maheshwari, Ritwick Chaudhry, Vishwa Vinay

    2328-2336

    PDF
  • Few-Shot Lifelong Learning

    Pratik Mazumder, Pravendra Singh, Piyush Rai

    2337-2345

    PDF
  • CARPe Posterum: A Convolutional Approach for Real-Time Pedestrian Path Prediction

    Matias Mendieta, Hamed Tabkhi

    2346-2354

    PDF
  • Dynamic Anchor Learning for Arbitrary-Oriented Object Detection

    Qi Ming, Zhiqiang Zhou, Lingjuan Miao, Hongwei Zhang, Linhao Li

    2355-2363

    PDF
  • Terrace-based Food Counting and Segmentation

    Huu-Thanh Nguyen, Chong-Wah Ngo

    2364-2372

    PDF
  • Embodied Visual Active Learning for Semantic Segmentation

    David Nilsson, Aleksis Pirinen, Erik Gärtner, Cristian Sminchisescu

    2373-2383

    PDF
  • TDAF: Top-Down Attention Framework for Vision Tasks

    Bo Pang, Yizhuo Li, Jiefeng Li, Muchen Li, Hanwen Cao, Cewu Lu

    2384-2392

    PDF
  • Few-shot Font Generation with Localized Style Representations and Factorization

    Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim

    2393-2402

    PDF
  • Learning Disentangled Representation for Fair Facial Attribute Classification via Fairness-aware Information Alignment

    Sungho Park, Sunhee Hwang, Dohyung Kim, Hyeran Byun

    2403-2411

    PDF
  • Vid-ODE: Continuous-Time Video Generation with Neural Ordinary Differential Equation

    Sunghyun Park, Kangyeol Kim, Junsoo Lee, Jaegul Choo, Joonseok Lee, Sookyung Kim, Edward Choi

    2412-2422

    PDF
  • Single View Point Cloud Generation via Unified 3D Prototype

    Yu Lin, Yigong Wang, Yi-Fan Li, Zhuoyi Wang, Yang Gao, Latifur Khan

    2064-2072

    PDF
  • Self-Supervised Sketch-to-Image Synthesis

    Bingchen Liu, Yizhe Zhu, Kunpeng Song, Ahmed Elgammal

    2073-2081

    PDF
  • TIME: Text and Image Mutual-Translation Adversarial Networks

    Bingchen Liu, Kunpeng Song, Yizhe Zhu, Gerard de Melo, Ahmed Elgammal

    2082-2090

    PDF
  • SA-BNN: State-Aware Binary Neural Network

    Chunlei Liu, Peng Chen, Bohan Zhuang, Chunhua Shen, Baochang Zhang, Wenrui Ding

    2091-2099

    PDF
  • Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation

    Daizong Liu, Shuangjie Xu, Xiao-Yang Liu, Zichuan Xu, Wei Wei, Pan Zhou

    2100-2108

    PDF
  • F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation

    Daizong Liu, Dongdong Yu, Changhu Wang, Pan Zhou

    2109-2117

    PDF
  • Toward Realistic Virtual Try-on Through Landmark Guided Shape Matching

    Guoqiang Liu, Dan Song, Ruofeng Tong, Min Tang

    2118-2126

    PDF
  • Large Motion Video Super-Resolution with Dual Subnet and Multi-Stage Communicated Upsampling

    Hongying Liu, Peng Zhao, Zhubo Ruan, Fanhua Shang, Yuanyuan Liu

    2127-2135

    PDF
  • FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Depth Completion

    Lina Liu, Xibin Song, Xiaoyang Lyu, Junwei Diao, Mengmeng Wang, Yong Liu, Liangjun Zhang

    2136-2144

    PDF
  • Activity Image-to-Video Retrieval by Disentangling Appearance and Motion

    Liu Liu, Jiangtong Li, Li Niu, Ruicong Xu, Liqing Zhang

    2145-2153

    PDF
  • Adaptive Pattern-Parameter Matching for Robust Pedestrian Detection

    Mengyin Liu, Chao Zhu, Jun Wang, Xu-Cheng Yin

    2154-2162

    PDF
  • Temporal Segmentation of Fine-gained Semantic Action: A Motion-Centered Figure Skating Dataset

    Shenglan Liu, Aibin Zhang, Yunheng Li, Jian Zhou, Li Xu, Zhuben Dong, Renhao Zhang

    2163-2171

    PDF
  • Learning Hybrid Relationships for Person Re-identification

    Shuang Liu, Wenmin Huang, Zhong Zhang

    2172-2179

    PDF
  • Translate the Facial Regions You Like Using Self-Adaptive Region Translation

    Wenshuang Liu, Wenting Chen, Zhanjia Yang, Linlin Shen

    2180-2188

    PDF
  • Subtype-aware Unsupervised Domain Adaptation for Medical Diagnosis

    Xiaofeng Liu, Xiongchang Liu, Bo Hu, Wenxuan Ji, Fangxu Xing, Jun Lu, Jane You, C.-C. Jay Kuo, Georges El Fakhri, Jonghye Woo

    2189-2197

    PDF
  • FontRL: Chinese Font Synthesis via Deep Reinforcement Learning

    Yitian Liu, Zhouhui Lian

    2198-2206

    PDF
  • Hierarchical Information Passing Based Noise-Tolerant Hybrid Learning for Semi-Supervised Human Parsing

    Yunan Liu, Shanshan Zhang, Jian Yang, PongChi Yuen

    2207-2215

    PDF
  • Delving into Variance Transmission and Normalization: Shift of Average Gradient Makes the Network Collapse

    Yuxiang Liu, Jidong Ge, Chuanyi Li, Jie Gui

    2216-2224

    PDF
  • Aggregated Multi-GANs for Controlled 3D Human Motion Prediction

    Zhenguang Liu, Kedi Lyu, Shuang Wu, Haipeng Chen, Yanbin Hao, Shouling Ji

    2225-2232

    PDF
  • ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization

    Ziyi Liu, Le Wang, Qilin Zhang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

    2233-2241

    PDF
  • Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus

    Marius Leordeanu, Mihai Cristian Pîrvu, Dragos Costea, Alina E Marcu, Emil Slusanschi, Rahul Sukthankar

    1882-1892

    PDF
  • Static-Dynamic Interaction Networks for Offline Signature Verification

    Huan Li, Ping Wei, Ping Hu

    1893-1901

    PDF
  • Proposal-Free Video Grounding with Contextual Pyramid Network

    Kun Li, Dan Guo, Meng Wang

    1902-1910

    PDF
  • Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation

    Lincheng Li, Suzhen Wang, Zhimeng Zhang, Yu Ding, Yixing Zheng, Xin Yu, Changjie Fan

    1911-1920

    PDF
  • Exploiting Learnable Joint Groups for Hand Pose Estimation

    Moran Li, Yuan Gao, Nong Sang

    1921-1929

    PDF
  • RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving

    Peixuan Li, Shun Su, Huaici Zhao

    1930-1939

    PDF
  • Adversarial Pose Regression Network for Pose-Invariant Face Recognitions

    Pengyu Li, Biao Wang, Lei Zhang

    1940-1948

    PDF
  • Category Dictionary Guided Unsupervised Domain Adaptation for Object Detection

    Shuai Li, Jianqiang Huang, Xian-Sheng Hua, Lei Zhang

    1949-1957

    PDF
  • Joint Semantic-geometric Learning for Polygonal Building Segmentation

    Weijia Li, Wenqian Zhao, Huaping Zhong, Conghui He, Dahua Lin

    1958-1965

    PDF
  • Generalized Zero-Shot Learning via Disentangled Representation

    Xiangyu Li, Zhe Xu, Kun Wei, Cheng Deng

    1966-1974

    PDF
  • Learning Omni-Frequency Region-adaptive Representations for Real Image Super-Resolution

    Xin Li, Xin Jin, Tao Yu, Simeng Sun, Yingxue Pang, Zhizheng Zhang, Zhibo Chen

    1975-1983

    PDF
  • Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation

    Xueyi Li, Tianfei Zhou, Jianwu Li, Yi Zhou, Zhaoxiang Zhang

    1984-1992

    PDF
  • Inference Fusion with Associative Semantics for Unseen Object Detection

    Yanan Li, Pengyang Li, Han Cui, Donghui Wang

    1993-2001

    PDF
  • Deep Unsupervised Image Hashing by Maximizing Bit Entropy

    Yunqiang Li, Jan van Gemert

    2002-2010

    PDF
  • Sequential End-to-end Network for Efficient Person Search

    Zhengjia Li, Duoqian Miao

    2011-2019

    PDF
  • SD-Pose: Semantic Decomposition for Cross-Domain 6D Object Pose Estimation

    Zhigang Li, Yinlin Hu, Mathieu Salzmann, Xiangyang Ji

    2020-2028

    PDF
  • Temporal Pyramid Network for Pedestrian Trajectory Prediction with Multi-Supervision

    Rongqin Liang, Yuanman Li, Xia Li, Yi Tang, Jiantao Zhou, Wenbin Zou

    2029-2037

    PDF
  • Query-Memory Re-Aggregation for Weakly-supervised Video Object Segmentation

    Fanchao Lin, Hongtao Xie, Yan Li, Yongdong Zhang

    2038-2046

    PDF
  • Augmented Partial Mutual Learning with Frame Masking for Video Captioning

    Ke Lin, Zhuoxin Gan, Liwei Wang

    2047-2055

    PDF
  • Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation

    Yan-Bo Lin, Yu-Chiang Frank Wang

    2056-2063

    PDF
  • Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

    Minsu Kim, Sunghun Joung, Seungryong Kim, JungIn Park, Ig-Jae Kim, Kwanghoon Sohn

    1799-1807

    PDF
  • Bidirectional RNN-based Few Shot Learning for 3D Medical Image Segmentation

    Soopil Kim, Sion An, Philip Chikontwe, Sang Hyun Park

    1808-1816

    PDF
  • DASZL: Dynamic Action Signatures for Zero-shot Learning

    Tae Soo Kim, Jonathan Jones, Michael Peven, Zihao Xiao, Jin Bai, Yi Zhang, Weichao Qiu, Alan Yuille, Gregory D. Hager

    1817-1826

    PDF
  • Multi-level Distance Regularization for Deep Metric Learning

    Yonghyun Kim, Wonpyo Park

    1827-1835

    PDF
  • Dynamic to Static Lidar Scan Reconstruction Using Adversarially Trained Auto Encoder

    Prashant Kumar, Sabyasachi Sahoo, Vanshil Shah, Vineetha Kondameedi, Abhinav Jain, Akshaj Verma, Chiranjib Bhattacharyya, Vinay Vishwanath

    1836-1844

    PDF
  • Regularizing Attention Networks for Anomaly Detection in Visual Question Answering

    Doyup Lee, Yeongjae Cheon, Wook-Shin Han

    1845-1853

    PDF
  • Weakly-supervised Temporal Action Localization by Uncertainty Modeling

    Pilhyeon Lee, Jinglu Wang, Yan Lu, Hyeran Byun

    1854-1862

    PDF
  • Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency

    Seokju Lee, Sunghoon Im, Stephen Lin, In So Kweon

    1863-1872

    PDF
  • Patch-Wise Attention Network for Monocular Depth Estimation

    Sihaeng Lee, Janghyeon Lee, Byungju Kim, Eojindl Yi, Junmo Kim

    1873-1881

    PDF

Primary Sidebar