No. 3: AAAI-21 Technical Tracks 3
AAAI Technical Track on Computer Vision II
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation
MangaGAN: Unpaired Photo-to-Manga Translation Based on The Methodology of Manga Drawing
MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection
Deep Probabilistic Imaging: Uncertainty Quantification and Multi-modal Solution Characterization for Computational Imaging
Domain General Face Forgery Detection by Learning to Weight
Object-Centric Image Generation from Layouts
Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation
Gradient Regularized Contrastive Learning for Continual Domain Adaptation
Adversarial Training Reduces Information and Improves Transferability
Adversarial Turing Patterns from Cellular Automata
Artificial Dummies for Urban Dataset Augmentation
SCNet: Training Inference Sample Consistency for Instance Segmentation
Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning
CHEF: Cross-modal Hierarchical Embeddings for Food Domain Retrieval
Explainable Models with Consistent Interpretations
Dual Adversarial Graph Neural Networks for Multi-label Cross-modal Retrieval
KGDet: Keypoint-Guided Fashion Detection
Learning Modulated Loss for Rotated Object Detection
MANGO: A Mask Attention Guided One-Stage Scene Text Spotter
REFINE: Prediction Fusion Network for Panoptic Segmentation
AutoLR: Layer-wise Pruning and Auto-tuning of Learning Rates in Fine-tuning of Deep Networks
DPFPS: Dynamic and Progressive Filter Pruning for Compressing Convolutional Neural Networks from Scratch
Efficient Certification of Spatial Robustness
Semantic Grouping Network for Video Captioning
Audio-Visual Localization by Synthetic Acoustic Image Generation
Enhanced Regularizers for Attributional Robustness
Progressive Network Grafting for Few-Shot Knowledge Distillation
Social-DPF: Socially Acceptable Distribution Prediction of Futures
Robust Knowledge Transfer via Hybrid Forward on the Teacher-Student Model
AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing
To Choose or to Fuse? Scale Selection for Crowd Counting
Image Captioning with Context-Aware Auxiliary Guidance
Unsupervised Model Adaptation for Continual Semantic Segmentation
Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context
PointINet: Point Cloud Frame Interpolation Network
A Global Occlusion-Aware Approach to Self-Supervised Monocular Visual Odometry
PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos
DeepDT: Learning Geometry From Delaunay Triangulation for Surface Reconstruction
Dual-level Collaborative Transformer for Image Captioning
HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation
SMIL: Multimodal Learning with Severely Missing Modality
Pyramidal Feature Shrinking for Salient Object Detection
Learning to Count via Unbalanced Optimal Transport
Scene Graph Embeddings Using Relative Similarity Supervision
Few-Shot Lifelong Learning
CARPe Posterum: A Convolutional Approach for Real-Time Pedestrian Path Prediction
Dynamic Anchor Learning for Arbitrary-Oriented Object Detection
Terrace-based Food Counting and Segmentation
Embodied Visual Active Learning for Semantic Segmentation
TDAF: Top-Down Attention Framework for Vision Tasks
Few-shot Font Generation with Localized Style Representations and Factorization
Learning Disentangled Representation for Fair Facial Attribute Classification via Fairness-aware Information Alignment
Vid-ODE: Continuous-Time Video Generation with Neural Ordinary Differential Equation
Single View Point Cloud Generation via Unified 3D Prototype
Self-Supervised Sketch-to-Image Synthesis
TIME: Text and Image Mutual-Translation Adversarial Networks
SA-BNN: State-Aware Binary Neural Network
Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation
F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation
Toward Realistic Virtual Try-on Through Landmark Guided Shape Matching
Large Motion Video Super-Resolution with Dual Subnet and Multi-Stage Communicated Upsampling
FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Depth Completion
Activity Image-to-Video Retrieval by Disentangling Appearance and Motion
Adaptive Pattern-Parameter Matching for Robust Pedestrian Detection
Temporal Segmentation of Fine-gained Semantic Action: A Motion-Centered Figure Skating Dataset
Learning Hybrid Relationships for Person Re-identification
Translate the Facial Regions You Like Using Self-Adaptive Region Translation
Subtype-aware Unsupervised Domain Adaptation for Medical Diagnosis
FontRL: Chinese Font Synthesis via Deep Reinforcement Learning
Hierarchical Information Passing Based Noise-Tolerant Hybrid Learning for Semi-Supervised Human Parsing
Delving into Variance Transmission and Normalization: Shift of Average Gradient Makes the Network Collapse
Aggregated Multi-GANs for Controlled 3D Human Motion Prediction
ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus
Static-Dynamic Interaction Networks for Offline Signature Verification
Proposal-Free Video Grounding with Contextual Pyramid Network
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation
Exploiting Learnable Joint Groups for Hand Pose Estimation
RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving
Adversarial Pose Regression Network for Pose-Invariant Face Recognitions
Category Dictionary Guided Unsupervised Domain Adaptation for Object Detection
Joint Semantic-geometric Learning for Polygonal Building Segmentation
Generalized Zero-Shot Learning via Disentangled Representation
Learning Omni-Frequency Region-adaptive Representations for Real Image Super-Resolution
Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation
Inference Fusion with Associative Semantics for Unseen Object Detection
Deep Unsupervised Image Hashing by Maximizing Bit Entropy
Sequential End-to-end Network for Efficient Person Search
SD-Pose: Semantic Decomposition for Cross-Domain 6D Object Pose Estimation
Temporal Pyramid Network for Pedestrian Trajectory Prediction with Multi-Supervision
Query-Memory Re-Aggregation for Weakly-supervised Video Object Segmentation
Augmented Partial Mutual Learning with Frame Masking for Video Captioning
Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation
Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation
Bidirectional RNN-based Few Shot Learning for 3D Medical Image Segmentation
DASZL: Dynamic Action Signatures for Zero-shot Learning
Multi-level Distance Regularization for Deep Metric Learning
Dynamic to Static Lidar Scan Reconstruction Using Adversarially Trained Auto Encoder
Regularizing Attention Networks for Anomaly Detection in Visual Question Answering
Weakly-supervised Temporal Action Localization by Uncertainty Modeling
Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency
Patch-Wise Attention Network for Monocular Depth Estimation