No. 1: AAAI-22 Technical Tracks 1
AAAI Technical Track on Cognitive Modeling & Cognitive Systems
Learning Unseen Emotions from Gestures via Semantically-Conditioned Zero-Shot Perception with Adversarial Autoencoders
Optimized Potential Initialization for Low-Latency Spiking Neural Networks
Planning with Biological Neurons and Synapses
Backprop-Free Reinforcement Learning with Active Neural Generative Coding
VECA: A New Benchmark and Toolkit for General Cognitive Development
Bridging between Cognitive Processing Signals and Linguistic Features via a Unified Attentional Network
Multi-Sacle Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning
AAAI Technical Track on Computer Vision I
Deep Translation Prior: Test-Time Training for Photorealistic Style Transfer
PrivateSNN: Privacy-Preserving Spiking Neural Networks
NaturalInversion: Data-Free Image Synthesis Improving Real-World Consistency
Joint 3D Object Detection and Tracking Using Spatio-Temporal Representation of Camera Image and LiDAR Point Clouds
Learning to Model Pixel-Embedded Affinity for Homogeneous Instance Segmentation
Channelized Axial Attention – considering Channel Relation within Spatial Attention for Semantic Segmentation
UFPMP-Det: Toward Accurate and Efficient Object Detection on Drone Imagery
Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-identification
MuMu: Cooperative Multitask Learning-Based Guided Multimodal Fusion
An Unsupervised Way to Understand Artifact Generating Internal Units in Generative Neural Networks
FrePGAN: Robust Deepfake Detection Using Frequency-Level Perturbations
Learning Disentangled Attribute Representations for Robust Pedestrian Attribute Recognition
Degrade Is Upgrade: Learning Degradation for Low-Light Image Enhancement
HarmoFL: Harmonizing Local and Global Drifts in Federated Learning on Heterogeneous Medical Images
Coarse-to-Fine Generative Modeling for Graphic Layouts
DarkVisionNet: Low-Light Imaging via RGB-NIR Fusion with Deep Inconsistency Prior
LAGConv: Local-Context Adaptive Convolution Kernels with Global Harmonic Bias for Pansharpening
Learning the Dynamics of Visual Relational Reasoning via Reinforced Path Routing
Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action Recognition
MODNet: Real-Time Trimap-Free Portrait Matting via Objective Decomposition
Learning Mixture of Domain-Specific Experts via Disentangled Factors for Autonomous Driving
Towards Versatile Pedestrian Detector with Multisensory-Matching and Multispectral Recalling Memory
Semantic Feature Extraction for Generalized Zero-Shot Learning
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading
RRL: Regional Rotate Layer in Convolutional Neural Networks
QueryProp: Object Query Propagation for High-Performance Video Object Detection
Flow-Based Unconstrained Lip to Speech Generation
TransFG: A Transformer Architecture for Fine-Grained Recognition
Self-Supervised Robust Scene Flow Estimation via the Alignment of Probability Density Functions
SVGA-Net: Sparse Voxel-Graph Attention Network for 3D Object Detection from Point Clouds
SECRET: Self-Consistent Pseudo Label Refinement for Unsupervised Domain Adaptive Person Re-identification
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives
Uncertainty-Driven Dehazing Network
Shadow Generation for Composite Image in Real-World Scenes
Shape-Adaptive Selection and Measurement for Oriented Object Detection
H^2-MIL: Exploring Hierarchical Representation with Heterogeneous Multiple Instance Learning for Whole Slide Image Analysis
Elastic-Link for Binarized Neural Networks
FInfer: Frame Inference-Based Deepfake Detection for High-Visual-Quality Videos
Bi-volution: A Static and Dynamic Coupled Filter
AFDetV2: Rethinking the Necessity of the Second Stage for Object Detection from Point Clouds
Divide-and-Regroup Clustering for Domain Adaptive Person Re-identification
CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes
Deconfounded Visual Grounding
Unsupervised Underwater Image Restoration: From a Homology Perspective
Playing Lottery Tickets with Vision and Language
Feature Distillation Interaction Weighting Network for Lightweight Image Super-resolution
Weakly-Supervised Salient Object Detection Using Point Supervision
Latent Space Explanation by Intervention
Lifelong Person Re-identification by Pseudo Task Knowledge Preservation
Adversarial Robustness in Multi-Task Learning: Promises and Illusions
Deep Confidence Guided Distance for 3D Partial Shape Registration
Predicting Physical World Destinations for Commands Given to Self-Driving Cars
Towards Light-Weight and Real-Time Line Segment Detection
Exploiting Fine-Grained Face Forgery Clues via Progressive Enhancement Learning
Delving into the Local: Dynamic Inconsistency Learning for DeepFake Video Detection
Assessing a Single Image in Reference-Guided Image Synthesis
Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-Supervised Action Recognition
Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition
Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with Attentive Feature Alignment
Delving into Probabilistic Uncertainty for Unsupervised Domain Adaptive Person Re-identification
Laneformer: Object-Aware Row-Column Transformers for Lane Detection
Modify Self-Attention via Skeleton Decomposition for Effective Point Cloud Transformer
Generalizable Person Re-identification via Self-Supervised Batch Norm Test-Time Adaption
Style-Guided and Disentangled Representation for Robust Image-to-Image Translation
Denoised Maximum Classifier Discrepancy for Source-Free Unsupervised Domain Adaptation
Model-Based Image Signal Processors via Learnable Dictionaries
MMA: Multi-Camera Based Global Motion Averaging
GenCo: Generative Co-training for Generative Adversarial Networks with Limited Data
Unbiased IoU for Spherical Image Object Detection
InsCLR: Improving Instance Retrieval with Self-Supervision
Spatio-Temporal Recurrent Networks for Event-Based Optical Flow Estimation
Construct Effective Geometry Aware Feature Pyramid Network for Multi-Scale Object Detection
Complementary Attention Gated Network for Pedestrian Trajectory Prediction
SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition
Backdoor Attacks on the DNN Interpretation System
Learning to Learn Transferable Attack
Perceptual Quality Assessment of Omnidirectional Images
PatchUp: A Feature-Space Block-Level Regularization Technique for Convolutional Neural Networks
DuMLP-Pin: A Dual-MLP-Dot-Product Permutation-Invariant Network for Set Feature Extraction
Attention-Aligned Transformer for Image Captioning
Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers
OctAttention: Octree-Based Large-Scale Contexts Model for Point Cloud Compression
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents
Text Gestalt: Stroke-Aware Scene Text Image Super-resolution
Towards High-Fidelity Face Self-Occlusion Recovery via Multi-View Residual-Based GAN Inversion
ProgressiveMotionSeg: Mutually Reinforced Framework for Event-Based Motion Segmentation
Attacking Video Recognition Models with Bullet-Screen Comments
VITA: A Multi-Source Vicinal Transfer Augmentation Method for Out-of-Distribution Generalization
TransZero: Attribute-Guided Transformer for Zero-Shot Learning
Structured Semantic Transfer for Multi-Label Recognition with Partial Labels
SJDL-Vehicle: Semi-supervised Joint Defogging Learning for Foggy Vehicle Re-identification
Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification
Guide Local Feature Matching by Overlap Estimation
Causal Intervention for Subject-Deconfounded Facial Action Unit Recognition
Deep One-Class Classification via Interpolated Gaussian Descriptor
Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization
DeTarNet: Decoupling Translation and Rotation by Siamese Network for Point Cloud Registration
LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
Efficient Virtual View Selection for 3D Hand Pose Estimation
Pose Adaptive Dual Mixup for Few-Shot Single-View 3D Reconstruction
PureGaze: Purifying Gaze Feature for Generalizable Gaze Estimation
(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Event-Image Fusion Stereo Using Cross-Modality Feature Propagation
Towards End-to-End Image Compression and Analysis with Transformers
Handwritten Mathematical Expression Recognition via Attention Aggregation Based Bi-directional Mutual Learning
ADD: Frequency Attention and Multi-View Based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images
LUNA: Localizing Unfamiliarity Near Acquaintance for Open-Set Long-Tailed Recognition
Prior Gradient Mask Guided Pruning-Aware Fine-Tuning
Context-Aware Transfer Attacks for Object Detection
OoDHDR-Codec: Out-of-Distribution Generalization for HDR Image Compression
Visual Consensus Modeling for Video-Text Retrieval
Proximal PanNet: A Model-Based Deep Network for Pansharpening
CF-DETR: Coarse-to-Fine Transformers for End-to-End Object Detection
A Random CNN Sees Objects: One Inductive Bias of CNN and Its Applications
Texture Generation Using Dual-Domain Feature Flow with Multi-View Hallucinations
Resistance Training Using Prior Bias: Toward Unbiased Scene Graph Generation
SASA: Semantics-Augmented Set Abstraction for Point-Based 3D Object Detection
Comprehensive Regularization in a Bi-directional Predictive Network for Video Anomaly Detection
Keypoint Message Passing for Video-Based Person Re-identification
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Geometry-Contrastive Transformer for Generalized 3D Pose Transfer
Explore Inter-contrast between Videos via Composition for Weakly Supervised Temporal Sentence Grounding
Adaptive Image-to-Video Scene Graph Generation via Knowledge Reasoning and Adversarial Learning
Joint Human Pose Estimation and Instance Segmentation with PosePlusSeg
Logic Rule Guided Attribution with Dynamic Ablation
Neural Marionette: Unsupervised Learning of Motion Skeleton and Latent Dynamics from Volumetric Video
Deformable Part Region Learning for Object Detection