Vol. 36 No. 2: AAAI-22 Technical Tracks 2

Thirty-Sixth AAAI Conference on Artificial Intelligence
Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence
The Twelveth Symposium on Educational Advances in Artificial Intelligence
Sponsored by the Association for the Advancement of Artificial Intelligence
February 22–March 1, 2022, held virtually.
Published by AAAI Press, Palo Alto, California USA
Copyright © 2022, Association for the Advancement of Artificial Intelligence
1900 Embarcadero Road, Suite 101, Palo Alto, California 94303
All Rights Reserved
ISSN 2374-3468 (Online)
ISSN 2159-5399 (Print)
ISBN-10: 1-57735-876-7 (11 issue set)
ISBN-13: 978-1-57735-876-3 (11 issue set)
The Thirty-Sixth AAAI Conference on Artificial Intelligence was held virtually from February 22-March 1, 2022. The conference program cochairs were Vasant Honavar (Pennsylvania State University, USA) and Matthijs Spaan (Delft University of Technology, Netherlands).
The AAAI-22 program consisted of a diverse technical track, student abstracts, poster sessions, invited speakers, tutorials, workshops, and exhibit and competition programs. Additionally, the program included a special track on AI for Social Impact, recognizing that high-quality research on social impact domains often leads to papers that differ from traditional AAAI submissions along multiple dimensions. The conference was colocated with the Thirty-Fourth Innovative Applications of Artificial Intelligence Conference (cochaired by Mark Boddy, Adventium Labs, USA, and Meinolf Sellmann, Shopify, USA). The IAAI conference traditionally consists of case studies of deployed applications with measurable benefits whose value depends on the use of AI technology, as well as emerging applications, which discuss efforts to apply AI tools, techniques, or methods to real world problems. The IAAI papers are included in these proceedings. Also included are the papers from the Twelfth AAAI Symposium on Educational Advances in Artificial Intelligence (cochaired by Michael Guerzhoy, University of Toronto, Canada, and Marion Neumann, Washington University in St. Louis, USA). The EAAI conference invites a broad range of papers on teaching AI and teaching with AI framed as research papers or as experience reports.
The proceedings have been published in 11 consecutive issues. This issue (volume 36 no. 2) consists of 1221 pages and one track:
AAAI Technical Track on Computer Vision II
AAAI Technical Track on Computer Vision II
Amplitude Spectrum Transformation for Open Compound Domain Adaptive Semantic Segmentation
Jogendra Nath Kundu, Akshay R Kulkarni, Suvaansh Bhambri, Varun Jampani, Venkatesh Babu Radhakrishnan1220-1227Siamese Network with Interactive Transformer for Video Object Segmentation
Meng Lan, Jing Zhang, Fengxiang He, Lefei Zhang1228-1236Iteratively Selecting an Easy Reference Frame Makes Unsupervised Video Object Segmentation Easier
Youngjo Lee, Hongje Seong, Euntai Kim1245-1253SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation
Bing Li, Cheng Zheng, Silvio Giancola, Bernard Ghanem1254-1262Shrinking Temporal Attention in Transformers for Video Action Recognition
Bonan Li, Pengfei Xiong, Congying Han, Tiande Guo1263-1271DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer
Buyu Li, Yongchi Zhao, Shi Zhelun, Lu Sheng1272-1279Interpretable Generative Adversarial Networks
Chao Li, Kelu Yao, Jin Wang, Boyu Diao, Yongjun Xu, Quanshi Zhang1280-1288Cross-Modal Object Tracking: Modality-Aware Representations and a Unified Benchmark
Chenglong Li, Tianhao Zhu, Lei Liu, Xiaonan Si, Zilin Fan, Sulan Zhai1289-1296You Only Infer Once: Cross-Modal Meta-Transfer for Referring Video Object Segmentation
Dezhuang Li, Ruoqi Li, Lijun Wang, Yifan Wang, Jinqing Qi, Lu Zhang, Ting Liu, Qingquan Xu, Huchuan Lu1297-1305Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-Guided Feature Imitation
Gang Li, Xiang Li, Yujie Wang, Shanshan Zhang, Yichao Wu, Ding Liang1306-1313Rethinking Pseudo Labels for Semi-supervised Object Detection
Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis1314-1322Action-Aware Embedding Enhancement for Image-Text Retrieval
Jiangtong Li, Li Niu, Liqing Zhang1323-1331Retinomorphic Object Detection in Asynchronous Visual Streams
Jianing Li, Xiao Wang, Lin Zhu, Jia Li, Tiejun Huang, Yonghong Tian1332-1340Learning from Weakly-Labeled Web Videos via Exploring Sub-concepts
Kunpeng Li, Zizhao Zhang, Guanhang Wu, Xuehan Xiong, Chen-Yu Lee, Zhichao Lu, Yun Fu, Tomas Pfister1341-1349Learning Universal Adversarial Perturbation by Adversarial Example
Maosen Li, Yanhua Yang, Kun Wei, Xu Yang, Heng Huang1350-1358Neighborhood-Adaptive Structure Augmented Metric Learning
Pandeng Li, Yan Li, Hongtao Xie, Lei Zhang1367-1375EditVAE: Unsupervised Parts-Aware Controllable 3D Point Cloud Shape Generation
Shidi Li, Miaomiao Liu, Christian Walder1386-1394Self-Training Multi-Sequence Learning with Transformer for Weakly Supervised Video Anomaly Detection
Shuo Li, Fang Liu, Licheng Jiao1395-1403TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition
Shuyuan Li, Huabin Liu, Rui Qian, Yuxi Li, John See, Mengjuan Fei, Xiaoyuan Yu, Weiyao Lin1404-1411Best-Buddy GANs for Highly Detailed Image Super-resolution
Wenbo Li, Kun Zhou, Lu Qi, Liying Lu, Jiangbo Lu1412-1420SCAN: Cross Domain Object Detection with Semantic Conditioned Adaptation
Wuyang Li, Xinyu Liu, Xiwen Yao, Yixuan Yuan1421-1428Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation
Xiang Li, Jinglu Wang, Xiao Li, Yan Lu1429-1437Close the Loop: A Unified Bottom-Up and Top-Down Paradigm for Joint Image Deraining and Segmentation
Yi Li, Yi Chang, Changfeng Yu, Luxin Yan1438-1446Uncertainty Estimation via Response Scaling for Pseudo-Mask Noise Mitigation in Weakly-Supervised Semantic Segmentation
Yi Li, Yiqun Duan, Zhanghui Kuang, Yimin Chen, Wayne Zhang, Xiaomeng Li1447-1455Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Yidi Li, Hong Liu, Hao Tang1456-1463Defending against Model Stealing via Verifying Embedded External Features
Yiming Li, Linghui Zhu, Xiaojun Jia, Yong Jiang, Shu-Tao Xia, Xiaochun Cao1464-1472Towards an Effective Orthogonal Dictionary Convolution Strategy
Yishi Li, Kunran Xu, Rui Lai, Lin Gu1473-1481ELMA: Energy-Based Learning for Multi-Agent Activity Forecasting
Yuke Li, Pin Wang, Lixiong Chen, Zheng Wang, Ching-Yao Chan1482-1490Equal Bits: Enforcing Equally Distributed Binary Network Weights
Yunqiang Li, Silvia-Laura Pintea, Jan C van Gemert1491-1499SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-training for Spatial-Aware Visual Representations
Zhenyu Li, Zehui Chen, Ang Li, Liangji Fang, Qinhong Jiang, Xianming Liu, Junjun Jiang, Bolei Zhou, Hang Zhao1500-1508Improving Human-Object Interaction Detection via Phrase Learning and Label Composition
Zhimin Li, Cheng Zou, Yu Zhao, Boxun Li, Sheng Zhong1509-1517Rethinking the Optimization of Average Precision: Only Penalizing Negative Instances before Positive Ones Is Enough
Zhuo Li, Weiqing Min, Jiajun Song, Yaohui Zhu, Liping Kang, Xiaoming Wei, Xiaolin Wei, Shuqiang Jiang1518-1526Reliability Exploration with Self-Ensemble Learning for Domain Adaptive Person Re-identification
Zongyi Li, Yuxuan Shi, Hefei Ling, Jiazhong Chen, Qian Wang, Fengfan Zhou1527-1535Deconfounding Physical Dynamics with Global Causal Relation and Confounder Transmission for Counterfactual Prediction
Zongzhao Li, Xiangyu Zhu, Zhen Lei, Zhaoxiang Zhang1536-1545One More Check: Making “Fake Background” Be Tracked Again
Chao Liang, Zhipeng Zhang, Xue Zhou, Bing Li, Weiming Hu1546-1554Semantically Contrastive Learning for Low-Light Image Enhancement
Dong Liang, Ling Li, Mingqiang Wei, Shuo Yang, Liyan Zhang, Wenhan Yang, Yun Du, Huiyu Zhou1555-1563Self-Supervised Spatiotemporal Representation Learning by Exploiting Video Continuity
Hanwen Liang, Niamul Quader, Zhixiang Chi, Lizhe Chen, Peng Dai, Juwei Lu, Yang Wang1564-1573Inharmonious Region Localization by Magnifying Domain Discrepancy
Jing Liang, Li Niu, Penghao Wu, Fengjun Guo, Teng Long1574-1582Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
Xiwen Liang, Fengda Zhu, Yi Zhu, Bingqian Lin, Bing Wang, Xiaodan Liang1592-1600Interventional Multi-Instance Learning with Deconfounded Instance-Level Prediction
Tiancheng Lin, Hongteng Xu, Canqian Yang, Yi Xu1601-1609A Causal Debiasing Framework for Unsupervised Salient Object Detection
Xiangru Lin, Ziyi Wu, Guanqi Chen, Guanbin Li, Yizhou Yu1610-1619A Causal Inference Look at Unsupervised Video Anomaly Detection
Xiangru Lin, Yuyang Chen, Guanbin Li, Yizhou Yu1620-1629Unpaired Multi-Domain Stain Transfer for Kidney Histopathological Images
Yiyang Lin, Bowei Zeng, Yifeng Wang, Yang Chen, Zijie Fang, Jian Zhang, Xiangyang Ji, Haoqian Wang, Yongbing Zhang1630-1637Dynamic Spatial Propagation Network for Depth Completion
Yuankai Lin, Tao Cheng, Qi Zhong, Wending Zhou, Hua Yang1638-1646Local Similarity Pattern and Cost Self-Reassembling for Deep Stereo Matching Networks
Biyang Liu, Huimin Yu, Yangqi Long1647-1655FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition
Chih-Ting Liu, Chien-Yi Wang, Shao-Yi Chien, Shang-Hong Lai1656-1664Memory-Guided Semantic Learning Network for Temporal Sentence Grounding
Daizong Liu, Xiaoye Qu, Xing Di, Yu Cheng, Zichuan Xu, Pan Zhou1665-1673Exploring Motion and Appearance Information for Temporal Sentence Grounding
Daizong Liu, Xiaoye Qu, Pan Zhou, Yang Liu1674-1682Unsupervised Temporal Video Grounding with Deep Semantic Clustering
Daizong Liu, Xiaoye Qu, Yinzhen Wang, Xing Di, Kai Zou, Yu Cheng, Zichuan Xu, Pan Zhou1683-1691SpikeConverter: An Efficient Conversion Framework Zipping the Gap between Artificial Neural Networks and Spiking Neural Networks
Fangxin Liu, Wenbo Zhao, Yongbiao Chen, Zongwu Wang, Li Jiang1692-1701Perceiving Stroke-Semantic Context: Hierarchical Contrastive Learning for Robust Scene Text Recognition
Hao Liu, Bin Wang, Zhimin Bao, Mobai Xue, Sheng Kang, Deqiang Jiang, Yinsong Liu, Bo Ren1702-1710AnchorFace: Boosting TAR@FAR for Practical Face Recognition
Jiaheng Liu, Haoyu Qin, Yichao Wu, Ding Liang1711-1719Memory-Based Jitter: Improving Visual Recognition on Long-Tailed Data with Diversity in Memory
Jialun Liu, Wenhui Li, Yifan Sun1720-1728Debiased Batch Normalization via Gaussian Process for Generalizable Person Re-identification
Jiawei Liu, Zhipeng Huang, Liang Li, Kecheng Zheng, Zheng-Jun Zha1729-1737Parallel and High-Fidelity Text-to-Lip Generation
Jinglin Liu, Zhiying Zhu, Yi Ren, Wencan Huang, Baoxing Huai, Nicholas Yuan, Zhou Zhao1738-1746SiamTrans: Zero-Shot Multi-Frame Image Restoration with Pre-trained Siamese Transformers
Lin Liu, Shanxin Yuan, Jianzhuang Liu, Xin Guo, Youliang Yan, Qi Tian1747-1755Single-Domain Generalization in Medical Image Segmentation via Test-Time Adaptation from Shape Dictionary
Quande Liu, Cheng Chen, Qi Dou, Pheng-Ann Heng1756-1764Learning to Predict 3D Lane Shape and Camera Pose from a Single Image via Geometry Constraints
Ruijin Liu, Dapeng Chen, Tie Liu, Zhiliang Xiong, Zejian Yuan1765-1772OVIS: Open-Vocabulary Visual Instance Search via Visual-Semantic Aligned Representation Learning
Sheng Liu, Kevin Lin, Lijuan Wang, Junsong Yuan, Zicheng Liu1773-1781Feature Generation and Hypothesis Verification for Reliable Face Anti-spoofing
Shice Liu, Shitao Lu, Hongyi Xu, Jing Yang, Shouhong Ding, Lizhuang Ma1782-1791Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions
Wenyu Liu, Gaofeng Ren, Runsheng Yu, Shi Guo, Jianke Zhu, Lei Zhang1792-1800Visual Sound Localization in the Wild by Cross-Modal Interference Erasing
Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, Xiaowei Zhou1801-1809Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection
Xianpeng Liu, Nan Xue, Tianfu Wu1810-1818Highlighting Object Category Immunity for the Generalization of Human-Object Interaction Detection
Xinpeng Liu, Yong-Lu Li, Cewu Lu1819-1827DMN4: Few-Shot Learning via Discriminative Mutual Nearest Neighbor Neural Network
Yang Liu, Tu Zheng, Jie Song, Deng Cai, Xiaofei He1828-1836Multi-Knowledge Aggregation and Transfer for Semantic Segmentation
Yuang Liu, Wei Zhang, Jun Wang1837-1845Unsupervised Coherent Video Cartoonization with Perceptual Motion Consistency
Zhenhuan Liu, Liang Li, Huajie Jiang, Xin Jin, Dandan Tu, Shuhui Wang, Zheng-Jun Zha1846-1853Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing
Zhili LIU, Jianhua Han, Lanqing Hong, Hang Xu, Kai Chen, Chunjing Xu, Zhenguo Li1854-1862Pose Guided Image Generation from Misaligned Sources via Residual Flow Based Correction
Jiawei Lu, He Wang, Tianjia Shao, Yin Yang, Kun Zhou1863-1871PMAL: Open Set Recognition via Robust Prototype Mining
Jing Lu, Yunlu Xu, Hao Li, Zhanzhan Cheng, Yi Niu1872-1880Barely-Supervised Learning: Semi-supervised Learning with Very Few Labeled Images
Thomas Lucas, Philippe Weinzaepfel, Gregory Rogez1881-1889Learning Optical Flow with Adaptive Graph Reasoning
Ao Luo, Fan Yang, Kunming Luo, Xin Li, Haoqiang Fan, Shuaicheng Liu1890-1898A Fusion-Denoising Attack on InstaHide with Data Augmentation
Xinjian Luo, Xiaokui Xiao, Yuncheng Wu, Juncheng Liu, Beng Chin Ooi1899-1907Deep Neural Networks Learn Meta-Structures from Noisy Labels in Semantic Segmentation
Yaoru Luo, Guole Liu, Yuanhao Guo, Ge Yang1908-1916Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration
Ziwei Luo, Jing Hu, Xin Wang, Shu Hu, Bin Kong, Youbing Yin, Qi Song, Xi Wu, Siwei Lyu1917-1925Adaptive Poincaré Point to Set Distance for Few-Shot Classification
Rongkai Ma, Pengfei Fang, Tom Drummond, Mehrtash Harandi1926-1934Generative Adaptive Convolutions for Real-World Noisy Image Denoising
Ruijun Ma, Shuyi Li, Bob Zhang, Zhengming Li1935-1943REMOTE: Reinforced Motion Transformation Network for Semi-supervised 2D Pose Estimation in Videos
Xianzheng Ma, Hossein Rahmani, Zhipeng Fan, Bin Yang, Jun Chen, Jun Liu1944-1952Learning from the Target: Dual Prototype Network for Few Shot Semantic Segmentation
Binjie Mao, Xinbang Zhang, Lingfeng Wang, Qian Zhang, Shiming Xiang, Chunhong Pan1953-1961MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Safa C. Medin, Bernhard Egger, Anoop Cherian, Ye Wang, Joshua B. Tenenbaum, Xiaoming Liu, Tim K. Marks1962-1971Towards Bridging Sample Complexity and Model Capacity
Shibin Mei, Chenglong Zhao, Shengchao Yuan, Bingbing Ni1972-1980Towards Accurate Facial Motion Retargeting with Identity-Consistent and Expression-Exclusive Constraints
Langyuan Mo, Haokun Li, Chaoyang Zou, Yubing Zhang, Ming Yang, Yihong Yang, Mingkui Tan1981-1989Can Vision Transformers Learn without Natural Images?
Kodai Nakashima, Hirokatsu Kataoka, Asato Matsumoto, Kenji Iwata, Nakamasa Inoue, Yutaka Satoh1990-1998Restorable Image Operators with Quasi-Invertible Networks
Hao Ouyang, Tengfei Wang, Qifeng Chen2008-2016TEACh: Task-Driven Embodied Agents That Chat
Aishwarya Padmakumar, Jesse Thomason, Ayush Shrivastava, Patrick Lange, Anjali Narayan-Chen, Spandana Gella, Robinson Piramuthu, Gokhan Tur, Dilek Hakkani-Tur2017-2025Label-Efficient Hybrid-Supervised Learning for Medical Image Segmentation
Junwen Pan, Qi Bi, Yanzhan Yang, Pengfei Zhu, Cheng Bian2026-2034Less Is More: Pay Less Attention in Vision Transformers
Zizheng Pan, Bohan Zhuang, Haoyu He, Jing Liu, Jianfei Cai2035-2043Unsupervised Representation for Semantic Segmentation by Implicit Cycle-Attention Contrastive Learning
Bo Pang, Yizhuo Li, Yifan Zhang, Gao Peng, Jiajun Tang, Kaiwen Zha, Jiefeng Li, Cewu Lu2044-2052Graph-Based Point Tracker for 3D Object Tracking in Point Clouds
Minseong Park, Hongje Seong, Wonje Jang, Euntai Kim2053-2061SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro2062-2070Self-Supervised Category-Level 6D Object Pose Estimation with Deep Implicit Shape Representation
Wanli Peng, Jianhang Yan, Hongtao Wen, Yi Sun2082-2090Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels
Tao Pu, Tianshui Chen, Hefeng Wu, Liang Lin2091-2098ReX: An Efficient Approach to Reducing Memory Cost in Image Classification
Xuwei Qian, Renlong Hang, Qingshan Liu2099-2107CPRAL: Collaborative Panoptic-Regional Active Learning for Semantic Segmentation
Yu Qiao, Jincheng Zhu, Chengjiang Long, Zeyao Zhang, Yuxin Wang, Zhenjun Du, Xin Yang2108-2116Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation
Jie Qin, Jie Wu, Xuefeng Xiao, Lujun Li, Xingang Wang2117-2125TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework Using Self-Supervised Multi-Task Learning
Linhao Qu, Shaolei Liu, Manning Wang, Zhijian Song2126-2134Deep Implicit Statistical Shape Models for 3D Medical Image Delineation
Ashwin Raju, Shun Miao, Dakai Jin, Le Lu, Junzhou Huang, Adam P. Harrison2135-2143Decompose the Sounds and Pixels, Recompose the Events
Varshanth R. Rao, Md Ibrahim Khalil, Haoda Li, Peng Dai, Juwei Lu2144-2152Learning from Label Proportions with Prototypical Contrastive Clustering
Laura Elena Cué La Rosa, Dário Augusto Borges Oliveira2153-2161Beyond Learning Features: Training a Fully-Functional Classifier with ZERO Instance-Level Labels
Deepak Babu Sam, Abhinav Agarwalla, Venkatesh Babu Radhakrishnan2162-2170Reference-Guided Pseudo-Label Generation for Medical Semantic Segmentation
Constantin Marc Seibold, Simon Reiß, Jens Kleesiek, Rainer Stiefelhagen2171-2179Information-Theoretic Bias Reduction via Causal View of Spurious Correlation
Seonguk Seo, Joon-Young Lee, Bohyung Han2180-2188Improving Scene Graph Classification by Exploiting Knowledge from Texts
Sahand Sharifzadeh, Sina Moayed Baharlou, Martin Schmitt, Hinrich Schütze, Volker Tresp2189-2197Reliable Inlier Evaluation for Unsupervised Point Cloud Registration
Yaqi Shen, Le Hui, Haobo Jiang, Jin Xie, Jian Yang2198-2206Explainable Survival Analysis with Convolution-Involved Vision Transformer
Yifan Shen, Li Liu, Zhihao Tang, Zongyi Chen, Guixiang Ma, Jiyan Dong, Xi Zhang, Lin Yang, Qingfeng Zheng2207-2215Un-mix: Rethinking Image Mixtures for Unsupervised Visual Representation Learning
Zhiqiang Shen, Zechun Liu, Zhuang Liu, Marios Savvides, Trevor Darrell, Eric Xing2216-2224On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals
Haizhou Shi, Youcai Zhang, Siliang Tang, Wenjie Zhu, Yaqian Li, Yandong Guo, Yueting Zhuang2225-2234Social Interpretable Tree for Pedestrian Trajectory Prediction
Liushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Fang Zheng, Nanning Zheng, Gang Hua2235-2243P^3-Net: Part Mobility Parsing from Point Cloud Sequences via Learning Explicit Point Correspondence
Yahao Shi, Xinyu Cao, Feixiang Lu, Bin Zhou2244-2252Improving Zero-Shot Phrase Grounding via Reasoning on External Knowledge and Spatial Relations
Zhan Shi, Yilin Shen, Hongxia Jin, Xiaodan Zhu2253-2261Iterative Contrast-Classify for Semi-supervised Temporal Action Segmentation
Dipika Singhania, Rahul Rahaman, Angela Yao2262-2270JPV-Net: Joint Point-Voxel Representations for Accurate 3D Object Detection
Nan Song, Tianyuan Jiang, Jian Yao2271-2279Fully Attentional Network for Semantic Segmentation
Qi Song, Jie Li, Chenghong Li, Hao Guo, Rui Huang2280-2288Self-Supervised Object Localization with Joint Graph Partition
Yukun Su, Guosheng Lin, Yun Hao, Yiwen Cao, Wenjun Wang, Qingyao Wu2289-2297Correlation Field for Boosting 3D Object Detection in Structured Scenes
Jianhua Sun, Hao-Shu Fang, Xianghui Zhu, Jiefeng Li, Cewu Lu2298-2306Boost Supervised Pretraining for Visual Transfer Learning: Implications of Self-Supervised Contrastive Representation Learning
Jinghan Sun, Dong Wei, Kai Ma, Liansheng Wang, Yefeng Zheng2307-2315Dual Contrastive Learning for General Face Forgery Detection
Ke Sun, Taiping Yao, Shen Chen, Shouhong Ding, Jilin Li, Rongrong Ji2316-2324SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal
Zhaoyang Sun, Yaxiong Chen, Shengwu Xiong2325-2334Adversarial Bone Length Attack on Action Recognition
Nariki Tanaka, Hiroshi Kera, Kazuhiko Kawamoto2335-2343Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
Chuanxin Tang, Yucheng Zhao, Guangting Wang, Chong Luo, Wenxuan Xie, Wenjun Zeng2344-2351Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective
Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, Gang Zeng2352-2360Transfer Learning for Color Constancy via Statistic Perspective
Yuxiang Tang, Xuejing Kang, Chunxiao Li, Zhaowen Lin, Anlong Ming2361-2369TVT: Three-Way Vision Transformer through Multi-Modal Hypersphere Learning for Zero-Shot Sketch-Based Image Retrieval
Jialin Tian, Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen2370-2378GuidedMix-Net: Semi-supervised Semantic Segmentation by Using Labeled Images as Reference
Peng Tu, Yawen Huang, Feng Zheng, Zhenyu He, Liujuan Cao, Ling Shao2379-2387MTLDesc: Looking Wider to Describe Better
Changwei Wang, Rongtao Xu, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Bin Fan, Xiaopeng Zhang2388-2396Active Boundary Loss for Semantic Segmentation
Chi Wang, Yunke Zhang, Miaomiao Cui, Peiran Ren, Yin Yang, Xuansong Xie, Xian-Sheng Hua, Hujun Bao, Weiwei Xu2397-2405Online-Updated High-Order Collaborative Networks for Single Image Deraining
Cong Wang, Jinshan Pan, Xiao-Ming Wu2406-2413FCA: Learning a 3D Full-Coverage Vehicle Camouflage for Multi-View Physical Adversarial Attack
Donghua Wang, Tingsong Jiang, Jialiang Sun, Weien Zhou, Zhiqiang Gong, Xiaoya Zhang, Wen Yao, Xiaoqian Chen2414-2422When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism
Guangting Wang, Yucheng Zhao, Chuanxin Tang, Chong Luo, Wenjun Zeng2423-2430Self-Supervised Representation Learning Framework for Remote Physiological Measurement Using Spatiotemporal Augmentation Loss
Hao Wang, Euijoon Ahn, Jinman Kim2431-2439