Year
Month
(Preprint) Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain
Guangyao Chen 陈光耀 ¹, Peixi Peng 彭佩玺 ¹ ³, Li Ma 马力 ¹ ³, Jia Li 李甲 ² ³, Lin Du ⁴, Yonghong Tian 田永鸿 ¹ ³
¹ Department of Computer Science and Technology, Peking University
北京大学计算机科学技术系
² State Key Laboratory of Virtual Reality Technology and Systems, SCSE, Beihang University
北京航空航天大学 虚拟现实技术与系统国家重点实验室
³ Peng Cheng Laborotory
鹏城实验室
⁴ AI Application Research Center, Huawei
华为AI应用研究中心
arXiv, 2021-08-19
Abstract

Recently, the generalization behavior of Convolutional Neural Networks (CNN) is gradually transparent through explanation techniques with the frequency components decomposition. However, the importance of the phase spectrum of the image for a robust vision system is still ignored. In this paper, we notice that the CNN tends to converge at the local optimum which is closely related to the high-frequency components of the training images, while the amplitude spectrum is easily disturbed such as noises or common corruptions.

In contrast, more empirical studies found that humans rely on more phase components to achieve robust recognition. This observation leads to more explanations of the CNN's generalization behaviors in both robustness to common perturbations and out-of-distribution detection, and motivates a new perspective on data augmentation designed by re-combing the phase spectrum of the current image and the amplitude spectrum of the distracter image. That is, the generated samples force the CNN to pay more attention to the structured information from phase components and keep robust to the variation of the amplitude.

Experiments on several image datasets indicate that the proposed method achieves state-of-the-art performances on multiple generalizations and calibration tasks, including adaptability for common corruptions and surface variations, out-of-distribution detection, and adversarial attack.
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain_1
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain_2
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain_3
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain_4
  • Integrated photonic synapses, neurons, memristors, and neural networks for photonic neuromorphic computing
  • Shufei Han, Weihong Shen, Min Gu, Qiming Zhang
  • Opto-Electronic Technology
  • 2025-12-25
  • Photoacoustic spectroscopy and light-induced thermoelastic spectroscopy based on inverted-triangular lithium niobate tuning fork
  • Junjie Mu, Guowei Han, Runqiu Wang, Shunda Qiao, Ying He Yufei Ma
  • Opto-Electronic Science
  • 2025-12-25
  • Thin-film lithium niobate-based detector: recent advances and perspectives
  • Xiaoli Sun, Yuechen Jia, Feng Chen
  • Opto-Electronic Science
  • 2025-12-25
  • In-situ and ex-situ twisted bilayer liquid crystal computing platform for reconfigurable image processing
  • Kang Zeng, Yougang Ke, Zhangming Hong, Linzhou Zeng, Xinxing Zhou
  • Opto-Electronic Advances
  • 2025-12-25
  • Highly textured single-crystal-like perovskite films for large-area, high-performance photodiodes
  • Runkai Liu, Feng Li, Rongkun Zheng
  • Opto-Electronic Advances
  • 2025-12-25
  • Robust performance of PTQ10:DTY6 in halogen-free photovoltaics across deposition techniques and configurations for industrial scale-up
  • Atiq Ur Rahman, Tanner M. Melody, Sydney Pfleiger, Acacia Patterson, Andrea Reale, Brian A. Collins
  • Opto-Electronic Advances
  • 2025-12-25
  • Surpassing the diffraction limit in long-range laser engineering via cross-scale vectorial optical field manipulation: perspectives and outlooks
  • Yinghui Guo, Mingbo Pu, Yang Li, Mingfeng Xu, Xiangang Luo
  • Opto-Electronic Advances
  • 2025-12-25
  • Spatiotemporal multiplexed photonic reservoir computing: parallel prediction for the high-dimensional dynamics of complex semiconductor laser network
  • Tong Yang, Li-Yue Zhang, Song-Sui Li, Wei Pan, Xi-Hua Zou, Lian-Shan Yan
  • Opto-Electronic Advances
  • 2025-12-25
  • Filament based ionizing radiation sensing
  • Pengfei Qi, Haiyi Liu, Jiewei Guo, Nan Zhang, Lu Sun, Shishi Tao, Binpeng Shang, Lie Lin Weiwei Liu
  • Opto-Electronic Advances
  • 2025-12-25
  • Separation and identification of mixed signal for distributed acoustic sensor using deep learning
  • Huaxin Gu, Jingming Zhang, Xingwei Chen, Feihong Yu, Deyu Xu, Shuaiqi Liu, Weihao Lin, Xiaobing Shi, Zixing Huang, Xiongji Yang, Qingchang Hu, Liyang Shao
  • Opto-Electronic Advances
  • 2025-11-25
  • Scale-invariant 3D face recognition using computer-generated holograms and the Mellin transform
  • Yongwei Yao, Yaping Zhang, Huanrong He, Xianfeng David Gu, Daping Chu, Ting-Chung Poon
  • Opto-Electronic Advances
  • 2025-11-25
  • Partially coherent optical chip enables physical-layer public-key encryption
  • Bo Wu, Wenkai Zhang, Hailong Zhou, Jianji Dong, Yilun Wang, Xinliang Zhang
  • Opto-Electronic Advances
  • 2025-11-25



  • Blind Estimation of Sparse Simo Channels: Quadratic Vs. Linear Constraints                                A Non-Stationary Channel Model with Correlated NLoS/LoS States for ELAA-mMIMO
    About
    |
    Contact
    |
    Copyright © PubCard