Three-Dimensional Reconstruction Pre-Training as a Prior to Improve Robustness to Adversarial Attacks and Spurious Correlation.

Yamada, Yutaro; Zhang, Fred Weiying; Kluger, Yuval; Yildirim, Ilker

Affiliation

Yamada Y; Department of Statistics & Data Science, Yale University, New Haven, CT 06511, USA.
Zhang FW; Department of Statistics & Data Science, Yale University, New Haven, CT 06511, USA.
Kluger Y; Department of Pathology, Yale University School of Medicine, New Haven, CT 06511, USA.
Yildirim I; Department of Applied Mathematics, Yale University, New Haven, CT 06511, USA.

Entropy (Basel); 26(3), 2024 Mar 14.

Article in En | MEDLINE | ID: mdl-38539769

ABSTRACT

Ensuring robustness of image classifiers against adversarial attacks and spurious correlation has been challenging. One of the most effective methods for adversarial robustness is a type of data augmentation that uses adversarial examples during training. Here, inspired by computational models of human vision, we explore a synthesis that combines this approach with a structured prior over image formation: the 3D geometry of objects and how it projects to images. We combine adversarial training with a weight initialization that implicitly encodes such a prior about 3D objects via 3D reconstruction pre-training. We evaluate our approach using two different datasets and compare it to alternative pre-training protocols that do not encode a prior about 3D shape. To systematically explore the effect of 3D pre-training, we introduce a novel dataset called Geon3D, which consists of simple shapes that nevertheless capture variation in multiple distinct dimensions of geometry. We find that while 3D reconstruction pre-training does not improve robustness in the simplest dataset setting we consider (Geon3D on a clean background), it improves upon adversarial training under more realistic conditions (Geon3D with textured backgrounds and ShapeNet). We also find that 3D pre-training coupled with adversarial training improves robustness to spurious correlations between shape and background textures. Furthermore, we show that 3D-based pre-training outperforms 2D-based pre-training on ShapeNet. We hope that these results encourage further investigation of the benefits of structured, 3D-based models of vision for adversarial robustness.
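
As an illustration only, the general recipe the abstract describes (adversarial training started from a given weight initialization) can be sketched on a toy problem. Everything below is an assumption made for the example and is not the authors' implementation: the model is a logistic regression rather than a deep network, the attack is a one-step sign-gradient (FGSM-style) perturbation, and `w_init` is a hypothetical stand-in for weights obtained from a pre-training stage such as 3D reconstruction.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm(x, y, w, b, eps):
    """One-step L-infinity attack: shift each input by eps along the sign
    of the cross-entropy gradient with respect to the input."""
    p = sigmoid(x @ w + b)
    grad_x = (p - y)[:, None] * w[None, :]  # d(loss)/dx for logistic regression
    return x + eps * np.sign(grad_x)

def adversarial_train(x, y, w_init, b_init=0.0, eps=0.1, lr=0.1, steps=200):
    """Gradient descent on adversarial examples regenerated at every step.
    w_init is a hypothetical placeholder for pre-trained weights."""
    w, b = np.array(w_init, dtype=float), float(b_init)  # copy, do not mutate input
    for _ in range(steps):
        x_adv = fgsm(x, y, w, b, eps)       # craft attacks against current weights
        p = sigmoid(x_adv @ w + b)
        w -= lr * (x_adv.T @ (p - y)) / len(y)
        b -= lr * np.mean(p - y)
    return w, b
```

In this sketch the only role of the initialization is to set the starting point of the adversarial optimization; the abstract's comparison between pre-training protocols corresponds to running the same loop from different values of `w_init`.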

Key words

3D vision; adversarial examples; robust vision

Full text: 1; Collection: 01-internacional; Database: MEDLINE; Language: En; Journal: Entropy (Basel); Year: 2024; Document type: Article; Affiliation country: United States; Country of publication: Switzerland