Towards Achieving Adversarial Robustness Beyond Perceptual Limits
Sravanti Addepalli*, Samyak Jain*, Gaurang Sriramanan, Shivangi Khare, R. Venkatesh Babu
Video Analytics Lab, Indian Institute of Science, Bangalore, India
OAAT: varying-ε training schedule with the TRADES-AWP loss (CE loss for the attack)
Standard adversarial training up to the perceptual limit of ε = 12/255
From ε = 12/255 to 16/255, training on Oracle-Sensitive and Oracle-Invariant samples in alternate iterations:
Sensitivity towards Oracle-Sensitive attacks, which are generated by maximizing the CE loss
Robustness to Oracle-Invariant attacks, which are generated by minimizing the LPIPS distance (computed using the model being trained) between the clean and perturbed images
A training-loop sketch of this schedule is given below.
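The following is a minimal PyTorch-style sketch of the alternating schedule, offered only as an illustration: it assumes the ℓ∞ threat model implied by the ε = k/255 bounds, substitutes plain cross-entropy on the ground-truth label for the TRADES-AWP objective and any oracle-alignment details on the poster, and uses placeholder objects (model, loader, optimizer, lpips_fn) rather than the authors' implementation.

```python
# Illustrative sketch only; not the authors' code. The attack is L-inf PGD,
# and the training loss is plain CE, standing in for TRADES-AWP.
import torch
import torch.nn.functional as F

def linf_pgd(model, x, y, eps, steps=10, step_size=2/255, lpips_fn=None, lam=1.0):
    """PGD within the L-inf ball of radius eps.
    With lpips_fn=None the attack maximizes the CE loss (Oracle-Sensitive);
    otherwise it additionally minimizes the LPIPS distance to the clean image
    (Oracle-Invariant), weighted by lam."""
    delta = torch.empty_like(x).uniform_(-eps, eps).requires_grad_(True)
    for _ in range(steps):
        adv = (x + delta).clamp(0, 1)
        loss = F.cross_entropy(model(adv), y)
        if lpips_fn is not None:
            loss = loss - lam * lpips_fn(x, adv).mean()
        grad, = torch.autograd.grad(loss, delta)
        delta = (delta + step_size * grad.sign()).clamp(-eps, eps)
        delta = delta.detach().requires_grad_(True)
    return (x + delta).clamp(0, 1).detach()

def train_one_epoch(model, loader, optimizer, eps, lpips_fn):
    """eps follows the varying-epsilon schedule across epochs: standard
    adversarial training while eps <= 12/255, then the alternating phase
    between 12/255 and 16/255."""
    model.train()
    for it, (x, y) in enumerate(loader):
        if eps <= 12/255 or it % 2 == 0:
            # Standard AT phase, and Oracle-Sensitive iterations later on:
            # attack by maximizing the CE loss.
            x_adv = linf_pgd(model, x, y, eps)
        else:
            # Oracle-Invariant iterations: the attack also minimizes LPIPS to
            # the clean image, computed with the model being trained.
            x_adv = linf_pgd(model, x, y, eps, lpips_fn=lpips_fn)
        loss = F.cross_entropy(model(x_adv), y)  # simplified training objective
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```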
Preliminaries: Threat Model and Terminology
Threat model considered: moderate-ε norm bound of 16/255
The human prediction is referred to as the Oracle Label
Types of perturbations within the defined threat model:
Oracle-Invariant images: do not change the Oracle prediction; e.g., adversarial examples generated from Normally Trained models, or adversarial examples at low perturbation bounds (ε = 8/255)
Oracle-Sensitive images: flip the Oracle prediction; e.g., adversarial examples generated from Adversarially Trained models at large perturbation bounds (ε = 32/255)
[Figure: clean images with their labels (Aero, Dog, Auto, Bird, Frog, Horse, Cat, Ship, Truck, Deer), adversarial perturbations generated from a Normally Trained (NT) model at ε = 16/255 and ε = 32/255, and the resulting Oracle-Invariant adversarial examples with NT model predictions (Bird, Cat, Bird, Cat, Bird, Bird, Bird, Frog, Bird, Bird).]
[Figure: clean images with their labels (Aero, Dog, Auto, Bird, Frog, Horse, Cat, Ship, Truck, Deer), adversarial perturbations generated from an Adversarially Trained (AT) model at ε = 8/255, 16/255 and 32/255, and the resulting Oracle-Sensitive adversarial examples with AT model predictions (Deer, Auto, Horse, Dog, Auto, Frog, Frog, Truck, Horse, Truck).]
Goals and Evaluation Metrics
Robustness against all attacks within ε = 8/255, evaluated with AutoAttack (AA) and Guided Adversarial Margin Attack (GAMA) PGD-100
Robustness to Oracle-Invariant samples within ε = 16/255, evaluated with gradient-free attacks
Robustness-accuracy trade-off
An evaluation sketch follows the figure below.
[Figure: clean images with their labels (Aero, Dog, Auto, Bird, Frog, Horse, Cat, Ship, Truck, Deer) and adversarial examples generated using Square Attack, with AT model predictions (Horse, Ship, Horse, Dog, Frog, Frog, Frog, Ship, Truck, Truck).]
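As a hedged illustration of these metrics, the sketch below uses the public autoattack package (whose standard suite includes the gradient-free Square Attack). Here model, x_test and y_test are placeholders for the trained classifier and a test batch in [0, 1]; GAMA PGD-100 comes from a separate implementation and is omitted; and running Square Attack alone for the ε = 16/255 gradient-free evaluation is an assumption consistent with the figure above, not a statement of the authors' exact protocol.

```python
# Sketch of the two robustness evaluations, using https://github.com/fra31/auto-attack.
import torch
from autoattack import AutoAttack

def evaluate_robustness(model, x_test, y_test, batch_size=128):
    model.eval()

    # (1) Worst-case robustness within eps = 8/255: standard AutoAttack suite.
    aa = AutoAttack(model, norm='Linf', eps=8/255, version='standard')
    x_adv_8 = aa.run_standard_evaluation(x_test, y_test, bs=batch_size)

    # (2) Oracle-Invariant robustness within eps = 16/255: gradient-free
    #     evaluation using only the Square Attack component.
    sq = AutoAttack(model, norm='Linf', eps=16/255, version='custom',
                    attacks_to_run=['square'])
    x_adv_16 = sq.run_standard_evaluation(x_test, y_test, bs=batch_size)

    # Clean vs. robust accuracy, reflecting the robustness-accuracy trade-off.
    with torch.no_grad():
        clean_acc = (model(x_test).argmax(1) == y_test).float().mean().item()
        acc_8 = (model(x_adv_8).argmax(1) == y_test).float().mean().item()
        acc_16 = (model(x_adv_16).argmax(1) == y_test).float().mean().item()
    return clean_acc, acc_8, acc_16
```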
Scaling existing AT methods to larger ε bounds
Significant drop (16%) in Clean Accuracy when compared to Adversarial Training at ε = 8/255
[Figure: an example where a large-ε perturbation shifts the image's Oracle label from Frog towards Deer (Deer/Frog?).]
The LPIPS metric computed using an AT model distinguishes well between Oracle-Sensitive examples (attacks from a PGD-AT model) and Oracle-Invariant examples (attacks from a Normally Trained model).
[Figure: clean image and adversarial examples generated with varying λ = 0, 1, 2 (the weight on the LPIPS term in the Oracle-Invariant attack) at ε = 16/255 and ε = 24/255, with model predictions (Cat, Bird, Bird, Dog, Dog, Cat, Cat, Bird, Dog, Cat).]
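To make the claim above concrete, here is a minimal sketch of an LPIPS-style distance computed from the intermediate features of a supplied network (e.g., an adversarially trained model, or the model being trained). It omits LPIPS's learned per-layer linear weights and leaves the layer choice to the caller, so it is an approximation rather than the exact metric used on the poster; its per-sample output is compatible with the lpips_fn placeholder in the training sketch earlier.

```python
# Approximate, assumption-laden sketch of an LPIPS-style perceptual distance.
import torch
import torch.nn as nn

class FeatureLPIPS(nn.Module):
    def __init__(self, backbone: nn.Module, layer_names):
        super().__init__()
        self.backbone = backbone          # e.g., an AT model, or the live model
        self.layer_names = list(layer_names)
        self.feats = {}
        # Capture the outputs of the chosen layers via forward hooks.
        for name, module in backbone.named_modules():
            if name in self.layer_names:
                module.register_forward_hook(
                    lambda m, inp, out, key=name: self.feats.__setitem__(key, out))

    def _extract(self, x):
        self.feats.clear()
        self.backbone(x)
        return [self.feats[n] for n in self.layer_names]

    def forward(self, x0, x1):
        d = 0.0
        for f0, f1 in zip(self._extract(x0), self._extract(x1)):
            # Unit-normalize along the channel dimension, as in LPIPS.
            f0 = f0 / (f0.norm(dim=1, keepdim=True) + 1e-10)
            f1 = f1 / (f1.norm(dim=1, keepdim=True) + 1e-10)
            # Squared difference averaged over channels and spatial positions
            # (assumes 4-D conv feature maps), summed over layers.
            d = d + ((f0 - f1) ** 2).mean(dim=(1, 2, 3))
        return d  # shape: (batch,)

# Hypothetical usage (layer names depend on the architecture; a torchvision
# ResNet exposes 'layer1', 'layer2', 'layer3'):
# lpips_fn = FeatureLPIPS(at_model, layer_names=['layer1', 'layer2', 'layer3'])
```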
Results
Ablations
Acknowledgements: This work was supported by Uchhatar Avishkar Yojana (UAY) project (IISC 10), MHRD, Govt. of India. Sravanti Addepalli is supported by a Google PhD Fellowship in Machine Learning. We are thankful for the support.