Hidden randomness in defenses #65

yi-sun · 2018-12-19T17:29:46Z

The contest proposal states:

The following would not be shared exact sequence of randomness during evaluation (e.g. np.seed)

We have a few clarifying questions:

When submitting a defense, is the defense required to perform well for all values of np.seed, or may the defenders specify a specific value which is hidden from attackers?
In the latter case, how would this be implemented in the Docker framework?

carlini · 2018-12-19T17:36:04Z

We haven't carefully considered this yet. I would be partial to saying that a defense should work with any random seed, but that it is free to choose a fresh seed every time it classifies an image.

If we instead allow the defense to only work with one seed the defender knows and the attacker doesn't, we're no longer in a fully white-box threat model: the defender now gets to hold something secret.

But I think it would be worth discussing this to make sure there aren't any unintended consequences. Can you think of a defense where it makes sense to only work for one random seed but not others?

yi-sun · 2018-12-21T22:51:19Z

We have been testing a specific defense idea leveraging private randomness which I've emailed you about privately. Please let me know if you'd prefer to keep the rules discussion on this thread, in which case I'll try to rephrase our idea in a less specific way.

carlini · 2018-12-21T22:52:55Z

Let me take a look at your email.

carlini · 2019-01-08T18:48:17Z

I've been giving this some thought. I'm inclined to say "no" that defenses must work with an arbitrary seed. If we allow defenses to have a secret seed, then what's to say that they don't use this to initialize some weights of the neural network and now we have a grey-box threat model which we want explicitly to avoid.

@catherio @nottombrown do you have any thoughts?

catherio · 2019-01-10T17:16:13Z

That's my inclination, too, but maybe you could forward the email so I can think about this specific case?

catherio · 2019-01-10T17:33:15Z

Ok, having read this, I agree with @carlini. The randomness is be viewed as coming from "the world"; the defender has to accept what it is given, and work well under all such situations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hidden randomness in defenses #65

Hidden randomness in defenses #65

yi-sun commented Dec 19, 2018

carlini commented Dec 19, 2018

yi-sun commented Dec 21, 2018

carlini commented Dec 21, 2018

carlini commented Jan 8, 2019

catherio commented Jan 10, 2019

catherio commented Jan 10, 2019

Hidden randomness in defenses #65

Hidden randomness in defenses #65

Comments

yi-sun commented Dec 19, 2018

carlini commented Dec 19, 2018

yi-sun commented Dec 21, 2018

carlini commented Dec 21, 2018

carlini commented Jan 8, 2019

catherio commented Jan 10, 2019

catherio commented Jan 10, 2019