Author:
Rademaker Thomas J.,Bengio Emmanuel,François Paul
Abstract
Machine learning algorithms can be fooled by small well-designed adversarial perturbations. This is reminiscent of cellular decision-making where ligands (called antagonists) prevent correct signalling, like in early immune recognition. We draw a formal analogy between neural networks used in machine learning and models of cellular decision-making (adaptive proofreading). We apply attacks from machine learning to simple decision-making models, and show explicitly the correspondence to antagonism by weakly bound ligands. Such antagonism is absent in more nonlinear models, which inspired us to implement a biomimetic defence in neural networks filtering out adversarial perturbations. We then apply a gradient-descent approach from machine learning to different cellular decision-making models, and we reveal the existence of two regimes characterized by the presence or absence of a critical point for the gradient. This critical point causes the strongest antagonists to lie close to the decision boundary. This is validated in the loss landscapes of robust neural networks and cellular decision-making models, and observed experimentally for immune cells. For both regimes, we explain how associated defence mechanisms shape the geometry of the loss landscape, and why different adversarial attacks are effective in different regimes. Our work connects evolved cellular decision-making to machine learning, and motivates the design of a general theory of adversarial perturbations, both for in vivo and in silico systems.
Publisher
Cold Spring Harbor Laboratory
Reference68 articles.
1. Deep learning
2. Alex Krizhevsky , Ilya Sutskever , and Geoffrey E Hinton , “Imagenet classiffication with deep convolutional neural networks,” in Advances in Neural Information Processing Systems (2012) pp. 1097–1105.
3. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups;IEEE Signal processing magazine,2012
4. Ilya Sutskever , Oriol Vinyals , and Quoc V Le , “Sequence to sequence learning with neural networks,” in Advances in Neural Information Processing Systems (2014) pp. 3104–3112.
5. Intriguing properties of neural networks;arXiv preprint,2013