Ahmadi, M. M., Alrahis, L., Colucci, A., Sinanoglu, O., & Shafique, M. (2022). NeuroUnlock: Unlocking the Architecture of Obfuscated Deep Neural Networks. In Proceedings 2022 International Joint Conference on Neural Networks (IJCNN) (pp. 01–10). https://doi.org/10.1109/IJCNN55064.2022.9892545
E191-02 - Forschungsbereich Embedded Computing Systems
-
Published in:
Proceedings 2022 International Joint Conference on Neural Networks (IJCNN)
-
ISBN:
978-1-7281-8671-9
-
Volume:
2022-July
-
Date (published):
2022
-
Event name:
2022 International Joint Conference on Neural Networks (IJCNN)
en
Event date:
18-Jul-2022 - 23-Jul-2022
-
Event place:
Padua, Italy
-
Number of Pages:
10
-
Keywords:
hardware architecture; Deep neural networks; Model extraction; Obfuscation; Side-channel-based attacks
en
Abstract:
The advancements of deep neural networks (DNNs) have led to their deployment in diverse settings, including safety and security-critical applications. As a result, the characteristics of these models (e.g., the architecture of layers and weight values/distributions) have become sensitive intellectual properties that require protection from malicious users. Extracting the architecture of a DNN through leaky side-channels (e.g., memory access) allows adversaries to (i) clone the model (i.e., build proxy models with similar accuracy profiles), and (ii) craft adversarial attacks. DNN obfuscation thwarts side-channel-based architecture stealing (SCAS) attacks by altering the run-time traces of a given DNN while preserving its functionality. In this work, we expose the vulnerability of state-of-the-art DNN obfuscation methods (based on predictable and reversible modifications employed in a given DNN architecture) to these attacks. We present NeuroUnlock, a novel SCAS attack against obfuscated DNNs. Our NeuroUnlock employs a sequence-to-sequence model that learns the obfuscation procedure and automatically reverts it, thereby recovering the original DNN architecture. We demonstrate the effectiveness of NeuroUnlock by recovering the architecture of 200 randomly generated and obfuscated DNNs running on the Nvidia RTX 2080 TI graphics processing unit (GPU). Moreover, NeuroUnlock recovers the architecture of various other obfuscated (and publicly available) DNNs, such as the VGG-11, VGG-13, ResNet-20, and ResNet-32 networks. After recovering the architecture, NeuroUnlock automatically builds a near-equivalent DNN with only a 1.4% drop in the testing accuracy. We further show that launching a subsequent adversarial attack on the recovered DNNs boosts the success rate of the adversarial attack by 51.7% in average compared to launching it on the obfuscated versions. Additionally, we propose a novel methodology for DNN obfuscation, ReDLock, which eradicates the deterministic nature of the obfuscation and achieves 2.16 x more resilience to the NeuroUnlock attack. We release the NeuroUnlock and the ReDLock as open-source frameworks11https://github.com/Mahya-Ahmadi/NeuroUnlock.
en
Research Areas:
Computer Engineering and Software-Intensive Systems: 100%