Rathi, Pulkit and Bhadauria, Saumya and Rathi, Sugandha (2022) Watermarking of Deep Recurrent Neural Network Using Adversarial Examples to Protect Intellectual Property. Applied Artificial Intelligence, 36 (1). ISSN 0883-9514
Watermarking of Deep Recurrent Neural Network Using Adversarial Examples to Protect Intellectual Property.pdf - Published Version
Download (5MB)
Abstract
In the present era, deep learning algorithms are the key elements of several state-of-the-art solutions. But developing these algorithms for production requires a huge volume of data, computational resources, and human expertise. Thus, illegal reproduction, distribution, and modification of these models can cause economic damage to developers and can lead to copyright infringement. We propose a novel watermarking algorithm for deep recurrent neural networks based on adversarial examples that can verify the ownership of the model in a black-box way. In this paper, a novel algorithm to watermark a popular pre-trained speech-to-text deep recurrent neural network model Deep Speech without affecting the accuracy of the model is demonstrated. Watermarking is done by generating a set of adversarial examples by adding noise to the input such that the DeepSpeech model predicts the given input as the target string. In the case of copyright infringement, these adversarial examples can be used to verify ownership of the model. If the alleged stolen model predicts the same target string for the adversarial examples, the ownership of the model is verified. This novel watermarking algorithm can minimize the economic damage to the owners of the deep learning models due to stealing and plagiarizing.
Item Type: | Article |
---|---|
Subjects: | STM Digital Library > Computer Science |
Depositing User: | Unnamed user with email support@stmdigitallib.com |
Date Deposited: | 19 Jun 2023 06:34 |
Last Modified: | 18 Oct 2024 04:16 |
URI: | http://archive.scholarstm.com/id/eprint/1431 |