Research Article

Blockchain-based Framework for Generating and Managing Unlearnable Examples for Enhancing Data Privacy and Access Control

DOI:

10.3791/68338

August 22nd, 2025

In This Article

Summary

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

This paper proposes a blockchain-based framework for generating unlearnable examples, integrating dynamic perturbation with access control. It enhances privacy protection by ensuring that unauthorized users receive perturbed data, safeguarding sensitive information while enabling efficient data management and access through smart contracts.

Abstract

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

In the context of the rapid development of large language models (LLMs), contrastive learning has become widely adopted due to its ability to bypass costly data annotation by leveraging vast amounts of network data for model training. However, this widespread use raises significant concerns regarding data privacy protection. Unlearnable Examples (UEs), a technique that disrupts model learning by perturbing data, effectively prevents unauthorized models from misusing sensitive data. However, existing methods for generating UEs face two primary challenges: first, perturbations may be reversed using techniques such as reverse purification or denoising, including diffusion models that remove protective perturbations in image UEs; second, once data is published, ensuring data traceability and managing access control becomes difficult. To address these issues, this paper proposes a Blockchain-Integrated Unlearnable Example Generation and Management Framework (B-UEGMF) for generating and managing UEs. By leveraging the decentralized and immutable properties of blockchain, we store example hash values on the blockchain and dynamically manage data access rights through smart contracts. Additionally, UEs are generated using a multi-objective perturbation technique, Dynamic Error-Minimizing Noise (DEM), which enhances robustness against reversal methods. We also provide a quantitative evaluation of the privacy protection capabilities of the generated examples. Experimental results demonstrate that the proposed framework significantly improved the defense of UEs against reverse attacks while ensuring efficient data privacy management.

Introduction

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

In recent years, with the rapid advancement of deep learning and large language models, contrastive learning has emerged as an efficient unsupervised learning approach due to its independence from costly manual annotations1,2. However, the extensive use of public datasets has raised significant concerns about privacy breaches and data misuse. Instances of unauthorized utilization of publicly available data for model training have become increasingly common3. For example, in 2017, unauthorized public photographs were employed to train facial recognition models4. S....

Access restricted. Please log in or start a trial to view this content.

Protocol

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Setup
We considered a supervised classification task with a dataset Dataset notation D={(xi,yi)} in mathematical formula, data pair representation., where Mathematical notation for vector elements in set X within Euclidean space R, symbolic representation. represents the input features and

Access restricted. Please log in or start a trial to view this content.

Results

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Blockchain and smart contract framework
Experimental results demonstrated that the proposed Blockchain-Integrated Unlearnable Example Generation and Management Framework (B-UEGMF), combined with smart contracts, enabled effective dynamic management of client-specific access to data. For authorized users, the retrieved clean data achieved a test accuracy of 90.2% on a ResNet-18 surrogate model evaluated on the CIFAR-10 dataset. In contrast, unauthorized users accessing UEs generated by DEM obtained a .......

Access restricted. Please log in or start a trial to view this content.

Discussion

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The integration of blockchain and UEs has advanced the field of data privacy protection by providing a transparent and decentralized solution for managing data access. Unlike conventional privacy-preserving methods, which often rely solely on perturbation techniques31, this study bridges the gap between data protection and responsibility tracing. In federated learning scenarios, the proposed framework ensures secure and private model training across decentralized datasets, mitigating the risk of u.......

Access restricted. Please log in or start a trial to view this content.

Disclosures

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The authors have nothing relevant to this publication to disclose.

Acknowledgements

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

This work was supported by the School of Cyberspace Security, Zhengzhou University, which provided an excellent research environment and academic resources. We are deeply grateful to our supervisor, Prof. Zijiao Zhang, for his invaluable guidance, insightful suggestions, and continuous encouragement throughout this research. We also extend our sincere thanks to the Network Management Center of Zhengzhou University for providing experimental servers, high-performance computing resources, and blockchain testbed infrastructure, which were essential for the successful implementation of this study.

Author contribution:
Rui....

Access restricted. Please log in or start a trial to view this content.

Materials

List of materials used in this article
NameCompanyCatalog NumberComments
CUDA 12.1NVIDIAUsed to enhance the performance of deep learning applications
NVIDIA A800 80GB PCIe A800 80GB PCIeNVIDIAUsed for deep learning model training
Python 3.10Python Software FoundationUsed for data preprocessing and analysis
PyTorch 2.5.1FacebookDeep learning framework used for model training
Ubuntu 22.04CanonicalOperating system used for setting up the environment

References

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,
  1. Multi-level Cross-view Contrastive Learning for Knowledge-aware Recommender System. Zou, D., Chen, Y., Wang, X. Proc 45th Int ACM SIGIR Conf Res Dev Info Retrieval, , 1358-1368 (2022).
  2. A simple framework for contrastive learning of visual representations. Chen, T., Kornblith, S., Norouzi, M., Hinton, G. Proc 37th Int Conf Mach Learn, 119, 1597-1607 (2020).
  3. Guo, J., et al. Domain watermark: Effective and harmless dataset copyri....

Access restricted. Please log in or start a trial to view this content.

Reprints and Permissions

Request permission to reuse the text or figures of this JoVE article

Request Permission

Tags

Blockchain Data PrivacyUnlearnable ExamplesAccess ControlContrastive LearningData TraceabilitySmart ContractsData PerturbationPrivacy ProtectionReverse Attack DefenseDynamic Error Minimizing Noise

Related Articles