Curriculum Learning and Pattern-Aware Highly Efficient Privacy-Preserving Scheme for Mixed Data Outsourcing with Minimal Utility Loss

Majeed, Abdul; Lee, Kyunghyun; Hwang, Seong Oun

doi:10.3390/app152111849

This is an early access version, the complete PDF, HTML, and XML versions will be available soon.

Open AccessArticle

Curriculum Learning and Pattern-Aware Highly Efficient Privacy-Preserving Scheme for Mixed Data Outsourcing with Minimal Utility Loss

by

Abdul Majeed

^1,*

,

Kyunghyun Lee

²

and

Seong Oun Hwang

^2,*

¹

Secure Cyber Systems Research Group, WMG, University of Warwick, Coventry, CV4 7AL, UK

²

Department of Computer Engineering, Gachon University, Seongnam 13120, Republic of Korea

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2025, 15(21), 11849; https://doi.org/10.3390/app152111849

Submission received: 30 September 2025 / Revised: 2 November 2025 / Accepted: 4 November 2025 / Published: 6 November 2025

(This article belongs to the Special Issue Progress in Information Security and Privacy)

Download Versions Notes

Abstract

A complex problem when outsourcing personal data for public use is balancing privacy protection with utility, and anonymization is a viable solution to address this issue. However, conventional anonymization methods often overlook global information regarding the composition of attributes in data, leading to unnecessary computations and high utility loss. To address these problems, we propose a curriculum learning (CL)-based, pattern-aware privacy-preserving scheme that exploits information about attribute composition in the data to enhance utility and privacy without performing unnecessary computations. The CL approach significantly reduces time overheads by sorting data by complexity, and only the most complex (e.g., privacy-sensitive) parts of the data are processed. Our scheme considers both diversity and similarity when forming clusters to effectively address the privacy–utility trade-off. Our scheme prevents substantial changes in data during generalization by protecting generic portions of the data from futile anonymization, and only a limited amount of data is anonymized through a joint application of differential privacy and k-anonymity. We attain promising results by rigorously testing the proposed scheme on three benchmark datasets. Compared to recent anonymization methods, our scheme reduces time complexity by 74.33%, improves data utility by 19.67% and 68.33% across two evaluation metrics, and enhances privacy protection by 29.19%. Our scheme performs 82.66% fewer lookups in generalization hierarchies than existing anonymization methods. In addition, our scheme is very lightweight and is 1.95× faster than the parallel implementation architectures. Our scheme can effectively solve the trade-off between privacy and utility better than prior works in outsourcing personal data enclosed in tabular form.

Keywords: anonymization; curriculum learning; differential privacy; k-anonymity; diversity; data outsourcing; privacy; utility; clustering; similarity; personal data

Share and Cite

MDPI and ACS Style

Majeed, A.; Lee, K.; Hwang, S.O. Curriculum Learning and Pattern-Aware Highly Efficient Privacy-Preserving Scheme for Mixed Data Outsourcing with Minimal Utility Loss. Appl. Sci. 2025, 15, 11849. https://doi.org/10.3390/app152111849

AMA Style

Majeed A, Lee K, Hwang SO. Curriculum Learning and Pattern-Aware Highly Efficient Privacy-Preserving Scheme for Mixed Data Outsourcing with Minimal Utility Loss. Applied Sciences. 2025; 15(21):11849. https://doi.org/10.3390/app152111849

Chicago/Turabian Style

Majeed, Abdul, Kyunghyun Lee, and Seong Oun Hwang. 2025. "Curriculum Learning and Pattern-Aware Highly Efficient Privacy-Preserving Scheme for Mixed Data Outsourcing with Minimal Utility Loss" Applied Sciences 15, no. 21: 11849. https://doi.org/10.3390/app152111849

APA Style

Majeed, A., Lee, K., & Hwang, S. O. (2025). Curriculum Learning and Pattern-Aware Highly Efficient Privacy-Preserving Scheme for Mixed Data Outsourcing with Minimal Utility Loss. Applied Sciences, 15(21), 11849. https://doi.org/10.3390/app152111849

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Curriculum Learning and Pattern-Aware Highly Efficient Privacy-Preserving Scheme for Mixed Data Outsourcing with Minimal Utility Loss

Abstract

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI