A Differentially Private Big Data Nonparametric Bayesian Clustering Algorithm in Smart Grid

Zhitao Guan*, Zefang Lv, Xianwen Sun, Longfei Wu, Jun Wu, Xiaojiang Du, Mohsen Guizani

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

10 Citations (Scopus)


Smart systems, including smart grid (SG) and Internet of Things (IoT), have been playing a critical role in addressing contemporary issues. Taking full advantage of the big data generated by the smart grid can enhance the system stability and reliability, increase asset utilization, and offer better customer experience. To better support the data-driven smart grid, the machine learning technologies such as cluster analysis can be applied to process the massive data generated in smart grid. However, the process of cluster analysis may cause the disclosure of personal private information. In this paper, to achieve privacy-preserving cluster analysis in smart grid, we propose IDPC, a Differentially Private Clustering algorithm based on the Infinite Gaussian mixture model (IGMM). IDPC uses a combination of nonparametric Bayesian method and differential privacy. The nonparametric Bayesian method allows certain parameters to change along with the data and it is usually adopted in a clustering algorithm without a fixed number of clusters. The Laplace mechanism is used in data releasing process to make IDPC differentially private. We present how to make the nonparametric Bayesian clustering algorithm differentially private by adding Laplace noise. By security analysis and performance evaluation, IDPC is proved to be privacy-preserving as well as efficient.

Original languageEnglish
Article number9057414
Pages (from-to)2631-2641
Number of pages11
JournalIEEE Transactions on Network Science and Engineering
Issue number4
Publication statusPublished - 2020 Oct 1
Externally publishedYes


  • Big data
  • Clustering
  • Differential privacy
  • Nonparametric Bayesian Method
  • Smart grid

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Computer Science Applications
  • Computer Networks and Communications


Dive into the research topics of 'A Differentially Private Big Data Nonparametric Bayesian Clustering Algorithm in Smart Grid'. Together they form a unique fingerprint.

Cite this