Title : ( Entropy-based Consensus for Distributed Data Clustering )
Authors: MOEIN OWHADI KARESHK , Mohammad Reza Akbarzadeh Totonchi ,Access to full-text not allowed by authors
Abstract
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with consideration for the confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in the consensus process, hence no private data is transferred. With the proposed use of entropy as an internal measure of consensus clustering validation at each machine, the cluster centers of the local machines with higher expected clustering validity have more influence on the final consensus centers. We also employ the relative cost function of the local Fuzzy C-Means (FCM) and the number of data points in each machine as measures of relative machine validity as compared to other machines and its reliability, respectively. The utility of the proposed consensus strategy is examined on 18 datasets from the UCI repository in terms of clustering accuracy and speed-up against the centralized version of FCM. Several experiments confirm that the proposed approach yields to higher speed-up and accuracy, while maintaining data security due to its protected and distributed processing approach.
Keywords
, Consensus Clustering; Distributed Clustering; Fuzzy C, Means; Ensemble Learning; Entropy.@article{paperid:1078453,
author = {OWHADI KARESHK, MOEIN and Akbarzadeh Totonchi, Mohammad Reza},
title = {Entropy-based Consensus for Distributed Data Clustering},
journal = {Journal of Artificial Intelligence and Data Mining},
year = {2018},
month = {July},
issn = {2322-5211},
keywords = {Consensus Clustering; Distributed Clustering; Fuzzy C-Means; Ensemble Learning; Entropy.},
}
%0 Journal Article
%T Entropy-based Consensus for Distributed Data Clustering
%A OWHADI KARESHK, MOEIN
%A Akbarzadeh Totonchi, Mohammad Reza
%J Journal of Artificial Intelligence and Data Mining
%@ 2322-5211
%D 2018