2015/6/201 minimum spanning tree partitioning algorithm for microaggregation 報告者:林惠珍
Post on 20-Dec-2015
222 views
TRANSCRIPT
112/04/18 1
Minimum Spanning Tree Partitioning Algorithm fo
r Microaggregation
報告者:林惠珍
112/04/18 2
Outline
The Microaggregation problem MST partitioning algorithm Experimental results Conclusions and future work
112/04/18 3
The Microaggregation problem
Microdata set( n records, p numerical attributes)
To partition the n points into groups so as to minimize the objective function: SSE
subject to
It is equivalent to minimizing a standardized information loss measure L= SSE / SST.
112/04/18 4
Microaggregation vs. clustering problem
Constraint
Problem
The number of clusters
cluster size
Microaggregation No Yes
Clustering Yes No
So, modify the strategy for selecting edges for deletion from the MST for microaggregation problem. In decreasing length order Each tree has at least k nodes
112/04/18 5
MST Partitioning Algorithm
MST construction( Prim’s algorithm) Edge cutting Cluster formation
112/04/18 6
k=5
112/04/18 7
112/04/18 8
Experimental methods
D: Diameter-based fixed size method C: Centroid-based fixed size method M: MST-partitioning algorithm M-d: MST-partitioning followed by clusters of
size >= 2k partitioned by D M-c: MST-partitioning followed by clusters of
size >= 2k partitioned by C
112/04/18 9
112/04/18 10
Experimental results
112/04/18 11
Conclusions
The more pronounced the inherent clustering effects in the data, the greater is the advantage of using the our methods.
MST partitioning-based method should be considered as a potential candidate for any practical application.
112/04/18 12
Future work
To adapt some of the ideas used to solve other clustering problems to this constrained version.
To explore methods where minimum group size is treated as a soft constraint associated with a preference level.