2015/6/201 minimum spanning tree partitioning algorithm for microaggregation 報告者:林惠珍

12
111/04/23 1 Minimum Spanning Tr ee Partitioning Alg orithm for Microagg regation 報報報 報報報

Post on 20-Dec-2015

222 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 1

Minimum Spanning Tree Partitioning Algorithm fo

r Microaggregation

報告者:林惠珍

Page 2: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 2

Outline

The Microaggregation problem MST partitioning algorithm Experimental results Conclusions and future work

Page 3: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 3

The Microaggregation problem

Microdata set( n records, p numerical attributes)

To partition the n points into groups so as to minimize the objective function: SSE

subject to

It is equivalent to minimizing a standardized information loss measure L= SSE / SST.

Page 4: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 4

Microaggregation vs. clustering problem

Constraint

Problem

The number of clusters

cluster size

Microaggregation No Yes

Clustering Yes No

So, modify the strategy for selecting edges for deletion from the MST for microaggregation problem. In decreasing length order Each tree has at least k nodes

Page 5: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 5

MST Partitioning Algorithm

MST construction( Prim’s algorithm) Edge cutting Cluster formation

Page 6: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 6

k=5

Page 7: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 7

Page 8: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 8

Experimental methods

D: Diameter-based fixed size method C: Centroid-based fixed size method M: MST-partitioning algorithm M-d: MST-partitioning followed by clusters of

size >= 2k partitioned by D M-c: MST-partitioning followed by clusters of

size >= 2k partitioned by C

Page 9: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 9

Page 10: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 10

Experimental results

Page 11: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 11

Conclusions

The more pronounced the inherent clustering effects in the data, the greater is the advantage of using the our methods.

MST partitioning-based method should be considered as a potential candidate for any practical application.

Page 12: 2015/6/201 Minimum Spanning Tree Partitioning Algorithm for Microaggregation 報告者:林惠珍

112/04/18 12

Future work

To adapt some of the ideas used to solve other clustering problems to this constrained version.

To explore methods where minimum group size is treated as a soft constraint associated with a preference level.