cluster analysis market segmentation document similarity
TRANSCRIPT
![Page 1: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/1.jpg)
Cluster Analysis
• Market Segmentation• Document Similarity
![Page 2: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/2.jpg)
Segment Members
![Page 3: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/3.jpg)
Segment Members
Biz
Tech Math
= 64
MainGroups
![Page 4: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/4.jpg)
• Each object is assigned to its own cluster and then the algorithm proceeds iteratively, at each stage joining the two most similar clusters, continuing until there is just a single cluster.
• At each stage distances between clusters are recomputed by the Lance–Williams dissimilarity update formula according to the particular clustering method being used.
Hierarchical Clustering
![Page 5: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/5.jpg)
![Page 6: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/6.jpg)
biztech <- read.csv("survey-biztech.csv")biztech <- as.matrix(biztech)
#hierarchical clusteringd <- dist(as.matrix(biztech))dm <- data.matrix(d)write.csv(dm, "distance_matrix.csv")
Hierarchical Clustering
![Page 7: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/7.jpg)
hc <- hclust(d)plot(hc)rect.hclust(hc, k=6, border="red")
![Page 8: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/8.jpg)
Hierarchical Clustering
ct <- cutree(hc, k=6) #write to filewrite.csv(ct, "survey-hclust.csv")
![Page 9: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/9.jpg)
![Page 10: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/10.jpg)
![Page 11: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/11.jpg)
• hierarchical clustering is very expensive in terms of time complexity
• though it provides better result
![Page 12: Cluster Analysis Market Segmentation Document Similarity](https://reader030.vdocument.in/reader030/viewer/2022032607/56649ec95503460f94bd6bbd/html5/thumbnails/12.jpg)
Cold Weather