visualizing symmetric square matrices with rainbow boxes
TRANSCRIPT
![Page 1: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/1.jpg)
1
iV2018 – Salerno
Visualizing symmetric square matrices with rainbow boxes:
Methods and application to character co-occurrence matrices in literary texts
Jean-Baptiste Lamy
jean-baptiste.lamy @ univ-paris13.fr
LIMICSUniversité Paris 13, Sorbonne Paris Cité, 93017 BobignySorbonne Universités, ParisINSERM UMRS 1142
![Page 2: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/2.jpg)
2
Introduction
Symmetric square matrices / co-occurrence matrices [Leydesdorff]
Frequent type of dataset
Example: distant reading indigital humanities [Jänicke]
Character matrix in a novel
![Page 3: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/3.jpg)
3
Matrix-based visualization
Example on‘Les Misérables’(Victor Hugo)
Problem
A character canbelong to at mostone group / cluster
![Page 4: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/4.jpg)
4
Objectives
Propose a visualization techniques for symmetric square matrices
By transforming the matrix into overlapping sets
And visualizing these overlapping sets with rainbow boxes
Focus on the representation of subsets of interrelated elements
For example in a novel: groups of characters that know each other
Rainbow boxes have been used only in the biomedical domain
=> a new application domain
Symmetricsquarematrix
Overlapping sets
Visualization
![Page 5: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/5.jpg)
5
Rainbow boxes
Example on amino acids
![Page 6: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/6.jpg)
6
General methods
A symmetric square matrix
Extracting overlapping sets
One element for each row / column
One set for each group of interrelated elements
The selection function
Returns True if two elements are related,depending on the value in the matrix
Compute S, the set of sets:
One group
![Page 7: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/7.jpg)
7
General methods
Each set corresponds to several values in the matrix
The aggregation function returns a single valuefrom these values:
The colorization function converts the resulting value to a color:
Each set => One rectangular box in rainbow boxes
Visualization parameters: the 3 functions
select(), aggregate(), colorize()
![Page 8: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/8.jpg)
8
Application to a small dataset(21 characters)
Matrix produced manually by the authors
Number in the matrix: part of the novel in which the two characters begin their relation
0: never
1: before the novel begins
2-5: parts 1-4
Definitions of the 3 functions:
![Page 9: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/9.jpg)
9
Application to a small dataset(21 characters)
![Page 10: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/10.jpg)
10
Application to a small dataset(21 characters)
![Page 11: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/11.jpg)
11
Application to a small dataset(21 characters)
![Page 12: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/12.jpg)
12
Application to a small dataset(21 characters)
![Page 13: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/13.jpg)
13
Application to a small dataset(21 characters)
Hero (main character)
![Page 14: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/14.jpg)
14
Application to a small dataset(21 characters)
Hero + heroin
![Page 15: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/15.jpg)
15
Application to a small dataset(21 characters)
A group of isolated characters(« ghetto »)
![Page 16: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/16.jpg)
16
Application to a large dataset(Les Misérables, 80 characters)
Problem:
The optimization of the order of 80 columns
Factorial complexity
80! ≈ 10e118
Previously published heuristic algorithm limited to 20-25 columns
![Page 17: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/17.jpg)
AFB metaheuristic
Artificial Feeding Birds (AFB) [Springer]Inspired by the behaviour of pigeons
➔ Simple➔ Performant➔ Generic
An optimisation problem = A triplet ( coût(), vol(), marche() )
Flies......to land at a new random
position (2)
...to join the position of
another bird (4)
Walks to a close position (1)
...to return to a memorized position rich
in food (3)
![Page 18: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/18.jpg)
18
Application to a large dataset(Les Misérables, 80 characters)
z
Same parameter functions as previously
Blue : part 1 Cyan : part 2
Green : part 3 Red : part 4
(part 5)
![Page 19: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/19.jpg)
19
Application to a large dataset(Les Misérables, 80 characters)
z
Most new encounters occur in parts 1 and 3, and seems well-separated
Blue : part 1 Cyan : part 2
Green : part 3 Red : part 4
(part 5)
![Page 20: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/20.jpg)
20
Application to a large dataset(Les Misérables, 80 characters)
z
Gavroche appears in part 3 but encounters many more characters in part 4
Blue : part 1 Cyan : part 2
Green : part 3 Red : part 4
(part 5)
![Page 21: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/21.jpg)
21
Application to a large dataset(Les Misérables, 80 characters)
z
« Christmas tree »pattern: 1 characterrelated to manyothers in 1-to-1relations
Blue : part 1 Cyan : part 2
Green : part 3 Red : part 4
(part 5)
![Page 22: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/22.jpg)
22
Application to a large dataset(Les Misérables, 80 characters)
Another matrix
Same novel and same characters
Numbers in the matrix are the number of times the characters meet
Different parameter functions
![Page 23: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/23.jpg)
23
Application to a large dataset(Les Misérables, 80 characters)
z
![Page 24: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/24.jpg)
24
Application to a large dataset(Les Misérables, 80 characters)
z
The characters of this group have very few relations outside the group
![Page 25: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/25.jpg)
25
Application to a large dataset(Les Misérables, 80 characters)
z
In this group, Jean Valjean is the most active character
![Page 26: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/26.jpg)
26
Discussion
Interesting for highly connected networks
Rainbow boxes are more compact that matrices
We inferred groups of characters from pairwise relations
A knows B, A knows C, B knows C
=> A, B, C know each others
But is A aware that B knows C?
Perspectives
Extend the method to non-symmetric matrices (directed graphs)
Adapt the method to other overlapping set visualization techniques
Venn diagrams,...
Apply the method to other domains
Social media (FOAF), bioinformatics (protein interaction matrices),...
![Page 27: Visualizing symmetric square matrices with rainbow boxes](https://reader035.vdocument.in/reader035/viewer/2022062222/62a487a470748f4ffd0766ad/html5/thumbnails/27.jpg)
27
References[iV2016] : Lamy JB, Berthelot H, Favre M. Rainbow boxes: a technique for visualizing overlapping sets and an application to the comparison of drugs properties. International Conference Information Visualisation (iV) 2016;:253-260
[JVLC] : Lamy JB, Berthelot H, Capron C, Favre M. Rainbow boxes: a new technique for overlapping set visualization and two applications in the biomedical domain. Journal of Visual Language and Computing 2017;43:71-82
[Springer] : Lamy JB. Artificial Feeding Birds (AFB): a new metaheuristic inspired by the behavior of pigeons. Advances in nature-inspired computing and applications 2018;under press
[Leydesdorff] : Leydesdorff L and Vaughan L. Co-occurrence matrices and their applications in information science: Extending ACA to the Web environment. Journal of the association for information science and technology 2006;57(12):1616-1628
[Jänicke] : Jänicke S, Franzini G, Cheema MF,and Scheuermann G. Visual text analysis indigital humanities. 2016;226-250 ?
? ???
?