spherical similarity explorer for comparative case analysis · with a series of analysis,...

Spherical Similarity Explorer for Comparative Case AnalysisLeishi Zhang1, Chris Rooney1, Lev Nachmanson2, William Wong1, Bum Chul Kwon3,Florian Stoffel3, Michael Hund3, Nadeem Qazi1, Uchit Singh1, and Daniel Keim3

1 Middlesex University, London, UK2 Microsoft Research, Redmond, USA3 University of Konstanz, Konstanz, Germany

AbstractComparative Case Analysis (CCA) is an important tool for

criminal investigation and crime theory extraction. It analyzesthe commonalities and differences between a collection of crimereports in order to understand crime patterns and identify abnor-mal cases. A big challenge of CCA is the data processing andexploration. Traditional manual approach can no longer copewith the increasing volume and complexity of the data. In this pa-per we introduce a novel visual analytics system, Spherical Simi-larity Explorer (SSE) that automates the data processing processand provides interactive visualizations to support the data explo-ration. We illustrate the use of the system with uses cases thatinvolve real world application data and evaluate the system withcriminal intelligence analysts.

1. IntroductionComparative Case Analysis (CCA) is a widely used tech-

nique for criminal intelligence analysis [15]. Given a collectionof crime reports that contain both structured fields such as crimetype and unstructured fields such as modus operandi, CCA aimsat analyzing the commonalities and differences between them forpredicting criminal activities, determining current, new or emerg-ing problems and highlighting prevention, reduction or diversionopportunities [31]. Currently, CCA is a manual process. Afterreading the crime reports, the analyst identifies relevant headings(factors) that are considered to be useful for their understanding ofthe cases. Information from the reports is then collated under theheadings for comparative analysis, result in a CCA table, wherethe analyst scans across crime rows to identify (dis)similarities.

With the increasing size of data and variety of informationthat is collected presently, the manual process becomes increas-ingly infeasible. First of all, identifying meaningful factors fromlarge amount of data and collating corresponding information toform a CCA table is labor intensive and error-prone. Secondlycomparison across tables with large amount of rows (crime re-ports) and columns (factors) is difficult. Analysts are looking fortools that can help them to extract relevant features from the data,and to visualize the data in graphical representations such thatcomparison can be made easy [4]. Software systems such as IBMi2 [1] have been developed to assist the analyst with extractinginformation from large amounts of criminal data and link relatedinformation. But tools that support automatic data processing andsimilarity analysis are lacking.

We see parallels between CCA and traditional documentanalysis where structure needs to be extracted from unstructuredtextual fields. Therefore, we envision an automated CCA work-flow starting with the extraction of meaningful features and con-

cepts from the document collection. Subsequently, a CCA tableis compiled by combining the information of the extracted fea-tures and concepts with existing structured fields in the data. The(dis)similarities between documents are then computed using apredefined distance measure that take into consideration valuesin both structured fields in the data and information of extractedfeatures. Finally, the (dis)similarities are visualized in graphicalrepresentations to help the analyst identify patterns and relationsin the data. These visual representations can then be presented inan interactive graphical user interface where the analyst can inter-act with the data and look at the data from different perspectives.

In this paper we propose a visual analytics pipeline thatadvances existing CCA approach by automating the CCA tablegeneration process, computing the (dis)similarity between docu-ments, and visualizing the (dis)similarity and the data in an inter-active visual display to support reasoning and sense making. Weimplement the pipeline in Spherical Similarity Explorer (SSE), avisual analytics tool that integrates a spherical embedding methodwith a series of analysis, visualization and interaction techniquesto support Comparative Case Analysis (see Fig. 1).

The contributions of this paper include 1) a visual analyt-ics pipeline for CCA; and 2) a novel interactive visualizationtechnique that maps textual documents to a 3D spherical surfaceand allows interactive exploration of the similarity between docu-ments. The remainder of this paper is organized as follows: Sec-tion 2 introduces the research background; Section 3 discussesrelated work; Section 4 introduces the visual analytics pipelineof our SSE tool, as well as the detailed embedding, interactionand visualization techniques; Section 5 demonstrates two usecases with real-world criminal data and presents feedback fromend users; and Section 6 draws conclusions and discusses futurework.

2. BackgroundThe work proposed in this paper is motivated by challenges

in CCA as part of the mission of the EU FP7 funded project “Vi-sual Analytics for Sense-making and Criminal Intelligence Anal-ysis” [4]. The project looks into the development of VA solutionsthat supports evidential reasoning and sense-making from a largevolume of criminal intelligence data.

CCA is also called Similar Fact Analysis (SFA) [5]. The ideais to analyze similar features (factors) in different crime cases inorder to support criminal investigation and crime theory extrac-tion. The former aims at identifying similar crime cases that arecommitted by the same offender(s) or criminal organizations, andthe latter aims at identifying the causes of certain types of crimefor future crime reduction and prevention.

©2016 Society for Imaging Science and Technology

IS&T International Symposium on Electronic Imaging 2016Visualization and Data Analysis 2016 VDA-496.1

https://www.researchgate.net/publication/30975532_'Intelligence_Led_Policing_or_Policing_Led_Intelligence'_Integrating_Volume_Crime_Analysis_into_Policing?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/profile/Bum_Chul_Kwon?el=1_x_100&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/profile/BL_Wong?el=1_x_100&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/profile/Lev_Nachmanson?el=1_x_100&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/profile/Michael_Hund?el=1_x_100&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/profile/Daniel_Keim?el=1_x_100&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/profile/Nadeem_Qazi2?el=1_x_100&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

Figure 1. Spherical Similarity Explorer: Ê The Crime reports are mapped to an interactive spherical surface according to their (dis)similarities. Ë The user

can decide which features to include for projection. Ì The user can select a document on the spherical surface and see similar documents highlighted in purple,

the density of the color indicates the similarity level, higher density indicates higher similarity score. Close neighbors are projected onto a Geo-map. The point

in the orange rectangular frame is an example of a very similar crime that happened at a distant location. Í The crimes are also projected to a time line. Î

The details of the selected document Ï The word cloud visualization allows the user to analyze the common features of a set of documents; in this case, terms

such as “rear”, “property”, “smashed”, “kitchen” and “window” occur frequently in the crime reports; these terms may be used to generate hypothesis about the

crime pattern. Ï The keyword search allows the user to investigate documents that contain the given keywords. The user can click on a result item to see its

projection on the sphere.

CCA involves data exploration. For example, new clues maybe found while viewing key features of a particular crime. In thatcase analysts often want to view details of other similar crimes.This key feature identification is an iterative and incremental pro-cess. Consequently the analyst’s focus of interest often movesfrom one case to another. A convenient visualization tool wouldsupport such exploration by moving the “point of interest” to thecenter of the drawing canvas such that the analyst can pay closerlook into the details of the data point and examine the neighbor-hood easily.

One of the most commonly used visualization techniques fordisplaying (dis)similarities within large document collections isthe so called “galaxy view”, as it is for example implemented inIN-SPIRE [2], a state-of-the-art visual document analysis soft-ware. In the “galaxy view” each document is represented by apoint, and the dissimilarities between documents are representedas Euclidean distances between points. By placing similar objectclose to each other and dissimilar object far apart, the visualiza-tion reveals patterns such as groups and outliers in the data.

The “galaxy view” helps the analyst to see patterns and re-lations in the data. But the method lacks flexibility in terms ofnavigation. Once a visual embedding has been generated, eachobject has a fixed location in the view. When the analyst’s fo-

cus of interest moves from one document to another, it is hardto adjust the “point of interest” to the center of the display ac-cordingly. Moving an arbitrary point in the display to the centerrequires either moving the drawing canvas, which may cause lossof information, or re-generating the layout, which may destroy theexisting mental map of the analyst and demand additional compu-tation. A“fish-eye” view [21] may be applied to enlarge the areaof interest and at the same time preserve the mental map of theviewer, but every time the point of interest moves, a new layoutneeds to be computed.

Exploring the world map on a globe is an intuitive way of un-derstanding distances and relations between different locations.The metaphor can be effectively adopted to support explorativeanalysis of distances and relations between objects in HD data. Inthe past various visual embedding methods have been proposedto map HD data to a 3D spherical surface [8, 10, 17, 27, 45], butlittle work has been reported on integrating them to VA tools forvisual data exploration. This is hardly surprising, because with-out effective interactions and detail-on-demand visualizations, themethod cannot fully support explorative analysis.

In SSE we utilize the advantage of the 3D globe metaphor byintegrating a spherical embedding method with a series of interac-tions and linked-view-visualizations. At the back-end we develop



https://www.researchgate.net/publication/232970799_Multidimensional_scaling_on_a_sphere?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/229580838_Functional_relations_in_multidimensional_scaling?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/229100011_Generalized_Fisheye_Views?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/222944508_Restricted_multidimensional_scaling_models?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/24062326_Constrained_multidimensional_scaling_inN_spaces?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/6995964_Spherical_self-organizing_map_using_efficient_indexed_geodesic_data_structure?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

a series of text analytics methods to extract features and conceptsfrom the text and compute the (dis)similarity between documents.The aim is to support interactive similarity analysis and data ex-ploration required by CCA. The proposed approach is not lim-ited to the Spherical MDS [17] that is currently implemented inthe system – it can be easily substituted by any other embeddingmethod that generates spherical embeddings. And the interactivespherical projection is also not limited to textual data – it can beadapted easily for analyzing multidimensional data of other datatypes.

3. Related WorkIn this section, we review previous studies that motivate and

inspire our approach.

3.1 Comparative Case AnalysisCCA is based on the notion of “comparison”, which is a fun-

damental tool of analysis and widely used by many social sciencesand scientific domains [14]. CCA starts with data preprocessing.Typically the data contains information gathered from differentsources including police reports, witness statements, etc. and isstored in a semi-structured manner. A crime record may containstructured fields such as location, time, and type of crime, as wellas unstructured textual fields such as modus operandi (abbreviatedas MO) and intelligence notes [15].

A challenge of CCA is the feature extraction and similar-ity computation. Most of the feature extraction work reported inthe literature is done by hand. For example, Bennell et al. [7]manually extracted different types of behavioral features such asentry behavior (front door, back door, climb to second floor etc.),target selection choices (petrol pump), property stolen (jewelery)and offender’s spatial behavior as linking features to compute thepairwise similarity between crime cases. These features are ex-tracted from MO of 86 solved commercial burglaries committedby 43 serial offenders.

It is not difficult to imagine the significant amount of workthat is involved in the aforementioned process. Nowadays the dataCCA has to handle often contains thousands of records or more.The task cannot be carried out without the help of automated fea-ture extraction techniques. SSE tackles the challenge by designingeffective entity extraction and concept extraction techniques thatautomatically extract important features from the data and com-pute corresponding measures. The technical details can be foundin Section 4.1.

Bennell et al. [7] used the dichotomously coded valuesof the above mentioned linking features to examine if high de-gree of similarity between them enables different cases to bevalidly linked to a common offender. The similarity measurewas computed on the basis of Jaccard coefficients between pair-wise crimes. Their approach was twofold, first behavioral featureswere identified using logistic regression analysis capable of dis-tinguishing between related and unrelated crime pairs. Secondly,Receiver Operating Characteristic (ROC) analysis was performedto assign each behavioral feature an overall predictive score.

Canter [13] investigated the consistency of features acrossorganized and disorganized cases based on content analysis andMultidimensional Scaling (MDS). His research revealed that dis-organized features were either easy to identify or more common,probably due to their vast number compared to organized fea-

tures. Jaccard coefficient was used to measure the proportion ofco-occurring features. Given that CCA data contains both textualand non-textual fields, a simple metric that measures the textualfeatures can miss important clues in the comparison such as timeand location of a crime. In SSE we enhance the similarity com-putation by designing a composite measure that measures geo-spatial, temporal and textual features separately and merges themtogether for comparison. Details of the method can be found inSection 4.2.

3.2 Visual Embedding of Multi-dimensional DataMapping HD data to a lower-dimensional (LD) visual dis-

play is a difficult problem that has significant research interest.The main objective is to generate a meaningful LD (visual) rep-resentation of data in HD space such that people can gain under-standing of the structure of the data, as well as the relationshipdis(similarity) between data items. The process of representingHD data in a LD space such as a sheet of paper or computer screenis referred to with various terms or expressions. A general namefor this is embedding. In the specific case of Cartesian data, di-mensionality reduction, projection, mapping, and manifold learn-ing are commonly encountered.

The key characteristic of any embedding technique is its un-derlying model, namely, the way it transforms data. This coor-dinate transformation can be linear or nonlinear. It can also beparametric or non-parametric. Parametric models provide out-of-sample extensions for free, that is, the transformation applies tonew data points. The output of nonparametric methods is not re-ally a model or transformation, but rather the transformed data.As the transformation remains implicit, the generalization to newdata points is not straightforward.

The generic idea hidden behind all embedding techniques isthat they should produce LD representations that preserve mean-ingful structural properties of data. In general, these propertiesformalize proximity relationships. They can be similarities (adja-cencies, dot products) or dissimilarities (distances, angles).

The ancestor of all embedding methods is undoubtedly PCA[33]. The method can be interpreted in various ways (maxi-mal variance preservation, minimal encoding/decoding error, to-tal least squares, variable decorrelation). It can be carried out withvarious spectral decompositions (eigenvalues and singular values)or neural algorithms [32]. PCA is a linear projection technique.Refined cost functions have led to many variants, such as projec-tion pursuit [25]. It is noteworthy that PCA yields exactly thesame results as classical metric multidimensional scaling (MDS)[46]. Classical MDS is actually the dual form of PCA, in whichthe covariance matrix is replaced with the centered Gram matrixof dot products. Hence, classical MDS (and PCA) optimally pre-serve dot products.

More recent forms of MDS rather implement the closely re-lated principle of distance preservation. Various cost functions(often called stress) allow a wide variety of methods to emerge[16]. The most famous is probably Sammon’s nonlinear map-ping [36]. Like many MDS variants, the stress optimization withan elaborate gradient descent technique produces nonlinear em-beddings. Yet another variant is curvilinear component analysis(CCA) [19], which behaves in a radically different way as otherMDS-like techniques.

Quite naturally, the input data for MDS often comprise Eu-



https://www.researchgate.net/publication/239065741_A_Nonlinear_Mapping_for_Data_Structure_Analysis?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/234113288_The_Comparative_Method?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==


https://www.researchgate.net/publication/232453753_The_OrganizedDisorganized_Typology_of_Serial_Murder_Myth_or_Model?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/225107077_Discussion_of_a_set_of_points_in_terms_of_their_mutual_distances?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/222460491_Minor_components_and_linear_neural_networks_Neural_Networks_5_927-935?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/52012631_On_Lines_and_Planes_of_Closest_Fit_to_Points_in_Space?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==


https://www.researchgate.net/publication/10973124_Linking_commercial_burglaries_by_modus_operandi_Tests_using_regression_and_ROC_Analysis?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==


https://www.researchgate.net/publication/3302253_Curvilinear_Component_Analysis_a_Self-Organizing_Neural_Network_for_Nonlinear_Mapping_of_Data_Sets?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

clidean distances, which are easily converted into the correspond-ing dot products. Actually this is not mandatory at all and othertypes of distances can be used as well, such as the so calledgeodesic distances [40, 26]. The transformation of the refer-ence distances in the stress function can also be identified in anoptimization process. This is the principle of nonmetric MDS[38, 23]: a nonparametric and monotonic distance transformationis identified with either isotonic optimization or semidefinite pro-gramming [44].

Dot products or distances are not the only structural featuresthat are useful in embedding techniques. Recent successful meth-ods are based on similarity preservation, such as stochastic neigh-bor embedding (SNE) [22] and its variants [41, 42]. Yet anotherpossibility close to the world of artificial neural networks is thewell-known self-organizing map [43]. Though the SOM is not anembedding technique in the usual sense, it remains a very pow-erful visualization tool. The idea is similar to the one of PCA:fit a plane within the data cloud. In this case the plane is a kindof articulated and elastic grid, which can deform itself to repro-duce curved shapes and clusters. This shows that the SOM worksthe other way round, compared to most embedding techniques.Instead of embedding data into a LD space, a predefined two-dimensional grid is fitted in the HD space. The heuristic algorithmof the SOM has been reformulated in more principled methods,such as the generative topographic mapping [9], which relies onan expectation-maximization procedure.

The review of the state of the art would not be completewithout mentioning spectral embeddings. This family of meth-ods shares a common framework: they apply classical MDS in afeature space, that is, nonlinearly transformed data variables. Thisframework extends classical MDS to nonlinear embeddings whilekeeping most of its advantages, like computational simplicity andconvex optimization. The oldest method in the family is kernelPCA [37].

Isomap [40] is classical MDS applied to geodesic distancesinstead of Euclidean ones. Assuming that data are sampled froma manifold, the shortest paths in a neighborhood graph are com-puted in order to approximate the manifold geodesics. Usinggeodesic distances instead of Euclidean ones in classical MDSis expected to lead to better unfolding and hence to better em-bedding [26]. Another well known spectral method is locally lin-ear embedding (LLE) [34]. The idea here is to first determineweights in order to reconstruct each data point from a mixtureof its neighbors and secondly to use these weights to compute alow-dimensional representation of all points.

3.3 Spherical-based VisualizationSpherical visualization is a commonly used visualization

technique for displaying relationships between objects, especiallywhen the objects come with their geographical locations on Earth,or their positions in space [28]. If the geographical locations arenot available or, as in our case, is not the sole attribute, sphericalvisualization maintains some useful features when compared torendering on the plane. For example, some systems [29, 30] mapthe objects to the sphere to achieve focus+context distortion.

Mapping objects to a spherical surface instead of a 2D planehas several advantages. First of all, unlike a 2D plane, the spheri-cal surface does not have a boundary, which provides an intuitiveGestalt association of relationships between nearby objects [11].

Secondly the spherical mapping helps avoiding the “corner effect”of 2D mappings [12] although projection from HD data space to aLD space will potentially result in distortions [6]. When the num-ber of points is large, the method also lessens the over-plottingproblem by having a larger projection space (see Fig.3). Further-more, the design aligns with the “Focus + context” principle [39]that hides distant data points without losing context.

Figure 3. Spherical surface - given that the area of a squared 2D visual

display (width = length = 2r) is 4r2, the surface of an interactive sphere that

can be fit in the display is 4πr2.

The methods described in Section 3.2 can be applied to gen-erate 2D or 3D embeddings of multi-dimensional data. In thispaper we want to generate embeddings that lie on the surface ofa sphere. A 3D spherical mapping of multi-dimensional data canbe achieved in many ways. In theory any Dimenionlity Reduction(DR) method can be extended to fit the purpose by adding con-straints. For example, a “dummy point” that has the same distanceto all the other points can be added to a MDS approach to achievea circular or spherical embedding [10, 8, 27]. Also, methods weredeveloped to generate spherical embeddings using a SOM [45].

In SSE we implement the Cox and Cox’s Spherical MDS ap-proach [17]. Compared to other spherical embedding approachesthat requires the generation of an artificial “dummy point” [10],[8] and [27] or tuning of parameters[45], Cox and Cox’s methodachieves non-metric MDS on a sphere in a simpler manner. SSEextends the capability of Cox and Cox’s method by integrating itwith a series of interactions and detail-on-demand visualizations,and embedding the spherical view in a multiple-coordinated-viewwhere the analyst can examine the data from different perspec-tives. Details of the methods can be found in the section 4.3.

4. Visual Analytics PipelineFig. 2 illustrates the visual analytics pipeline of SSE: it takes

crime and incident reports as input, processes the reports by ex-tracting relevant features and concepts from the reports, combinesthem with the headings of structured fields to form headings ofa CCA table. Once the headings are generated, the system willcheck through each crime report to fill in the corresponding val-ues in the CCA table. These values are used to compute thepairwise similarities between crime cases, resulting in a distance(dissimilarity) matrix. The distance matrix is then fed into thespherical MDS algorithm that generates spherical coordinates foreach of the crime reports. The coordinates are used to map thecrime reports to the spherical surface. The 3D sphere is interac-tive; the user can move the sphere for data exploration. Detail-on-demand analysis such as hot spot identification and visualization,and common feature analysis and visualization are also supportedby the system.



https://www.researchgate.net/publication/285599593_A_Global_Geometric_Framework_for_Nonlinear_Dimensionality_Reduction?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==


https://www.researchgate.net/publication/284035345_Nonlinear_Dimensionality_Reduction_by_Locally_Linear_Embedding?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/259005676_Sphere-based_Information_Visualization_Challenges_and_Benefits?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==




https://www.researchgate.net/publication/228339739_Viualizing_data_using_t-SNE?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==



https://www.researchgate.net/publication/222850290_Visualizing_distortions_and_recovering_topology_in_continuous_projection_techniques?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/221619363_GTM_A_Principled_Alternative_to_the_Self-Organizing_Map?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/220945173_Design_method_of_interaction_techniques_for_large_information_spaces?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/220500224_Nonlinear_Component_Analysis_as_a_Kernel_Eigenvalue_Problem?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/220320050_Information_Retrieval_Perspective_to_Nonlinear_Dimensionality_Reduction_for_Data_Visualization?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==



https://www.researchgate.net/publication/24061627_The_analysis_of_proximities_Multidimensional_scaling_with_an_unknown_distance_function_II?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/13618327_Methods_for_spherical_data_analysis_and_visualization?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==



https://www.researchgate.net/publication/4082326_Unsupervised_learning_of_image_manifolds_by_semidefinite_programming?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/4050111_Corner_effect_in_double_and_triple_gate_FinFETs?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/3723287_H3_Laying_out_Large_Directed_Graphs_in_3D_Hyperbolic_Space?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/3651286_The_Eyes_Have_It_A_Task_by_Data_Type_Taxonomy_for_Information_Visualizations?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/2934728_Stochastic_Neighbor_Embedding?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/2551965_Curvilinear_Distance_Analysis_versus_Isomap?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==


Figure 2. The Visual Analytics Pipeline for Comparative Case Analysis (CCA): 1) Given a collection of crime reports our approach starts with extracting

important features from the data, resulting in a CCA table where each record encodes the key feature values of a crime record; 2) similarities (distances)

between crime records are computed using a composite distance measure, resulting in a distance matrix; 3) the distance matrix is then fed into a spherical

MDS algorithm that computes the spherical coordinates for each crime record on the 3D spherical surface; 4) the coordinates are used to map crime records to

the surface of the sphere; 5) Detail-on-demand analysis and visualization: (top) hot spots extraction and visualization, (bottom) common feature extraction and

visualization.

4.1 Feature Extraction

The features extracted from the crime reports are basedon predefined patterns built to capture sequences like unsecuredpremises or entered through rear door. Compared with bag-of-word stemming features computed from documents [35], the ex-tracted features are characteristic for the analyzed description ofthe crime procedure. This approach reflects the fact that two crimereports referring to offenders entering through a rear door at nightare more similar to each other than to a report omitting the timeof day. The patterns generating the features and sequences fromthe text are matched to a lemmatized version of the crime reportsso not to miss any matches because of word inflections. The re-sult is organized in groups of concepts, which are predefined byexperts from the field of the analyzed data. These concepts can beparts of a building, makes and models of vehicles, and more. Forexample, having crime reports dealing with burglary, the conceptscontain parts of buildings, expressions for the time of the day, andtypically-used tools. This expert knowledge makes sure that theextracted features are relevant to the analyzed data and carry im-portant information which not only the machine but also the ana-lyst understands. In addition, the features represent chunks of theanalyzed crime report, can be interpreted by a machine as well ashumans, and make the information extraction process transparentto an analyst working with the generated data.

The extraction process is based on four main components:1. Preprocessing: the reports are normalized, a cleanup removesall characters which do not carry any information (non printableand control characters). Based on a dictionary, abbreviations areexpanded. 2. POS Tagging: a POS tagger determines the partof speech of each token in the report. 3. Lemmatization: a rule-based lemmatizer computes the lemma of each token. 4. PatternMatcher: the pattern matcher finds occurrences of the predefinedpatterns (including permutations). It extracts the text matches,and generates the corresponding feature vector per crime report.The resulting feature vector is similar to a classical term featurevector. Instead of single word terms, it keeps track of the occur-rences of multi-word terms extracted from the predefined patterns.

4.2 Distance ComputationWhen computing the dissimilarity between two crime re-

ports we apply a so-called composite distance measure to handlethe heterogeneous structure of the reports. As mentioned above,each report is described by a feature vector consisting of differentdata types such as time and geographically related information ofthe incident as well as extracted concepts of the modus operandifield. Our distance measure combines the different feature typesinto a single similarity value.

To combine feature types a feature type dependent distancefunction is applied to every considered feature. For time-relatedfeatures, the similarity is measured by the time difference, forgeo-related features the geographical distance between two loca-tions is considered, and for the modus operandi field the distanceis computed by counting the number of common concepts. Moreformally, the distance between the concepts {c0, . . . ,cn−1} of themodus operandi MOm and MOn fields are computed as follows:

dist(MOm,MOn) =1

#concepts·

n−1

∑i=0

dist(MOmci,MOnci

)

where #concepts refers to the number of non-missing conceptsand

dist(MOmci,MOnci

) =

{0 i f mci = nci

1 else

As indicated above, a concept is ignored if both crime recordshave a missing value.

After computing and normalizing the distances betweenthe separate features, the resulting distance between two crimerecords is computed by averaging the individual distances. Asindicated in Fig. 2, the result of the distance computation stepis a matrix containing all pair-wise distances between all crimereports.

4.3 Spherical MDSSSE implements Cox and Cox’s Spherical MDS algo-

rithm [17] for mapping the distance between documents to the vi-sual display. The algorithm adapts the original non-metric MDS




https://www.researchgate.net/publication/200773081_A_Vector_Space_Model_for_Automatic_Indexing?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

of Kruskal [24] to give a configuration of points that lie on thesurface of a sphere.

Given a set of n objects under consideration and the dissimi-larity between pairs of objects in the multidimensional data spacegiven by {δi j}, and the Euclidean distance between correspond-ing points in the lower dimensional embedding given by {di j},Kruskal’s non-metric MDS tries to “best” approximate the dis-tance between objects in the data space to the embedding spaceby minimizing the cost function

S =

√√√√∑(di j − di j)2

∑d2i j

where di j is the primary monotone least squares regressionof {di j} on {δi j}.

Spherical MDS maps points onto a surface of a unit sphere.Each point i on a surface of the sphere is determined by two anglesθi and φi (see Fig. 4). Assuming − π

2 ≤ θi ≤ π

2 and 0 ≤ φi ≤ 2π

for the spherical angles, the point i has the Cartesian coordinates

xi = cosθi sinφi (1)

yi = sinθi sinφi (2)

zi = cosφi (3)

Figure 4. Spherical Coordinates

The distance between two points is defined as the shortest arclength along the great circle which passes through the two points(geodesic distance on the sphere).

Figure 5. Geodesic distance between two points on a sphere

The Euclidean distance dei j between two points i and j on a

sphere is given by

dei j = (xi − x j)

2 +(yi − y j)2 +(zi − z j)

2

substituting Eqs. 1-3, we have

dei j

2 = 2−2sinφi sinφ j cos(θi −θ j)−2cos(φi)cosφ j

By applying the law of cosine to the triangle depicted in Fig.5, where ϕ is the planar angle for the unit sphere r = 1

dei j

2 = 2r2 −2r cosϕ

the geodesic distance on the sphere dgi j is given by

dgi j = arccos

2−dei j

2

2

Starting from an initial configuration of points (which is gen-erated by a metric MDS), a configuration giving minimal stresscan be found using the method of steepest decent. As Cox andCox pointed out, since there is a one-to-one increasing relation-ship between the Euclidean distance and the arc length (geodesicdistance) either of them can be used for the stress minimization.The resulting MDS solution will not be affected.

4.4 InteractionsWhile a variety of 3D interaction devices are available to in-

teract with 3D visualizations [18], we wanted our application to bedevice-independent and work both with and without novel hard-ware, making the application more accessible for those with stan-dard interaction devices. Using simple, cursor-based interaction,the sphere can be rotated in both the y- and x-axis by clicking anddragging the cursor across the sphere. Scroll functionality, suchas a two-finger swipe on a touch-pad, can be used to zoom in, re-vealing further relationships in dense regions. These gestures alsolend themselves well to touch devices, where zoom is controlledusing the de-facto pinch gesture.

We also investigated mid-air gestures through the use of aLeap Motion device [3]. Users could rotate and zoom into thesphere by holding their hand over the device and moving side-to-side and backwards-and-forwards respectively. After some initialnovelty, we found the interaction to be clumsy. With no point ofreference, it was difficult to rotate the sphere without accidentallyzooming, it was also no surprise that fatigue soon set in. Onegesture that did work well was the ability to change the opacity ofthe sphere by making a grabbing gesture.

Since the data points are small, and dense in some regions,we recommend cursor-based interaction, where we also have theaffordance of hovering over data points on the sphere. This can beused to reveal details of a crime, such as showing when and whereit occurred on the geographic and temporal visualizations, andshowing a summary of the crime details, including the offender’smodus operandi. Clicking on a crime highlights its nearest neigh-bors, using a three-tier color scale to group projected distance.These points are also placed on the geo-spatial and temporal viewsto allow for potential new relationships to be discovered.

We intend the smooth and fluid interaction with the sphere toencourage playful exploration, discovering new and unexpectedrelationships in a serendipitous manner. We combine this withsmooth animations that allow users to view data items, selectedthrough the search and retrieval interface, at the front-center byrotating the sphere into position. This helps the user to maintaincontext as they move from one data item to the next. We alsouse animation to move data points to their new locations when theprojection is changed.

4.5 Visualizations for Similarity AnalysisWe design the visualization components of SSE in such a

way that different types of analysis can be performed throughcoordinated views. According to the National Policing Improve-ment Agency [31] there are four main types of Crime Pattern



Figure 6. Similarity Analysis based on Modus Operandi: the pattern shows a group of crimes that are related to theft inside non-residential dwellings such

as offices and school buildings. The geo-map highlights a potential hot spot of the crime type. The time line visualization indicates this type of crime cases is

evenly distributed over time. The third crime record of the search query contains the term “office” but relates to a car crime that is projected to a distant location.

Analysis: hot spot identification, crime trend identification, crimeseries identification and general profile analysis. We use thesetasks to drive the design of our different views:

3D Sphere: The spherical embedding provides an overview ofthe similarities between crime cases. It is useful in understandingglobal patterns in the data such as groups and outliers. The usercan explore the data by various interactions as introduced inSection 4.4. It also allows the user to select a particular documenton the sphere and see the detailed modus operandi. Similardocuments will be highlighted in corresponding views.Feature Selection: SSE implements a feature selection panelthat allows the user to select different feature set combinations,namely MO (modus operandi), spatial (geo-spatial locations), andtemporal (time and date) of the crimes. The similarities betweencrime cases are always computed based on selected features.By adding or removing a feature for similarity computation, theanalyst can test different hypotheses.Geo-spatial visualization: The system also includes a map viewthat shows the distribution of crimes on a geographical map. Byprojecting crimes to the map, the analyst can easily identify crimehot spots and drill down to individual crime cases that happenedin a particular geographical region.Time-line visualization: A time-line visualization is imple-mented to show the distribution of crimes over time for crossreferencing between geo-spatial and temporal relations in thedata.Commonality Analysis: The user can select a group of docu-ments and see the common features in a word cloud visualization.

This provides an overview of the key terms in the data and helpsthe analyst to identify clues for reasoning and sense-making.Search and Filtering: The search panel allows the analyst toperform keyword search. Details of the matching documents canbe viewed in the panel below. The filtering helps the analyst toquickly focus on a subset of documents that are of interest.Linked Views: all the views mentioned above are linked. Changein one view will trigger corresponding changes in other views.For example, selecting a document in the spherical surface willupdate the map view.

5. Use Cases and User EvaluationIn this section we present two use cases of the system and

some feedback from the end user. We use 1588 anonymizedreal crime records as input data of the use cases and use them todemonstrate the functionality of the system to the end user. Thedata contains 27 structured fields recording various types of infor-mation about the crime plus a free text field that stores the modusoperandi of each crime.

5.1 Similarity Analysis Based on Modus OperandiFirst we use the features extracted from the modus

operandi for similarity analysis. Extracted features from themodus operandi field include concept based features such asMO Position (rare, front, size of the entry), MO FixtureType(window, door, etc.), MO fixtureMaterial (plastic, metal, etc.),and MO search (tidy, untidy) as well as key terms contained inthe modus operandi descriptions such as alarmed, dog, and car-



Figure 7. Similarity analysis based on a different subset of features: given the same document, the neighborhood changes when different feature sets are

selected for comparison.

key.Once the data has been processed and projected onto the

spherical surface, one can easily see a “spoon” shaped pattern(cluster) (see Fig. 6). By clicking on one of the center pointsof the cluster and checking the word cloud visualization of theneighborhood crimes, it is not difficult to see that most of thecrime cases occurred in non-residential dwelling such as offices,school buildings and commercial premises. The word cloud helpswith identifying more commonalities between the crimes — termsstole and office frequently occur in these crime cases. This leadsto the hypothesis that the pattern is related to burglary/theft in-side non-residential dwellings. By double-checking the modusoperandi descriptions of the cluster members, the hypothesis canbe further confirmed.

The geographical map at the top left panel helps the analystsunderstand the geographical distribution of the crime cases. Thesmall region at the southern part of the city looks like a hot spotof theft in commercial premises. This can lead to further investi-gation. For example, the link between the specific type of crimeto the highway next to the area, the distribution of commercialvs. residential buildings in the area, or the temporal pattern of thecrimes (office or non-office hour, week or week-end days), etc.

The analyst can also search for other crime cases that containthe keyword office and see whether there are other types of crimethat are related to commercial premises. In this case, the thirditem in the query result is projected some distance away from theidentified pattern (see Fig. 6). This is not surprising, becausealthough the modus operandi does contain the term premise, thecrime itself is a car crime that happened outside the building —the offender stole the car key in an office and drove the car away.

5.2 Similarity Analysis Based on a Different Sub-set of Features

Next we investigate the embeddings of the same data us-ing different combinations of three feature sets that are imple-mented by the current version of the SSE tool: MO features(modus operandi), as described in the previous use case, spa-tial features, which includes the latitude and longitude of thecrime location, and temporal features, which include the date andtime when the crime happened. We randomly select a crime re-port (ID “99DE1/14452/04”) from the data as an “anchor” pointand search for similar cases using different feature set combina-tions. We look at 3 different combinations: MO, MO+Spatial, andMO+Spatial+Temporal.

Fig. 7 shows the resulting embeddings. Reasoning can be

made based on some of the comparisons. For example, from themap view, the MO-only projection does not seem to have a stronggeo-pattern. The crimes that are similar to the anchor documentare at distant geographical locations. Adding spatial features doesnot result in significant changes in the spread of the region, apartfrom filtering out a few documents that happened up north (high-lighted in the red rectangular frame).

After adding temporal features to the existing two featuresets, the analyst can identify a different set of similar documents.Most of the crimes that are similar to the anchor crime seem tohave occurred within a much smaller region. By analyzing thecommonalities between these document sets, further hypothesescan be generated. For example, these documents could be a set ofsimilar burglary cases that happened in a park area during night,assuming the term “dark” or “darkness” appears in several docu-ments, and one of the document contains “broken” and “lamp”.The “broken lamp” could be a key cue to the cases. Identificationof these types of causal relationships in the data is crucial to futurecrime prevention and reduction.

5.3 User EvaluationIn order to validate our design and assess the usability of the

tool, an evaluation session was carried out with criminal intelli-gence analysts. Three groups participated in the evaluation: onefrom a UK police force (2 analysts), one from a Belgium fed-eral police force (3 analysts) and one from a Belgium local policeforce (2 analysts). During the session the tool was introduced toeach group separately alongside 11 other prototype tools that havebeen developed for the project for different analysis tasks.

The evaluation session was interactive. A demo of the toolis given in front of the end user group. The analysts were en-couraged to ask questions and provide feedback during the demosession. At the end of the session each analyst had to complete aquestionnaire independently. The questionnaire consists of threetypes of questions: usability, analytical work, and satisfaction.

The feedback was encouraging. The analysts welcomed theexploratory nature offered by the interactivity of the tool and wereable to see its potential to help with the tasks of CCA. Theythought the tool is easy to understand and use. They liked themultiple-linked visualizations that allows them to inspect datafrom different perspectives. The analyst also felt that the under-lying similarity computation could be used effectively to detectwhen particular modus operandi traits reoccur. For example, thetool could alert the analyst that a particular trait occurred ten timesin the last fortnight, or that a trait that was frequent last summer



is becoming frequent again.One rather unexpected feedback was on the word cloud vi-

sualization. One may argue a list of frequently occurred words isa better way of summarizing a documents collection than a wordcloud. The analysts said they prefer the latter. The word couldvisualization seems to be a natural tool for sense-making and rea-soning – often when their eyes see a word cloud, their brain auto-matically start to piece words together to form stories.

The questionnaire result also looks promising. Among the12 tools that have been evaluated, SSE received one of the highestoverall score (4.25 out of 5) , as well as satisfaction (4.83 out of 5),expectation (4.56 out of 5) and evidential reasoning scores (4.61out of 5).

The analysts could also foresee a number of ways in whichthe tool could be improved. For example, they would like thesystem to reveal more details of the underlying similarity compu-tation and provide a summary of why two crimes are consideredeither similar or different. This enables analysts to validate thecomputation and could be accompanied by functionality for mod-ifying and correcting the similarity measure. A simplified methodfor this is semantic interaction [20] where analysts use drag-and-drop to reposition data points on the surface of the sphere, thusadjusting the underlying similarity weightings.

6. Conclusion and Future WorkIn this paper we introduced the Spherical Similarity Ex-

plorer, a visual analytics tool for Comparative Cases Analysis.The system addresses several challenges in analyzing criminal in-telligence data, including entity extraction and concept genera-tion from textual fields, combining both “soft ” (features extractedfrom textual fields) and “hard ” (features that exist in structuredfields) information for similarity analysis, and providing effec-tive visual representations of the data. A visual analytics pipelineis designed and developed to automate the CCA process. Thepipeline incorporates a series of text analytics techniques and in-teractive visualizations to automate the manual CCA process andsupport visual data exploration. A novel visualization techniquethat integrates the spherical MDS approach with interactions tosupport data exploration is proposed to avoid the over-plottingproblem of 2D visual embeddings and to support flexible navi-gation. Various visualization and interaction techniques are inte-grated into the system to facilitate filtering, hot spot identification,and commonality analysis. Positive feedback was collected frompolice analysts, as well as further development directions

For future work we would like to improve our system ac-cording to some of the feedback from the end users. In particu-lar, we intend to provide more transparency, flexibility and usercontrol of the similarity computation, allow weighting on differ-ent features, highlight not only commonalities but also differencesbetween crime cases, integrate semantic interactions with the sys-tem, and develop more filtering facilities to support “slicing anddicing” of the data before comparison.

In addition to the above mentioned functionality, we plan toextend the time line visualization to support more sophisticatedtemporal analysis. Visualizing the overlapping features betweendifferent sets of documents is another future direction that wewould like to explore. We also plan to extend the spherical map-ping by adding dynamic Azimuthal projections. This would allowthe analyst to compare the absolute distances from one particular

document to all the other documents on the spherical surface. Al-though the projection will need to be recomputed every time theanalysis changes their focus, accurate mapping could be helpfulwhen a criminal investigation focuses on a particular crime.

The tool presented in this paper is a system prototype as partof the VALCRI project. Our next goal is to improve the functional-ity, usability, scalability and robustness of the system and developa fully functional software. Further user evaluations of the systemwill be conducted to guarantee the usability and effectiveness ofthe tool.

AcknowledgmentsThis work was supported by the EU project Visual Analyt-

ics for Sense-making in Criminal Intelligence Analysis (VALCRI)under grant number FP7-SEC-2013-608142. The authors wouldlike to thank Dr Olaf Chitil for his help with code debugging andproof read of the paper.

References[1] Ibm i2, http://www-01.ibm.com/software/uk/industry/i2software/,

last retrieved 25/03/2015.[2] In-spire, http://in-spire.pnnl.gov/, last retrieved 25/03/2015,

2015.[3] Leap motion device, https://www.leapmotion.com/, last re-

trieved 25th march,2015.[4] Visual analytics for sense-making and criminal intelli-

gence analysis, http://www.valcri.org/, last retrieved 25thmarch,2015.

[5] Working Manual of Criminal Law. Carswell Legal Pubns,Mar 2000.

[6] M. Aupetit. Visualizing distortions and recovering topologyin continuous projection techniques. Neurocomputing, 70(7-9):1304–1330, 2007.

[7] C. Bennell and D. V. Canter. Linking commercial burglariesby modus operandi: tests using regression and ROC analy-sis. Science & Justice, 42(3), 2002.

[8] P. M. Bentler and D. G. Weeks. Restricted multidimen-sional scaling models. Journal of Mathematical Psychology,17:138–151, 1978.

[9] C. Bishop, M. Svensen, and C. Williams. GTM: A princi-pled alternative to the self-organizing map. In M. Mozer,M. Jordan, and T. Petsche, editors, Advances in NeuralInformation Processing Systems (NIPS 1996), volume 9,pages 354–360. MIT Press, Cambridge, MA, 1997.

[10] B. Bloxom. Constrained multidimensional scaling innspaces. Psychometrika, 43(3):397–408, 1978.

[11] R. Brath and P. Macmurchy. Sphere-based information vi-sualization: Challenges and benefits. In Information Visual-isation (IV), 2012 16th International Conference on, pages1–6, July 2012.

[12] A. Burenkov and J. Lorenz. Corner effect in double andtriple gate finfets. In European Solid-State Device Research,2003. ESSDERC ’03. 33rd Conference on, pages 135–138,Sept 2003.

[13] D. V. Canter, L. J. Alison, E. Alison, and N. Wentink. Theorganized/disorganized typology of serial murder: Myth ormodel? Psychology, Public Policy, and Law, 10(3):293–320, September 2004.

[14] D. Collier. The comparative method. In POLITICAL SCI-



http://www-01.ibm.com/software/uk/industry/i2software/

http://in-spire.pnnl.gov/

http://www.leapmotion.com/

http://www.valcri.org/





https://www.researchgate.net/publication/254005323_Semantic_Interaction_for_Visual_Text_Analytics?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

https://www.researchgate.net/publication/234113288_The_Comparative_Method?el=1_x_8&enrichId=rgreq-9b1651fd31fdb38e0a42f1d17d9a9cbf-XXX&enrichSource=Y292ZXJQYWdlOzMxMjU0MTg0NjtBUzo0NTMxMDcxMDU3MDE4OTBAMTQ4NTA0MDU2MDQ2Mg==

























ENCE: THE STATE OF DISCIPLINE II, pages 105–118,1993.

[15] N. Cope. Intelligence led policing or policing led intelli-gence?: Integrating volume crime analysis into policing. Br.J. Criminol., 2004.

[16] T. Cox and M. Cox. Multidimensional Scaling. Chapman &Hall, London, 1995.

[17] T. F. Cox and M. A. Cox. Multidimensional scaling on asphere. Communications in Statistics - Theory and Methods,20(9):2943–2953, 1991.

[18] M. Csisinko and H. Kaufmann. Towards a universal imple-mentation of 3d user interaction techniques. In Mixed Re-ality User Interfaces: Specification, Authoring, Adaptation(MRUI’07), pages 17–24, 2007. Vortrag: IEEE Virtual Re-ality 2007, Charlotte, NC, USA; 2007-03-10 – 2007-03-14.

[19] P. Demartines and J. Herault. Curvilinear component anal-ysis: A self-organizing neural network for nonlinear map-ping of data sets. IEEE Transactions on Neural Networks,8(1):148–154, Jan. 1997.

[20] A. Endert, P. Fiaux, and C. North. Semantic interaction forvisual text analytics. In Proceedings of the SIGCHI Con-ference on Human Factors in Computing Systems, CHI ’12,pages 473–482, New York, NY, USA, 2012. ACM.

[21] G. W. Furnas. Generalized fisheye views. In Proceedingsof the SIGCHI Conference on Human Factors in ComputingSystems, CHI ’86, pages 16–23, New York, NY, USA, 1986.ACM.

[22] G. Hinton and S. Roweis. Stochastic neighbor embedding.In S. Becker, S. Thrun, and K. Obermayer, editors, Advancesin Neural Information Processing Systems (NIPS 2002), vol-ume 15, pages 833–840. MIT Press, 2003.

[23] J. Kruskal. Multidimensional scaling by optimizing good-ness of fit to a nonmetric hypothesis. Psychometrika, 29:1–28, 1964.

[24] J. Kruskal. Nonmetric multidimensional scaling: A numer-ical method. Psychometrika, 29(2):115–129, 1964.

[25] J. Kruskal. Toward a practical method which helps uncoverthe structure of a set of multivariate observations by findingthe linear transformation which optimizes a new index ofcondensation. In R. Milton and J. Nelder, editors, StatisticalComputation. Academic Press, New York, 1969.

[26] J. Lee and M. Verleysen. Curvilinear distance analysis ver-sus isomap. Neurocomputing, 57:49–76, Mar. 2004.

[27] S.-Y. Lee and P. M. Bentler. Functional relations in multi-dimensional scaling. British Journal of Mathematical andStatistical Psychology, 33(2):142–150, 1980.

[28] P. Leong and S. Carlile. Methods for spherical data anal-ysis and visualization. Journal of neuroscience methods,80(2):191–200, 1998.

[29] T. Munzner. H3: Laying out large directed graphs in 3dhyperbolic space. In Information Visualization, 1997. Pro-ceedings., IEEE Symposium on, pages 2–10. IEEE, 1997.

[30] L. Nigay and F. Vernier. Design method of interaction tech-niques for large information spaces. In Proceedings of theworking conference on Advanced visual interfaces, pages37–46. ACM, 1998.

[31] NPIA. National policing improvement agency: Professionalpractice on analysis, 2008.

[32] E. Oja. Principal components, minor components, and linear

neural networks. Neural Networks, 5:927–935, 1992.[33] K. Pearson. On lines and planes of closest fit to systems of

points in space. Philosophical Magazine, 2:559–572, 1901.[34] S. Roweis and L. Saul. Nonlinear dimensionality reduc-

tion by locally linear embedding. Science, 290(5500):2323–2326, 2000.

[35] G. Salton, A. Wong, and C. S. Yang. A vector space modelfor automatic indexing. Commun. ACM, 18(11):613–620,1975.

[36] J. Sammon. A nonlinear mapping algorithm for datastructure analysis. IEEE Transactions on Computers, CC-18(5):401–409, 1969.

[37] B. Scholkopf, A. Smola, and K.-R. Muller. Nonlinear com-ponent analysis as a kernel eigenvalue problem. NeuralComputation, 10:1299–1319, 1998. Also available as tech-nical report 44 at the Max Planck Institute for BiologicalCybernetics, Tubingen, Germany, December 1996.

[38] R. Shepard. The analysis of proximities: Multidimensionalscaling with an unknown distance function (parts 1 and 2).Psychometrika, 27:125–140, 219–249, 1962.

[39] B. Shneiderman. The eyes have it: A task by data type tax-onomy for information visualizations. In Proceedings of the1996 IEEE Symposium on Visual Languages, VL ’96, pages336–, Washington, DC, USA, 1996. IEEE Computer Soci-ety.

[40] J. Tenenbaum, V. de Silva, and J. Langford. A global ge-ometric framework for nonlinear dimensionality reduction.Science, 290(5500):2319–2323, Dec. 2000.

[41] L. van der Maaten and G. Hinton. Visualizing data using t-SNE. Journal of Machine Learning Research, 9:2579–2605,2008.

[42] J. Venna, J. Peltonen, K. Nybo, H. Aidos, and S. Kaski. In-formation retrieval perspective to nonlinear dimensionalityreduction for data visualization. Journal of Machine Learn-ing Research, 11:451–490, 2010.

[43] C. von der Malsburg. Self-organization of orientation sensi-tive cells in the striate cortex. Kybernetik, 14:85–100, 1973.

[44] K. Weinberger and L. Saul. Unsupervised learning of im-age manifolds by semidefinite programming. InternationalJournal of Computer Vision, 70(1):77–90, 2006.

[45] Y. Wu and M. Takatsuka. Spherical self-organizing map us-ing efficient indexed geodesic data structure. Neural Net-works, 19(6-7):900–910, 2006.

[46] G. Young and A. Householder. Discussion of a set of pointsin terms of their mutual distances. Psychometrika, 3:19–22,1938.



The author has requested enhancement of the downloaded file. All in-text references underlined in blue are linked to publications on ResearchGate.The author has requested enhancement of the downloaded file. All in-text references underlined in blue are linked to publications on ResearchGate.























































































spherical similarity explorer for comparative case analysis · with a series of analysis,...

Documents