using pivots to explore heterogeneous collections a case study in musicology daniel alexander smith...
TRANSCRIPT
![Page 1: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/1.jpg)
Using Pivots to Explore Heterogeneous CollectionsA Case Study in Musicology
Daniel Alexander Smith8 December 2009
![Page 2: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/2.jpg)
musicSpace
http://mspace.fm/projects/musicspace
• IAM Group, School of Electronics and Computer Science
• Music, School of Humanities
2
![Page 3: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/3.jpg)
Outline• How musicologists use data
• Limitations of existing approaches
• Our data extraction and integration methodology
• Interface walkthrough
3
![Page 4: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/4.jpg)
musicSpace Tasks• Triage data partners sources
• Extract information
• Map data sources to schemas/ontologies
• Produce interface over aggregated data
• Customise interface based on feedback
4
![Page 5: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/5.jpg)
Data in Musicology
5
![Page 6: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/6.jpg)
Musicologists consult many data sources
6
![Page 7: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/7.jpg)
. . . but what if they could use just one?
7
![Page 8: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/8.jpg)
Intractable research questions• Which scribes have created manuscripts of a
composer’s works, and which other composers’ works have they inscribed?
• Which poets have had their poems set to music by Schubert, which of these musical settings were only published posthumously, and where can I find recordings of them?
• Which electroacoustic works were published within five years of their premier?
8
![Page 9: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/9.jpg)
Why they are intractable (1)• Need to consult several sources
• Metadata from one source cannot be used to guide searches of another source
• Solution: Integrate sources
9
![Page 10: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/10.jpg)
Why they are intractable (2)• They are multi-part queries, and need to be broken
down with results collated manually
• Requires pen and paper!
• Solution: Optimally interactive UI
10
![Page 11: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/11.jpg)
Why they are intractable (3)• Insufficient granualrity of metadata and/or search
option
• Solution: Increase granularity
11
![Page 12: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/12.jpg)
Metadata Extraction
12
![Page 13: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/13.jpg)
Previous work• Comb-e-chem modelled Chemistry data
• We use similar approach
• Translated this work to the arts
• Musicology modelled using Semantic Web technologies
13
![Page 14: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/14.jpg)
Musicology Data Sources• Disparate data
• How to pull them together and view on demand
14
![Page 15: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/15.jpg)
15
Public British Library Music Collections British Library Sound Archive Cecilia Copac RISM (UK and Ireland)
Commercial
Grove Music Online Naxos Music LibraryRILM
Future? Alexander Street Press MusicOnlineCHARM‘Personal’ datasets
musicSpace Data Partners
![Page 16: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/16.jpg)
Data and Info Management problems• Sources allow searching, but not over everything
• Data export (MARC typically) shows extra fields, e.g. characters in opera, document types hidden amongst metadata
• Sometimes viewable on original site, but not searchable
• Offering extracted metadata already a benefit with one source
16
![Page 17: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/17.jpg)
Grove Extraction Example• More complicated, as Grove is a full text
encyclopaedia
• Some digitisation via Grove Music Online
• Weak semantic metadata extraction
• Thus we performed some data entry
17
![Page 18: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/18.jpg)
Grove Works Lists Source Data
18
![Page 19: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/19.jpg)
Works List Metadata Tool
19
![Page 20: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/20.jpg)
Data Integration
![Page 21: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/21.jpg)
Integration• Domain Expert + Technologist partnership
• This will be case for some time now
• Technology to best automate tasks to make domain expert’s job less onerous
21
![Page 22: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/22.jpg)
Metadata mapping• Domain experts devise single schema
• Provide mappings of fields in a particular data source to that unified schema
• Enables an interface across all sources
22
![Page 23: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/23.jpg)
Downside• New source comes online with information not
covered by unified schema
• Have to make changes to all mappings to ensure accurate coverage
23
![Page 24: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/24.jpg)
New Approach: Pivoting• Marking up a single source, versus pushing all to a
single schema
• Use a pivot instead to situate metadata for integration
• Essentially means that the interface does the heavy lifting of integration
• Reduced effort by domain experts
24
![Page 25: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/25.jpg)
Interface Video
25
![Page 26: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/26.jpg)
Interface Video• Find a composer
• See all copyists of their manuscripts
• Choose a copyist and see which other composers that copyist has worked on
26
![Page 27: Using Pivots to Explore Heterogeneous Collections A Case Study in Musicology Daniel Alexander Smith 8 December 2009](https://reader035.vdocument.in/reader035/viewer/2022070306/5516050b55034694308b4da3/html5/thumbnails/27.jpg)
27