march 18 niso two part webinar: is granularity the next discovery frontier? part 2: the business...
TRANSCRIPT
NISO Two Part Webinar: Is Granularity the Next Discovery Frontier?
Part 2: The Business Complexities of Granular Discovery
Wednesday, March 18, 2015
Speakers:
Andrea Eastman-Mullins, Chief Operating Officer, Alexander Street Press
Dan Valen, Product Specialist, figshare
Dave Hovenden, Content Operations Manager,Summon® Service, ProQuest
http://www.niso.org/news/events/2015/webinars/granularity_pt2/
Granular Discovery: A Discipline-Based Approach
Andrea Eastman-Mullins, COO
March 18, 2015
ww
w.a
lexand
ers
treet.
com
Agenda
• About Alexander Street Press
• Granularity for teaching and research
• Discipline context
• Challenges & Solutions
Alexander Street Press
• One of the first and largest vendors of streaming video and
music into libraries
• Publisher of 100+ high quality collections
• Based in Alexandria, Virginia
• About 120 people
Individual Faculty
Broad range of Libraries
Harvard
University
University
of TokyoUniversity of
Phoenix
National U.
of Singapore
Columbia
University
New York
Public Library
Manhattan
Community
College
Okanagan
College
Albuquerque
Public LibraryBritish
Library
We serve 40m faculty and students
in more than 30,000 institutions.
• Anthropology
• Counseling
• History
• Music
• Performing Arts
• Diversity Studies
• Education
• Feature Film
• Business
• Health Sciences
Award-Winning Collections
Our roots are in granular content…
If the source is We index
Feature Film Script Segments
Journal Letter
Diary Daily Entry
Play Scenes
Local History Text Photographs
Archival Collection Item
Classical Work Movement
“I’ll research this on YouTube?”
• Anthropology: Examine marriage rituals across
cultures from 1975-2010.
• History: What words did JFK use most frequently in
his early career vs. his later career?
• Counseling: Show me every time Albert Ellis used
the word “emotion” in his counseling demonstrations.
• Music: Show me performances of Strauss by Kiri Te
Kanawa.
• Drama: Examine performances of Shakespearean
jokes over time.
Discipline-specific
Opera: DVD>Works>Movements
Theatre: Play>Act>Scene
Film: DVD>Documentary>Extras
Music: CD>Work>Movement
History: Diary>Diary Entry
News: Series>Episode>Segment
Education: DVD>Demonstrations
All:
User created & publisher curated
Contiguous & select
We work with all major
discovery services Free MARC records
ShareCOUNTER 4-compliant
usage stats Embed
Tools enable discovery
Usage: What is a playback?
• Each segment
• Extras
• Each full opera or play
• End-user created segments
• Overlapping segments
What level is for sale?
• Patron Driven Acquisition playback triggers sale
• Single title sale
• Total content count in a collection
• Free preview
Answers must serve:
Libraries Faculty Content Partners
How figshare makes content discoverable
4 ways…• Persistent identifiers through DataCite
• Institutional offering and API access
• Working alongside publishers
• SEO through figshare.com
Why data citations?
Pepe, Alberto; Goodman, Alyssa; Muench, August; Crosas, Merce; Erdmann, Christopher (2014): Some descriptive statistics about top domains linked in astronomy publications. Table_1.xls. PLOS ONE. 10.1371/journal.pone.0104798.t001.
Pepe, Alberto; Goodman, Alyssa; Muench, August; Crosas, Merce; Erdmann, Christopher (2014): Volume of potential data links in astronomy publications. Figure_1.tif. PLOS ONE. 10.1371/journal.pone.0104798.g001.
• Enabling easy reuse and verification of data• Allowing the impact of data to be tracked• Creating a scholarly structure that recognizes
and rewards data producers
Source: https://www.datacite.org/services/cite-your-data.html
Why DataCite?
Number of DataCite DOI registrations
figshare-hosted file views
figshare
Open data repository 1
Open data repository 2
figshare content on DataCite
✔
Reporting DashboardImpact and Usage Reporting
Administrative Workflow PortalA portal where administrators can manage curation of files to be made public, storage space allocation and user rights
Public Digital Research RepositoryA customizable public portal with all digital files made public at aninstitutional, departmental and group level
Research Data ManagementPrivate, controlled storage and collaborative spacesfor every academic at the institution
figshare for institutions: key pieces of functionality
✔
✔
✔
Work smart. Discover more.
Supported by
figshare’s APIThe figshare API allows you to push data to figshare, or pull data out
This allows you to build applications on top of academic research
PLOS and discovery
• figshare hosts the supplemental data for all seven PLOS journals
• You can see inside the SI files (datasets, text files, documents, animations, videos and presentation files)without leaving the article or opening them
• When you spot interesting files you can download them singly or altogether
figshare as a recommendation engine
• PLOS’ data availability policy
• Recommendations delivered by figshare extend beyond research outputs attached to PLOS publications
• Not all scholarly outputs may fit in an article, but might very well be critically instrumental to others’ research
Google Scholar
• We mark-up all figshare.comcontent for inclusion in Google Scholar
• Google bots recognize our content as open and academic
• Faster way to get to content–proof of research v. paper
Work smart. Discover more.
Supported by
Thank you so much for
your time!
@figshare
http://www.figshare.com
WHEN GRANULARITY MET
DISCOVERY: THE COMPLEXITIES
OF GRANULAR CONTENT
DISCOVERYNISO Webinar: Is Granularity the Next Discovery Frontier? Part 2: The
Business Complexities of Granular Discover
Dave Hovenden, Content Operations Manager – Summon
March 18, 2015
When Granularity Met Discovery
3/18/2015
When Harry Met Sally. Digital image. Wonder Magazine. Wonder Magazine. 4 Mar. 2015.
What Do We Mean When We Talk About Granular Content?
3/18/2015 47
Journal
Journal
Issue
Journal
Article
Subsection
Table
Granular Content Discovery – How it Should Work When All
the Pieces are in Place
3/18/2015 48
Granular Content
Discovery
Quick
Painless
Simple
Without a lot of ‘noise’
(i.e., irrelevant results)
Gets you want you need the first time
How it Should Work!
3/18/2015 49
Book Chapter Example
q= “On the City Walls and Built Environment of Pavia (1330)"
The Complexities of Granular Content Discovery
3/18/2015 51When Harry Met Sally Quotes. Digital image. Bustle. Bustle.com. 4 Mar. 2015.
The Complexities of Granular Content Discovery
3/18/2015 52When Harry Met Sally Quotes. Digital image. Bustle. Bustle.com. 4 Mar. 2015.
What are Some of the Complexities?
• System Changes to Accommodate New Levels of Granularity
– The changes can be done, but is the level of investment worth it
in all cases?
• Granularity Conflicts between Content Providers when Content
is Aggregated
– Disagreements on the granularity level of the content exist
between various content providers when they aggregate the
same content
• Metadata Quality
– Content can be granular, but if the metadata quality is
inconsistent or poor, it can limit discoverability
3/18/2015 53
Encountering New Levels of Granularity
3/18/2015 54When Harry Met Sally. Digital image. G-Pop. G-Pop. 11 Mar. 2015.
Introducing Brand New Levels of Granularity Discovery
3/18/2015 56
Granular Content
Discovery
Schema Changes
UI Changes
API Changes
Match & Merge
Changes Relevancy Changes
Content Mapping Changes
Link Resolver Changes
• What’s the scope/scale of this granular content? Do we have a lot of content coming in at this level of granularity, or is it only a small amount?
• Is this in high demand for librarians and their patrons?
• What other priorities do we have? Could the resources, time, and money be put to better use focusing on other product enhancements and improvements?
3/18/2015 57
Weighing the Business Considerations
Granularity Conflicts between Content Sources
3/18/2015 58
Source A
Source B
Archival News
by Taves, Brian
Cinema Journal, 03/1999, Volume 38, Issue 3. pg. 109
Video
by Taves, Brian
Cinema Journal, 03/1999, Volume 38, Issue 3, pg. 113
3/18/2015 59
The Consequences of Granularity Conflicts
• When the user tries to access the content, the metadata is sent to the link resolver to generate an OpenURL link
• Because the library has access to the journal from multiple sources, the user is presented with several options to decide from which source the linker should resolve to
• If the user tries to access the content from Source A, the link resolver will fail. This is because Source A’s metadata is slightly different because they’ve indexed the content at a different level of granularity
What Metadata the Link Resolver Uses to Construct the
OpenURL Link to Source A’s Platform
• rft_val_fmt=info:ofi/fmt:kev:mtx:journal&
• rft.genre=article&
• rft.atitle=Video&
• rft.jtitle=Cinema+Journal&
• rft.au=Taves%2C+Brian&
• rft.date=1999-03-22&
• rft.pub=University+of+Texas+at+Austin+%28University+of+Texas+Press%29&
• rft.issn=0009-7101&
• rft.eissn=1527-2087&
• rft.volume=38&
• rft.issue=3&
• rft.spage=113&
• rft.externalDBID=n%2Fa&
• rft.externalDocID=160417287¶mdict=en-US
3/18/2015 60
When Conflicts in Granular Content Can be Confusing Due to
Conflicting Metadata
3/18/2015 61
Reading list on Turkey
by Akyol, Mustafa
The World Today, ISSN 0043-9134, 08/2013,
Volume 69, Issue 4
The Original Article
3/18/2015 62
Reading list on Turkey
by Akyol, Mustafa
The World Today, ISSN 0043-9134, 08/2013,
Volume 69, Issue 4
Granular Content, Metadata Conflict
3/18/2015 65
Source A
Source B
Different start page
Different start page
Wrong issue
number
Granularity Conflicts Have the Potential to Become More
Common
3/18/2015 66
• Approx. 40% of the
Summon Index is
made up of content
from aggregation
databases
• More and more content
is being made available
to aggregation
databases from
Publishers and Open
Access resources
• Aggregation databases
are still popular among
libraries due to the
breadth of content they
provide
How Summon Enables Granular Content Discovery
3/18/2015 68
Book Chapter Example
q= “On the City Walls and Built Environment of Pavia (1330)"
Closing Thoughts
3/18/2015 69
• Accurate, detailed, and consistent metadata
for the granular content is important
• Explore what standards for granular content
discovery are needed
• The goal is to make it easier for people to find
granular content – that’s the whole point of
discovery!
3/18/2015 70
The EndWhen Harry Met Sally. Digital image. Parade Magazine. Parade Publications, INc.. 15 Mar. 2015.
NISO Webinar • March 11, 2015
Questions?All questions will be posted with presenter answers on
the NISO website following the webinar:
http://www.niso.org/news/events/2015/webinars/granularity_pt1/
NISO Two-Part Webinar
Is Granularity the Next Discovery Frontier?
Part 1: Supporting Direct Access to Increasingly Granular
Chunks of Content