humanidades digitales por ryan shaw (university of north carolina at chapel hill, estados unidos)

65
Editors’ Notes Managing Working Research Notes Ryan Shaw School of Information and Library Science University of North Carolina at Chapel Hill

Upload: innovatics

Post on 20-Jun-2015

179 views

Category:

Technology


2 download

DESCRIPTION

Ryan Shaw is an assistant professor in the School of Information and Library Science at the University of North Carolina at Chapel Hill. In 2010 he received his Ph.D. from the University of California, Berkeley School of Information, where he wrote a dissertation on how events and periods function as concepts for organizing historical knowledge. He is also the author of the LODE (Linking Open Descriptions of Events) ontology, recently adopted by the UK Archives Hub for their Linked Data effort. In 2012 he received a three-year Early Career Development grant from the Institute of Museum and Library Services to invent new tools for applying computational text processing techniques to organize large collections of civil rights histories. He is also a co-PI of the Editors' Notes project , a Mellon Foundation-funded effort to develop open, collaborative notebooks for humanists, and the PeriodO project, an NEH-funded gazetteer of scholarly assertions about the extents of historical, art-historical, and archaeological periods. In the past he has been involved in a number of digital humanities projects through his work with the Electronic Cultural Atlas Initiative. In a previous life, he worked as a software engineer in Tokyo, Japan.

TRANSCRIPT

Page 1: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Editors’ Notes Managing Working Research Notes

Ryan Shaw School of Information and Library Science University of North Carolina at Chapel Hill

Page 2: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

What are “working research notes”?!

Current tools for organizing editorial research!

The Editors’ Notes system!

Initial experiments with using Linked Data!

Current efforts and where we’re going!

Parting thoughts on Library Linked Data

Page 3: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

➡ What are “working research notes”?!

Current tools for organizing editorial research!

The Editors’ Notes system!

Initial experiments with using Linked Data!

Current efforts and where we’re going!

Parting thoughts on Library Linked Data

Page 4: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Our goal

is to provide a “medium by which much valuable information may become a sort of common property among those who can appreciate and use it”

Thoms, William J. 1849. “Notes and Queries.”

http://nq.oxfordjournals.org/content/s1-I/1/1.full.pdf+html

Page 5: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Documentary editing

Editors prepare collections of documents: letters, articles, diaries, essays, etc. !

Printed volumes provide context for better understanding subjects’ experiences and general milieu through footnotes, images, chronologies, articles

Page 6: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Documentary editing: workflow

1. Gather documents!

2. Contextualize select items!

3. Publish final product!

4. Repeat as funding allows

Page 7: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Example: !

Emma Goldman

Papers

Page 8: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 9: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Patrick— !Lenin: Had any of his family members beside his brother, been imprisoned? !What was the book he had written on ‘political economy’ that was used in Russian Universities? !New York (Evening?) Post, September 1918 editorial on IWW verdict for the huge IWW trial in Chicago.

Page 10: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Sources consulted, notes taken based on findings Notes stored in a Word document? Yellow notebook? Email? Negative conclusion reached to question, (but few will ever see this)

Page 11: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Working notes

… are relevant to source documents, but are not necessarily tied to any specific document (as annotations are)!

… may or may not become a formal finished product!

… critical to the functioning of scholarly projects at any scale

Page 12: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

What are “working research notes”?!

➡ Current tools for organizing editorial research!

The Editors’ Notes system!

Initial experiments with using Linked Data!

Current efforts and where we’re going!

Parting thoughts on Library Linked Data

Page 13: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 14: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 15: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 16: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 17: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

ProblemsPublished volumes & necessary work are expensive!

Lack of space for all footnotes!

Much of research done is either glossed over in footnotes or not included at all!

Fact checking!

Falsification or dead ends!

Tangential biographical details!

Preservation & legacy

Page 18: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

What are “working research notes”?!

Current tools for organizing editorial research!

➡ The Editors’ Notes system!

Initial experiments with using Linked Data!

Current efforts and where we’re going!

Parting thoughts on Library Linked Data

Page 19: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

http://editorsnotes.org/

Page 20: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 21: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 22: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 23: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 24: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 25: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 26: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 27: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 28: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 29: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

NotesDescription

Status

Assigned users

Sections

Citation with optional notes

Stored as HTML

Revision history

Page 30: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 31: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 32: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Documents

Zotero for document metadata

High quality, zoomable scans

Transcripts in HTML with interface to annotate passages of text

Page 33: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 34: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 35: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 36: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 37: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Topics

Primary method of indexing items

Classified by type

Interface for clustering and merging

Page 38: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 39: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 40: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 41: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Design principles

Minimal amount of “friction” for researchers!

Flexibility for different work habits!

Consistency in data models!

Existing technology wherever possible!

Adherence to web standards

Page 42: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Open sourcehttps://github.com/editorsnotes

HTTP API built on Django web framework!

PostgreSQL database!

Haystack for full-text searching!

Zotero for document description!

Google Refine for duplicate detection!

Mozilla Persona for ID management

Page 43: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

What has changed for researchers?

Implicit people, places, events

Explicit linkable entities

Free text Structured blocks

Filing cabinets Open access

Page 44: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

BenefitsConnections linking topics are freed from the minds of editors & researchers and indexed for anyone to see!

Standardized records of work can easily be revisited from within a project or from outside!

New way of seeing the outer edges of humanities research!

Evidence of intense, often messy, scholarship behind concise, clean footnotes

Page 45: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

What are “working research notes”?!

Current tools for organizing editorial research!

The Editors’ Notes system!

➡ Initial experiments with using Linked Data!

Current efforts and where we’re going!

Parting thoughts on Library Linked Data

Page 46: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 47: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

http://metadata.berkeley.edu/emma/

Page 48: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 49: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

topic names statements

(triples)

candidate URIs

Page 50: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 51: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Lessons learnedPossible to automatically harvest relevant linked data from libraries and other institutions!

Editorial control over the harvested data needs to be better integrated into the note-taking process!

Did not adequately demonstrate the benefits of structured data !

Do not simply aggregate and edit linked data— need to usefully exploit it to researchers’ benefit.

Page 52: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

What are “working research notes”?!

Current tools for organizing editorial research!

The Editors’ Notes system!

Initial experiments with using Linked Data!

➡ Current efforts and where we’re going!

Parting thoughts on Library Linked Data

Page 53: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

In-process reconciliation

Editors create topics to label and index their notes; later reconciled to external identifiers in a separate batch process.

Old

Editors fluidly create, link to, and reconcile

topics within the note-taking process.

New

Page 54: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Backbone.js (client-side JavaScript app framework)

Indexed Database API

JSON-LD storage API

JSON document store

Existing Editors’ Notes API

client-side

server-side

Page 55: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)
Page 56: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Motivating structured data use

enabled storing and editing of structured data, but provided no incentive for editors to do this

Old

storing and editing structured data

immediately enables sorting and filtering and creating simple

visualizations

New

Page 57: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Sorting & filtering

Page 58: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Sorting & filteringFilter and sort notes not only using the dates of the cited documents (as they currently can), but also using:!

locations and birth and death dates of the people referenced in the notes!

locations and dates of existence of the organizations referenced!

locations and dates of the events referenced

Page 59: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Backbone.js (client-side JavaScript app framework)

Indexed Database API

JSON-LD storage API

JSON document store

Existing Editors’ Notes API

D3.js (data-driven visualization)

client-side

server-side

Page 60: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Visualizing notesA note on Dhanvanthi Rama Rau & the Fourth International Conference on Planned Parenthood can become viewable as: !

a map of specific locations in Stockholm and Bombay!

a timeline of dates associated with the conference!

a network of relationships among people and organizations.

Page 61: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Expected benefits

Working notes become repurposable!

Working notes become more discoverable!

Shift of focus from one-shot product to continuous data curation process

Page 62: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

What are “working research notes”?!

Current tools for organizing editorial research!

The Editors’ Notes system!

Initial experiments with using Linked Data!

Current efforts and where we’re going!

➡ Parting thoughts on Library Linked Data

Page 63: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Bibliographic metadata

Currently we rely on Zotero for bibliographic metadata!

Professional cataloging ⇒ HTML pages ⇒ web scraping ⇒ manual editing!

This is madness: libraries (and archives) should be the premier source of high-quality, easy to use bibliographic metadata

Page 64: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

obtain

New user task: reuse

find

identify

select

explore

contextualize

justify

FRBR

FRAD FRSAD

Page 65: Humanidades digitales por Ryan Shaw (University of North Carolina at Chapel Hill, Estados Unidos)

Thank YouWe are grateful for funding from:!

The Andrew W. Mellon Foundation!Coleman Fung!

!Ryan Shaw [email protected]!Project information http://ecai.org/mellon2010/!Project site http://editorsnotes.org/!Source code https://github.com/editorsnotes