pattern recognition in action for cataloging and metadata

42
OCLC Online Computer Library Center Pattern Recognition in Action for Cataloging and Metadata 2006 OLC Technical Services Retreat Chris Grabenstatter April 25, 2006

Upload: jonah

Post on 19-Feb-2016

46 views

Category:

Documents


0 download

DESCRIPTION

Pattern Recognition in Action for Cataloging and Metadata. 2006 OLC Technical Services Retreat Chris Grabenstatter April 25, 2006. Agenda. OCLC Cataloging/Metadata strategic directions Architecture to support strategy Examples of projects . Cataloging Environment. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Pattern Recognition in Action for Cataloging and Metadata

2006 OLC Technical Services RetreatChris GrabenstatterApril 25, 2006

Page 2: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Agenda

OCLC Cataloging/Metadata strategic directions

Architecture to support strategy Examples of projects

Page 3: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Cataloging Environment

Fewer catalogers, reduced budgets Little growth in print materials

acquisitions E-resources increasing – cataloged?

Page 4: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Deliver more automatically

Build on PromptCat, Cataloging Partners program success

Partner with major materials providers Cataloging tied to selection – possible

new service

Page 5: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

More Scripts/Language Support

Growing WorldCat Supporting libraries’ diverse collections Easier to get materials cataloged

Growing membership One stop shopping Both US and global libraries

Page 6: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Metadata support for e-content

Support automated metadata generation for e-resources

Facilitate storage and discovery of digital content

Support new metadata schemes - crosswalks Enrich WorldCat with e-serials records and

holdings

Page 7: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Continue to deliver value

Ongoing Connexion maintenance Standards Simplify pricing

Page 8: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Lego Era

“The Internet is entering its Lego era. Indeed, blocks of interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

--John Markoff, The New York Times, April 5, 2006

Page 9: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Library 2.0

“Library 2.0 is about small pieces of software loosely joined, … requires business models where multiple vendors bring value to consumers together … to reduce duplication of effort and reduce barriers to innovation…”

--Paul Miller, “Library 2.0: the challenge of disruptive innovation.”

http://www.talis.com/resources/documents/447_Library_2_prf1.pdf

Page 10: Pattern Recognition in Action for Cataloging and Metadata

OAI RepositoriesLocal DB’s

Metadata Capture

LanguageService

Validate

Reports &Stats

Profiling

FormatCrosswalks

Terminologies

ProfilingData

UsageStats

Connexion Digital Archive Content Coop ILS PICA NetLibrary Material Vendors Publishers

SRW/Zing Update

Shelf Ready

Metadata Creation

OAI Harvest

OCLC Metadata Management Service

DigitalArchive

Acquisitions/Selection

Web Services/Portal/API Layer

Z39.50(authoritiesNon-roman

LocalHoldings

(MFHD)DA Ingest DA Extract

Pan/Zoom

DA Access

Page 11: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Projects Metadata support for e-content

Extraction/Creation Web Service Crosswalk Web service OCLC Terminologies Service Content Cooperative Pilot OCLC eSerials Holdings Service

Page 12: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Extraction/Creation Web ServiceExtract metadata from Web sites, PDF files,

and Word files Re-implementing and enhancing functionality

currently available in Connexion browser Connexion browser – May 2006 Connexion client 1.60 – June 2006

Page 13: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Connexion extract metadata

Enter URL or path to extract metadata Supported file types .htm, .doc, .pdf

Create multiple records from Web sites linked to the parent URL

Specify to display or save created workforms, apply default constant data, and define My Status value

Future – add tools to “create” metadata

Page 14: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 15: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 16: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 17: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 18: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 19: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Crosswalk Web Service Batchload/PromptCat

ONIX to MARC OAI harvesting – Dublin Core to MARC

Future Import and export Dublin Core data from

Connexion client Support for other metadata schemes both browser

and client interfaces

Page 20: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

OCLC Terminologies service Introduction, June 2006 Add more access points using other controlled

vocabularies, e.g., MeSH, GSAFD Available to all OCLC Cataloging subscribers Subscriptions available for non-Cataloging

users Use with a variety of metadata editors, e.g.,

Connexion browser and client

Page 21: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

List of Terminologies in initial release aat - Art & Architecture Thesaurus (J. Paul Getty Trust) dct - Dublin Core Metadata Initiative Type Vocabulary (Dublin Core

Metadata Initiative) gmgpc - Thesaurus of Graphic Materials, TGM I (Library of Congress) gsafd - Guidelines On Subject Access To Individual Works Of Fiction,

Drama, Etc. (American Library Association) lctgm - Thesaurus of Graphic Materials, TGM II (Library of Congress) mesh - Medical Subject Headings (MeSH®) (National Library of

Medicine) ngl - Newspaper Genre List (University of Washington) tgn - Thesaurus of Geographic Names (J. Paul Getty Trust) ulan - Union List of Artists' Names (J. Paul Getty Trust

Page 22: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

12

TerminologyPane A separate application

Page 23: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Content Cooperative Pilot Upload content objects to the OCLC Digital Archive

from Connexion browser and client interfaces Digital image, thesis & dissertation, oral history, e-

book, video, etc. Replace WorldCat records to automatically add a

URL pointing to the content object Access digital content from FirstSearch, Group

Catalogs, and OpenWorldCat

Page 24: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 25: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 26: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 27: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 28: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 29: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 30: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Planned Open WorldCat Page for Digital Image

Page 31: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Planned Zoom & Pan

Page 32: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

OCLC eSerials Holdings service Automatically updates eSerials holdings in

WorldCat Access to eSerials via WorldCat Resource Sharing Access to eSerials through OCLC discovery

platform Compare electronic and print serials collections

No additional work for the library!

Page 33: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Digital CollectionsPrint Collections

D

P

Vendors

E

E

E

E

E

E

OCLC FirstSearchWorldCat Resource SharingWorldCat Collection AnalysisOpen WorldCat

E P

OCLC

P

P

P

P

P

P

P

P

P

P

P

OCLC Libraries

E

E

E

E

E

P

P

P

P

P

E

E

E

E

E

E

P

Resolver / A-Z serials listP

Resolver / A-Z serials list

Links to online Full TextResource Sharing ServicesLinks to OPACs

Page 34: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Pilot Partners

35 Pilot libraries EBSCO Ex Libris Serials Solutions TDNet More to come

Page 35: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Benefits to the library

Increased operational efficiencies in ILL Filling where possible A revenue opportunity for some You control requests via automatic deflection

Increased visibility at the point of need Leverages investment in services

Page 36: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Progress

Initial production system available late June 2006 Web-based registration form No charge to participate in the eSerials holdings

service Future enhancements projected to include

options for local holdings data, MARC record update service, and additional deflection choices

Page 37: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

ProjectsDeliver more automatically

Improve shelf-ready cataloging PromptCat/Cataloging Partners – 100% goal Partner with major vendors

Selection Possible future service OCLC partnering with materials vendors to help with

notification slip selection process Cataloging a by product of selection Watch for more information in the future!

Page 38: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Projects More Scripts/Language support

New scripts support Cyrillic, Greek and Hebrew – July 2005 Thai and Tamil scripts for use with Connexion client 1.50 (investigating

Devanagari, Sinhala, and Bengali next) Connexion interface translations

Client Chinese (Traditional and Simplified) and Japanese – July 2005 German and Korean – Nov. 2005

CatExpress – French (Nov. 2006) Unicode export – Nov. 2005 Automatic transliteration Web service – June 2006

Page 39: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 40: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Page 41: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Questions, Answers & Sharing

???

Page 42: Pattern Recognition in Action for Cataloging and Metadata

OCLC Online Computer Library Center

Cataloging future directionsContact us

Eric Childress: [email protected]

Chris Grabenstatter: [email protected]