Opportunities for Collaboration: The HEARTH Project. Joy Paulson and Nathan Rupp, Cornell University. Digital Library Federation Spring Forum, New Orleans, Louisiana, April 2004.

Opportunities for Collaboration: The HEARTH Project

Joy Paulson and Nathan Rupp

Cornell University

Digital Library Federation Spring Forum

New Orleans, Louisiana, April 2004

Outline

• Introduction

• Metadata

• Tying metadata together

The HEARTH Project

• Home Economics Archive: Research, Tradition, History

• Subjects covered

• Bibliography development

• Reformatting

• Technical details

• Funding

Vendor Metadata

TIFF headers contain:

• Bibliographic information

• Descriptive information

• Date and time of scan

• Source description

• Image description

Image Description

• Monographs and serials
– Shipment number
– ID number
– Image sequence number (padded to 8 digits)
– Technical description information

• Serials only
– Volume and issue number
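The padded sequence number can be illustrated with a short sketch. The key layout below (shipment, ID, and sequence joined by slashes), the sample values, and the function name `image_key` are assumptions for illustration; the slide gives only the list of fields and the 8-digit padding.

```python
def image_key(shipment: int, item_id: str, sequence: int) -> str:
    """Join the three identifiers, zero-padding the sequence to 8 digits."""
    if not 0 <= sequence < 10**8:
        raise ValueError("sequence number must fit in 8 digits")
    return f"{shipment}/{item_id}/{sequence:08d}"


# Hypothetical values: three images of one item, listed out of order.
keys = [image_key(3, "item42", n) for n in (10, 2, 1)]

# Zero-padding means a plain string sort also puts the images in page order.
print(sorted(keys))
```

Without the padding, a string sort would place image 10 before image 2, which is one practical reason to fix the width of the sequence number.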

Image Description

• Image width

• Image length

• X resolution

• Y resolution

• X position

• Y position

• Resolution unit

• Bits per sample

• Compression

• Orientation
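These fields correspond to standard TIFF 6.0 tags, so they can be read directly from the image header. The following is a minimal sketch using only the Python standard library. It handles only single-valued SHORT, LONG, and RATIONAL entries in the first image file directory, and the function name `read_tiff_tags` is an assumption, not part of the project's tooling.

```python
import struct

# TIFF 6.0 tag numbers for the fields listed on the slide.
TAG_NAMES = {
    256: "ImageWidth", 257: "ImageLength", 258: "BitsPerSample",
    259: "Compression", 274: "Orientation", 282: "XResolution",
    283: "YResolution", 286: "XPosition", 287: "YPosition",
    296: "ResolutionUnit",
}


def read_tiff_tags(data: bytes) -> dict:
    """Return {tag name: value} for known tags in the first IFD."""
    # Bytes 0-1 give the byte order: "II" little-endian, "MM" big-endian.
    order = {b"II": "<", b"MM": ">"}.get(data[:2])
    if order is None:
        raise ValueError("not a TIFF file")
    magic, ifd_offset = struct.unpack(order + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a TIFF file")

    (count,) = struct.unpack(order + "H", data[ifd_offset:ifd_offset + 2])
    tags = {}
    for i in range(count):
        # Each IFD entry is 12 bytes: tag, type, count, value-or-offset.
        start = ifd_offset + 2 + 12 * i
        tag, ftype, n = struct.unpack(order + "HHI", data[start:start + 8])
        raw = data[start + 8:start + 12]
        name = TAG_NAMES.get(tag)
        if name is None or n != 1:
            continue  # skip unknown tags and multi-valued entries
        if ftype == 3:    # SHORT: value stored inline in the first 2 bytes
            tags[name] = struct.unpack(order + "H", raw[:2])[0]
        elif ftype == 4:  # LONG: value stored inline
            tags[name] = struct.unpack(order + "I", raw)[0]
        elif ftype == 5:  # RATIONAL: the 4 bytes are an offset to num/den
            (offset,) = struct.unpack(order + "I", raw)
            num, den = struct.unpack(order + "II", data[offset:offset + 8])
            tags[name] = num / den
    return tags
```

Calling `read_tiff_tags` on the bytes of a scanned page returns a dictionary keyed by the tag names above, which could then be checked against the scanning specification.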

Descriptive Metadata: Traditional Cataloging

• All titles included in OPAC
– Titles available at Cornell
– Titles from other libraries supporting home economics

• MARC 856 field
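In MARC 21, field 856 (Electronic Location and Access) links a catalog record to an online copy, with the URI in subfield u. A minimal sketch of pulling those links out of a record follows. The record shape (a list of tag and subfield pairs), the function name `online_links`, and the example URL are all hypothetical.

```python
def online_links(fields):
    """Return every 856 subfield u value from (tag, [(code, value), ...]) pairs."""
    return [
        value
        for tag, subfields in fields
        if tag == "856"
        for code, value in subfields
        if code == "u"
    ]


# Hypothetical record with one title field and one electronic location.
record = [
    ("245", [("a", "Example title")]),
    ("856", [("u", "http://example.org/hearth/item42")]),
]
print(online_links(record))
```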

Descriptive Metadata: Repurposing MARC

• Descriptive metadata from MARC records

• MARC to TEI Lite conversion scheme
– 100ad: Author
– 245ab: Title
– 260abc: Publication
– 300a: Pagination
– 6XX: Keywords

• Problems
– Dates
– Edition statements
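The field-to-label table above can be sketched as a small extraction routine. This is an illustration under stated assumptions, not the project's conversion code: the record shape, the function name `repurpose_marc`, the sample data, and the choice to join 6XX subfields with " -- " are all hypothetical. The sketch stops at labeled strings because the slides do not name the target TEI Lite elements, and it does not attempt the date and edition-statement problems.

```python
# Subfields to pull for each MARC tag, following the table on the slide.
MAPPING = {
    "100": ("Author", "ad"),
    "245": ("Title", "ab"),
    "260": ("Publication", "abc"),
    "300": ("Pagination", "a"),
}


def repurpose_marc(fields):
    """fields: list of (tag, [(subfield code, value), ...]) pairs."""
    out = {"Keywords": []}
    for tag, subfields in fields:
        if tag in MAPPING:
            label, codes = MAPPING[tag]
            # Keep only the listed subfields, in record order.
            parts = [v.strip() for code, v in subfields if code in codes]
            out[label] = " ".join(parts)
        elif tag.startswith("6"):
            # Assumption: each 6XX field becomes one keyword string.
            out["Keywords"].append(" -- ".join(v.strip() for _, v in subfields))
    return out


# Placeholder record, not taken from the HEARTH collection.
record = [
    ("100", [("a", "Doe, Jane,"), ("d", "1850-1920.")]),
    ("245", [("a", "Example title :"), ("b", "a subtitle /"), ("c", "by Jane Doe.")]),
    ("260", [("a", "Ithaca :"), ("b", "Example Press,"), ("c", "1901.")]),
    ("300", [("a", "120 p. ;"), ("c", "20 cm.")]),
    ("650", [("a", "Home economics"), ("x", "History.")]),
]
for label, value in repurpose_marc(record).items():
    print(label, "=", value)
```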

Structural Metadata

Document Structuring Tool

• Designed in house

• Provides project management

• Manages copyright information

• Structures the document

• Performs quality control of images

Administrative System (1)

Administrative System (2)

Administrative System (3)

Administrative System (4)

Copyright Management

Structuring Documents (1)

Structuring Documents (2)

• Quality control of images

• Match page numbers to image sequence

• Highlight important structures:
– Title pages
– Table of contents
– Other tables
– Front matter
– Notes
– Indexes
– Illustrations
– Bibliographies
– Back matter
– Errata
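Matching printed page numbers to the image sequence, and tagging the structures listed above, can be sketched with a small data structure. The class `PageImage`, the function `label_pages`, and the sample values are assumptions for illustration; the slides do not describe how the in-house tool stores this information.

```python
from dataclasses import dataclass, field


@dataclass
class PageImage:
    sequence: int          # position in the scanned image sequence
    page_label: str = ""   # printed page number, "" if the page is unnumbered
    features: list = field(default_factory=list)  # e.g. ["Title pages"]


def label_pages(image_count, first_numbered, first_page=1):
    """Number pages consecutively, starting at image `first_numbered`."""
    pages = []
    for seq in range(1, image_count + 1):
        label = ""
        if seq >= first_numbered:
            label = str(first_page + seq - first_numbered)
        pages.append(PageImage(seq, label))
    return pages


# Hypothetical volume: two unnumbered front-matter images, then pages 1-4.
pages = label_pages(image_count=6, first_numbered=3)
pages[0].features.append("Title pages")
pages[1].features.append("Table of contents")
print([(p.sequence, p.page_label) for p in pages])
```

Keeping the image sequence and the printed label as separate fields lets unnumbered pages stay in the sequence without disturbing the page numbering.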

Providing Access

• Article level
– Browse by author
– Search for author or title

• Volume level
– Structuring Process

Other Functionalities

• Insert blank pages

• Insert pages missing note

• Reorder pages out of sequence

• Mark pages with scanning problems (description placed in notes field)
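The operations in this list can be sketched as edits to an ordered list of page entries. The entry shape (a dictionary with `image` and `note` keys), the function names, and the file names are hypothetical; they only illustrate the kind of edits described, not the tool's actual design.

```python
def insert_placeholder(pages, index, note):
    """Insert an entry with no image (blank page or 'pages missing' note)."""
    pages.insert(index, {"image": None, "note": note})


def move_page(pages, old_index, new_index):
    """Move one page entry to a new position in the sequence."""
    pages.insert(new_index, pages.pop(old_index))


def flag_problem(pages, index, description):
    """Record a scanning problem in the entry's notes field."""
    pages[index]["note"] = description


# Hypothetical sequence in which b.tif was scanned after c.tif.
pages = [{"image": name, "note": ""} for name in ("a.tif", "c.tif", "b.tif")]
move_page(pages, 2, 1)
insert_placeholder(pages, 3, "pages missing")
flag_problem(pages, 0, "skewed scan")
print([p["image"] for p in pages])
```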

Tying It All Together

• Vendor-supplied metadata

• Descriptive (MARC) metadata

• Preservation metadata

• Structural metadata

• MARC record information

• OCR

• Images on server

HEARTH

Browse by Author or Title

Browse Journals Page

Journals and Monographs by Year

Article Author Browse Page (1)

Article Author Browse Page (2)

Questions?

Joy Paulson

(607) 255-7950

[email protected]

Nathan Rupp

(607) 255-7943

[email protected]