Converting Millennium ILS Bibliographic records into Dublin-Core XML format for DSpace
Alan Ng
Hong Kong University Libraries
PNC 2009 Annual Conference and Joint Meetings
Taipei, Taiwan
Introduction
HKU Libraries
•established in 1912
•the oldest academic library in HK
•main library and 6 branches
HKU Libraries
•2.84M total physical volumes
•49K print periodical titles
•80K electronic periodical titles
•1.90M e-books
HKU Libraries
•Millennium ILS from Innovative Interfaces Inc.
•hosting the HKALL union catalog for 8 university libraries in HK
Institutional Repository
HKU Scholars Hub
•collects intellectual output of HKU for fulltext open access
•http://hub.hku.hk/
HKU Scholars Hub
•uses DSpace (version 1.5)
•OAI-compliant
•implements DCMI
HKU Scholars Hub
•25300+ records (as of June 2009)
•Articles
•Conference papers
•Postgraduate theses and others
•1.6M downloads (as of June 2009)
HKU Scholars Hub
•some records originate from the OPAC
•HKU postgraduate theses
•Digital editions from HKU Press
•Bibliographic MARC fields are mapped to DC XML data
MARC to DC mapping
001 identifier -- other
008 language
020 identifier -- isbn
022 identifier -- issn
050 subject -- lcc
092|a|b subject -- dcc
110|a contributor -- author
245|a|b title
260|b publisher
260|c date -- issued
300|a|b|c format -- extent
490|a relation -- ispartofseries
5XX description
650 subject -- lcsh
710|a|b contributor -- other
856|u identifier
970 description -- tableofcontents
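The mapping table can be held as a simple lookup structure. The following is a minimal Python sketch, not the presenter's program (which was written in Perl). The names `MARC_TO_DC`, `MARC_260_SUBFIELDS` and `dc_target` are hypothetical, subfield selection is ignored except for field 260, and treating the `5XX` row as "any tag from 500 to 599" is an assumption about how that wildcard was meant.

```python
# Hypothetical lookup table: MARC tag -> (DC element, DC qualifier or None).
# Values are copied from the mapping table; subfield lists are omitted.
MARC_TO_DC = {
    "001": ("identifier", "other"),
    "008": ("language", None),
    "020": ("identifier", "isbn"),
    "022": ("identifier", "issn"),
    "050": ("subject", "lcc"),
    "092": ("subject", "dcc"),  # qualifier exactly as written in the table
    "110": ("contributor", "author"),
    "245": ("title", None),
    "300": ("format", "extent"),
    "490": ("relation", "ispartofseries"),
    "650": ("subject", "lcsh"),
    "710": ("contributor", "other"),
    "856": ("identifier", None),
    "970": ("description", "tableofcontents"),
}

# Field 260 maps to different DC elements depending on the subfield.
MARC_260_SUBFIELDS = {
    "b": ("publisher", None),
    "c": ("date", "issued"),
}


def dc_target(tag, subfield=None):
    """Return (element, qualifier) for a MARC tag, or None if unmapped."""
    if tag == "260":
        return MARC_260_SUBFIELDS.get(subfield)
    if tag in MARC_TO_DC:
        return MARC_TO_DC[tag]
    # Assumption: the "5XX" row covers every note field from 500 to 599.
    if len(tag) == 3 and tag.startswith("5") and tag.isdigit():
        return ("description", None)
    return None
```

For example, `dc_target("260", "c")` gives `("date", "issued")`, and an unmapped tag such as `"999"` gives `None`.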
Same record in Hub
http://hub.hku.hk/handle/123456789/55513
Automated batch processing
Incentives
•needs to convert 100+ records at a time
•tedious and error-prone when done manually
•time consuming
Automated approach
•efficiency
•accuracy
•eliminate duplicated effort of data entry
•easier quality control of converted data
Perl programming
•free of charge
•easy to program
•powerful in handling plain text in MARC
•runs on any computer platform
•needs a persistent URL syntax to locate a particular record on OPAC
Perl programming
•reads in a list of bibliographic record numbers
•captures the MARC records on the OPAC in real time, one by one, via HTTP
•regards the returned HTML as plain text
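The retrieval steps above can be sketched as follows. This is an illustrative Python version, not the Perl program described in the talk. The URL pattern is generalised from the single example record URL shown in these slides (`b4200627`); the function names, the timeout, and the UTF-8 decoding are assumptions.

```python
from urllib.request import urlopen

# URL pattern generalised from the one example record in these slides.
# Assumption: other record numbers follow the same pattern.
MARC_URL = ("http://library.hku.hk/search~S6?/.{bib}/.{bib}"
            "/1%2C1%2C1%2CB/marc~{bib}")


def marc_url(bib_number):
    """Build the MARC-view URL for a record number such as 'b4200627'."""
    return MARC_URL.format(bib=bib_number)


def read_record_numbers(lines):
    """Return the non-empty, stripped record numbers from an iterable of lines."""
    return [line.strip() for line in lines if line.strip()]


def fetch_marc_pages(bib_numbers, timeout=30):
    """Yield (record number, page text) pairs, one HTTP request per record."""
    for bib in bib_numbers:
        with urlopen(marc_url(bib), timeout=timeout) as response:
            # Assumption: the page is UTF-8; undecodable bytes are replaced.
            # The HTML is kept as plain text, with no DOM parsing.
            yield bib, response.read().decode("utf-8", errors="replace")
```

A typical call would pass an open text file of record numbers to `read_record_numbers` and then iterate over `fetch_marc_pages`; whether the catalogue still answers at this address is outside what the sketch can promise.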
MARC record as seen by human
http://library.hku.hk/search~S6?/.b4200627/.b4200627/1%2C1%2C1%2CB/marc~b4200627
MARC record as seen by program
http://library.hku.hk/search~S6?/.b4200627/.b4200627/1%2C1%2C1%2CB/marc~b4200627
Perl programming
•extracts the essential MARC fields using regular expressions
•constructs the DC fields according to the mapping table
•converts 100+ records in a couple of minutes
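The extraction step can be illustrated with a short Python sketch (the talk's own program was in Perl and is not reproduced here). The sample text is made up, and its layout is an assumption about a plain-text MARC display: a three-digit tag, two indicator characters, then data in which `|x` starts subfield `x` and any leading data without a marker is subfield `a`. Real pages would also need their HTML tags and entities stripped first.

```python
import re

# Made-up sample, not a real record. Layout assumed: tag, space,
# two indicator characters, space, then the field data.
SAMPLE = """\
020    9780000000002
245 10 An example title :|ba subtitle /|cA. Author
260    Hong Kong :|bExample Press,|c2009
"""

FIELD_RE = re.compile(r"^(\d{3}) (..) (.*)$", re.MULTILINE)
SUBFIELD_RE = re.compile(r"\|([a-z0-9])([^|]*)")


def parse_fields(text):
    """Return a list of (tag, {subfield code: value}) tuples."""
    fields = []
    for tag, _indicators, data in FIELD_RE.findall(text):
        # Assumption: data before the first "|" is an implicit subfield a.
        if not data.startswith("|"):
            data = "|a" + data
        # A repeated subfield code keeps only its last value in this sketch.
        subfields = {code: value.strip()
                     for code, value in SUBFIELD_RE.findall(data)}
        fields.append((tag, subfields))
    return fields
```

Each parsed tag and subfield can then be looked up in the mapping table to decide which DC element receives the value.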
Converted record in DC XML format
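A minimal Python sketch of writing such a record, assuming the target is the `dublin_core.xml` layout read by DSpace's batch item importer (a `dublin_core` root holding `dcvalue` elements with `element` and `qualifier` attributes, and `none` for an absent qualifier). The function name and the sample values are hypothetical, and the presenter's Perl output may have differed in detail.

```python
import xml.etree.ElementTree as ET


def build_dublin_core(values):
    """Serialise (element, qualifier, text) triples as a dublin_core document."""
    root = ET.Element("dublin_core")
    for element, qualifier, text in values:
        dcvalue = ET.SubElement(
            root, "dcvalue",
            element=element,
            qualifier=qualifier or "none",  # unqualified values use "none"
        )
        dcvalue.text = text  # ElementTree escapes &, < and > on output
    return ET.tostring(root, encoding="unicode")


# Placeholder values, not taken from a real record.
print(build_dublin_core([
    ("title", None, "An example title"),
    ("date", "issued", "2009"),
]))
```

Building the document through an XML library rather than string concatenation means characters such as `&` in titles are escaped automatically.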
Running Perl program
•runs natively on Unix, Linux and Mac OS X
•needs Perl interpreter on Windows
•download ActivePerl
•http://www.activestate.com/activeperl/
Running the program on Mac OS X
Demo
Recap
Recap
•uses existing MARC records for DSpace
•uses Perl program for fast batch conversion
•retrieves MARC in real time via HTTP
•works with any OPAC with persistent URL
•source code is free for sharing
Q & A