make marcedit work for you: olc technical services retreat

51
Making MarcEdit Work For You TERRY REESE HEAD OF DIGITAL INITIATIVES THE OHIO STATE UNIVERSITY [email protected]

Upload: reeseterry

Post on 26-May-2015

741 views

Category:

Education


4 download

DESCRIPTION

Walk through of some MarcEdit's common functions

TRANSCRIPT

Page 1: Make MarcEdit Work For You: OLC Technical Services Retreat

Making MarcEdit Work For You

TERRY REESE

HEAD OF DIGITAL INITIATIVES

THE OHIO STATE UNIVERSITY

[email protected]

Page 2: Make MarcEdit Work For You: OLC Technical Services Retreat

Topics Working With MARC Data

◦ Breaking/Making◦ Processing in Batch◦ Handling Character Conversions◦ Dealing with Errors

Working with Non-MARC Data◦ Understanding MarcEdit’s XML Framework◦ Adding New XML Functions◦ Dealing with Delimited Data

Editing MARC Records◦ Global Editing Functions◦ Automated Tasks◦ OAI Harvesting

Page 3: Make MarcEdit Work For You: OLC Technical Services Retreat

Topics Integrating MarcEdit with OCLC

◦ Batch Holdings Edits◦ Working with Local Bibliographic Data Records◦ Editing WorldCat in Real-Time

MarcEdit and RDA◦ Understanding the RDA Helper

Getting Help

Page 4: Make MarcEdit Work For You: OLC Technical Services Retreat

Working with MARC dataWhat is the MARC Tools section

• Access to the Making and Breaking functionality

• Characterset processing• Access to the XML Sub-routines

Page 5: Make MarcEdit Work For You: OLC Technical Services Retreat

Marc Tools Built-in functions

◦ MarcBreaker – Tool used to convert MARC records to the MarcEdit mnemonic format

◦ MarcMaker – Tool used to convert MarcEdit mnemonic format to MARC◦ MARC=>MARC21XML – converts MARC to MARC21XML

◦ Automatically converts data from MARC-8 to UTF8

◦ MARC21XML=>MARC – converts MARC21XML to MARC◦ Doesn’t automatically convert data from UTF8 to MARC8 – will leave data in UTF8

Page 6: Make MarcEdit Work For You: OLC Technical Services Retreat

MARCEngine Settings Of Note:

◦ Use Diacritics turns mnemonics on and off

◦ MARCXML XSLT determines how data moves between MarcEdit’s mnemonic format and MARCXML

◦ XSLT Engine◦ Saxon.net supports XSLT 2.0◦ MSXML supports XSLT 1.0, but is orders of

magnitude faster

◦ Unicode Normalization◦ New feature designed to allow

international users to break away from MARC21’s preferred KD normalization

Page 7: Make MarcEdit Work For You: OLC Technical Services Retreat

MARC Character Conversions Supports moving between any known Windows Characterset and MARC8.

Can be run from the Breaker/Maker – or as its own standalone utility

Page 8: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit and bad records

Two MARC breaking algorithms◦ Strict MARC algorithm◦ Loose breaking algorithm

Loose algorithm can heal MARC records (sometimes)◦ Structural errors◦ Missing field or record markers

Page 10: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit: crosswalking design

MarcEdit model:◦ So long as a schema has been mapped to MARCXML, any

metadata combination could be utilized. This means that no more than two transformations will ever take place. Example: MODS MARCXML EAD

Page 11: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit Crosswalking model

MARC21XML

EAD

FGDC

MODSMARC

Dublin Core

Page 12: Make MarcEdit Work For You: OLC Technical Services Retreat

Registering XML Crosswalks in MarcEdit

Page 13: Make MarcEdit Work For You: OLC Technical Services Retreat

Automatic Crosswalk Operations

What’s MarcEdit doing?Facilitates the crosswalk by:

1. Performing character translations (MARC8-UTF8)2. Facilitates interaction between binary and XML formats.

Page 14: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC Records MarcEditor

◦ Specialized TextPad designed specifically for MARC records.

◦ Is UTF8 aware – can be used to generate records in MARC8 (though mnemonics) or UTF8 charactersets.

Page 15: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEditor Properties Templates

Fonts

Encodings

Preview Settings

Page 16: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit Templates Templates work much like Microsoft Word Templates

◦ Define a set of default data that will appear on a screen◦ Templates exist for all material formats◦ Can be customized to suit your needs.

Page 17: Make MarcEdit Work For You: OLC Technical Services Retreat

Paging Methods Why not just open the entire file?

◦ Memory limitations; while theoretical limits can reach into the 16 GBs, practical limits due to available RAM, etc. limit the application to displaying ~150-250 MB of text.

What are the Paging Methods?◦ MarcEdit has two:

◦ Preview Mode (disabled by default): Preview mode opens a snapshot of the file, and is best used for large (150-200 MB+) to remove any file loading penalties.

◦ Paging Mode (enabled by default): Loads files in “pages” showing nth number of records in each page. Changes made are made globally, but this allows users to jump between pages, and view all data in the file. Best if used on files 150-200 MB- as the program much create a memory map of all the records in the file.

Page 18: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC MarcEditor

◦ Supports a number of global editing functions:◦ Find/Replace functionality◦ Globally Add/Delete MARC fields◦ Globally Edit Subfield data

◦ Conditionally add/remove field data◦ Globally Edit Indicator data◦ Globally Swap field data◦ Record Deduplication◦ Record Sorting◦ Call Number Generator◦ Automation

Page 19: Make MarcEdit Work For You: OLC Technical Services Retreat

Specialized Tools Edit Subsets of Records:

◦ Tool allows users to extract subsets of a file, make changes, and save them back into the original file.

Edit Shortcuts:◦ Edit shortcuts represent tools that answer specialized questions, that don’t

rise to the level of having complete global editing functions. Examples, case conversion, Find records missing a field or subfield, etc.

Moving data between MarcEdit and the Web◦ MarcEdit can convert clipboard content into MARC8 or UTF8 so data can be

moved between different applications.

Page 20: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC – Find/Replace Works like a normal Find/Replace in most Textpad utilities.

Unlike most Textpads, Replace supports UTF-8 (when working with UTF-8 files) and regular expressions.

Page 21: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC – Find All Find all function was designed for use with the Paging mode

Allows users to find any text across all pages

Generates a jump list that can be used to find individual records for edit

Page 22: Make MarcEdit Work For You: OLC Technical Services Retreat

Jump List Find All

Page 23: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC – Global Add/Delete Field

Globally add fields to all MARC records◦ Allows users to set insertion position.

Globally delete fields◦ Allows global delete◦ Allows conditional delete

Supports Regular Expressions

Page 24: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC – Modifying subfield data

Allows for the modification of variable MARC field subfield data (MARC fields >10)

Allows for the modification of control field data by position or range of positions

Allows users to prepend and append data to subfields.

Allows users to change subfield tagging.

Page 25: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC – Modifying subfield data

Allows users to insert new subfields and define subfield placement.

Allows users to move field data from one field to another.

Supports:

◦ UTF-8 with UTF-8 files◦ Regular Expressions◦ Adding new subfields.

Page 26: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC – Modifying subfield data

Page 27: Make MarcEdit Work For You: OLC Technical Services Retreat

Editing MARC – Swapping Fields

Swap parts of MARC Fields or entire MARC fields◦ Define field, indicator and

subfields to move.◦ Can move field data and

delete the original field or clone the field data and move the clone to the new location.

◦ Can add data to an existing field.

Page 28: Make MarcEdit Work For You: OLC Technical Services Retreat

Character Conversions within the MarcEditor

MarcEditor allows users to convert character data between different charactersets.

Page 29: Make MarcEdit Work For You: OLC Technical Services Retreat

Fixing Boo-boos MarcEdit’s Special Undo

◦ Allows you to step back one global change.

Page 30: Make MarcEdit Work For You: OLC Technical Services Retreat

Sorting Fields MarcEdit provides multiple sorting types:

◦ Control Number◦ Sorts record position within the file

◦ Title◦ Sorts record position within the file

◦ Author◦ Sorts record position within the file

◦ Call Number◦ Sorts record position within the file

◦ 0xx Fields◦ Sorts the 0xx fields within individual records (does

*not* change record position within a file)

◦ All Fields◦ Sorts all fields within individual records (does *not*

change record position within a file)

◦ Custom Sort◦ Sorts all defined fields within individual records

(does *not* change record position within a file)

Page 31: Make MarcEdit Work For You: OLC Technical Services Retreat

Field Counts Field Count

◦ Provides a quick count of fields◦ Report of subfields used within a

particular field◦ Detailed reports of all

fields/subfields used within a fileset.

Page 32: Make MarcEdit Work For You: OLC Technical Services Retreat

Material Type Report Material Type Report

◦ Reports number of records by material type

◦ Breaks down material type by sub-types

◦ Utilizes the Leader, 008 and GMD to determine format types

Page 33: Make MarcEdit Work For You: OLC Technical Services Retreat

In-Line Validation MarcValidator-lite

◦ Can access MarcValidator for quick validation of data elements found in the file set

◦ Validation can use any defined rules set.

Page 34: Make MarcEdit Work For You: OLC Technical Services Retreat

Harvesting Metadata MarcEdit includes a builtin OAI harvester

Allows for direct XML=>MARC translations

Allows for custom modification of XSLT translation tables.

Page 35: Make MarcEdit Work For You: OLC Technical Services Retreat

Integrating with OCLC

Page 36: Make MarcEdit Work For You: OLC Technical Services Retreat

OCLC Classify Service MarcEdit can leverage OCLC WorldCat to generate call numbers automatically for files

◦ Fields used:◦ 001◦ 010$a$z◦ 020$a$z◦ 022$a$z◦ 024$a$z◦ 1xx$a◦ 776$w$z

Page 37: Make MarcEdit Work For You: OLC Technical Services Retreat

OCLC Classify Service

Page 38: Make MarcEdit Work For You: OLC Technical Services Retreat

Working with OCLC’s Metadata API

MarcEdit can work directly with WorldCat via the Metadata API.

Page 39: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit and WorldCat Available Operations:

◦ Create/Read/Update Bibliographic Records◦ Update/Delete Institutional Holdings◦ Retrieve Holding Code information about an Institution◦ Create/Read/Update Local Bibliographic Data

Page 40: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit and WorldCat A Word of Caution -- there is no net

Page 41: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit and WorldCat But this is really cool because:

◦ Further automate traditional technical services processes◦ Specifically holdings management◦ Batch record ingestion

◦ Build pipelines between our repository systems and WorldCat◦ Develop localized interfaces for metadata entry outside the library◦ Opens up the opportunity for tool builders to interact with the OCLC

member community

Page 42: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit: Batch WorldCat Holdings Management

Page 43: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit: Batch Bibliographic Record Upload

Page 44: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit and WorldCat Don’t forget – these functions are available in the MarcEditor as well

Page 45: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit and WorldCat What’s not there:

◦ Record Validation◦ Anything to do with authority data◦ Record Locking (for record editing)◦ Service Status◦ User Validation (for permission validation)

Page 46: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit and WorldCat How do I use this?

◦ You need to get a key from OCLC◦ OCLC’s Developer Network: http://oclc.org/developer/ ◦ OCLC Metadata API Documentation: http://oclc.org/developer/services/worldcat-metadata-api ◦ Notes on MarcEdit Integration: http://blog.reeset.net/archives/1245 ◦ C# OCLC API Library: https://github.com/reeset/oclc_api

Page 47: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit and RDA In Dec. 2012, I introduced the RDA Helper into MarcEdit

Purpose:◦ Provide automated conversion between AACR2 and RDA◦ Provide an automated process to update provisional RDA records to current

practice◦ Address concerns from librarians that still relied on the GMD, by providing an

automated method for regenerating the data.

Page 48: Make MarcEdit Work For You: OLC Technical Services Retreat

MarcEdit’s RDA Helper

Page 49: Make MarcEdit Work For You: OLC Technical Services Retreat

Troubleshooting Occasionally, errors can occur during install or with the configuration file.

◦ If configuration settings are not being saved, you can reset your configuration data.

Page 50: Make MarcEdit Work For You: OLC Technical Services Retreat

Troubleshooting Installation issues:

◦ Sometimes, the windows installer can get stuck making it so you cannot install or uninstall the program.◦ Use the MSI Cleaner: http://marcedit.reeset.net/software/msi_cleaner.zip

Page 51: Make MarcEdit Work For You: OLC Technical Services Retreat

Getting Help Youtube videos (just search for marcedit)

You can ask me: [email protected] or [email protected] MarcEdit Website: http://marcedit.reeset.net

MarcEdit Listserv: http://www.lsoft.com/scripts/wl.exe?SL1=MARCEDIT-L&H=MAIL04.GMU.EDU

Questions