what do image & soundbesser.tsoa.nyu.edu/howard/talks/imagem-som.pdf · 2009-06-26 ·...
TRANSCRIPT
6/25/09
1
Museus Imgagem Son, 26/6/09
In a Digital World, Image & Sound Collections require Stewardship:�
Selection, Curation, Preservation, Presentation
Howard Besser http://besser.tsoa.nyu.edu/howard/
http://www.nyu.edu/tisch/preservation/
In a Digital World, Image & Sound Collections require Stewardship:�
Selection, Curation, Preservation, Presentation- • What is Stewardship? • What do Image & Sound Museums already
do? • What kind of new things do we need to
learn about? – Digital Preservation – Digital Curation – Digital Selection
Museus Imgagem Son, 26/6/09
Digital Stewardship definitions
• “acquire, manage, organize, preserve, and provide access to massive amounts of data for use and re-use by a variety of interdisciplinary and heterogeneous communities over time”
• “Preservation + Use”
Museus Imgagem Son, 26/6/09
Digital Stewardship:�
Harvard Library OIS definition • the management of digital objects over the long term through careful digital
asset management practices. Collection managers and DRS staff must work together to manage stored digital objects throughout all phases of the objects' life cycle. This section describes the major areas of digital stewardship in the life cycle of a digital object:
• Assessment and selection phase: Collection manager performs a curatorial assessment of materials intended for DRS storage.
• Acquisition and creation phase: Collection manager (in consultation with HUL analysts) selects digital formats and defines technical specifications and workflow processes for creation of objects and related metadata.
• Deposit phase: DRS ensures a successful deposit by validating each package of digital objects and related metadata that is submitted to DRS.
• Archive and preservation phase: DRS staff perform periodic checks to ensure the usability of digital objects over time. This includes periodic reports to collection managers about their objects.
Museus Imgagem Son, 26/6/09
Digital Stewardship • Includes much of what cultural institutions have always
done (select, acquire, preserve, provide access, maintain context)
• But also adds some of the activities that audiovisual collections have always done (maintain knowledge of the original equipment and formats, manage process of reformatting when older hardware is difficult to maintain
• Also includes important notions from the digital world about ongoing management of re-formatting information, technical metadata, contextual info, repository issues, …
Museus Imgagem Son, 26/6/09
What do Image & Sound Museums already do?-�
• Have broad collections where each needs to be handled in different ways
• Handle Re-Formatting • Have an online presence & provide access
to their collections • Encourage new activities in the Sound &
Image communities
Museus Imgagem Son, 26/6/09
6/25/09
2
MIS Collection Content (turntable in RJ)
Museus Imgagem Son, 26/6/09
MIS Collection Content (posters in MG)
Museus Imgagem Son, 26/6/09
MIS Collection Content (newsfilm in MG-1)
Museus Imgagem Son, 26/6/09
MIS Collection Content (newsfilm in MG-2)
Museus Imgagem Son, 26/6/09
Museus Imgagem Son, 26/6/09
Media News And other cultural organizations have similar collections (but
sometimes for different purposes)
Museus Imgagem Son, 26/6/09
6/25/09
3
Museus Imgagem Son, 26/6/09
Bumba-Meu-Boi
Museus Imgagem Son, 26/6/09
TV History
Museus Imgagem Son, 26/6/09
TV History
Museus Imgagem Son, 26/6/09
TV History
Museus Imgagem Son, 26/6/09
TV History
Museus Imgagem Son, 26/6/09
TV History
6/25/09
4
Handle Re-Formatting
Museus Imgagem Son, 26/6/09
MIS Collection Content (video transfer in RJ)
Museus Imgagem Son, 26/6/09
Have an online presence &�provide access to their collections
Museus Imgagem Son, 26/6/09
MIS online activities-SP
Museus Imgagem Son, 26/6/09
MIS online activities-RJ
Museus Imgagem Son, 26/6/09
MIS online activities:�
Highlighting Collections
Museus Imgagem Son, 26/6/09
6/25/09
5
Encourage new activities in the Sound & Image communities�
Museus Imgagem Son, 26/6/09
MIS activities in Campinas
Museus Imgagem Son, 26/6/09
MIS activities in Cuiabá
Museus Imgagem Son, 26/6/09
What kind of new things do we need to learn about?�
• Digital Preservation • Digital Curation • Digital Selection
Museus Imgagem Son, 26/6/09
Museus Imgagem Son, 26/6/09
The Short Life of Digital Info: Digital Longevity Problems-
• Digital Preservation • The Viewing Problem • The Scrambling Problem
• Digital Curation • The Inter-relation Problem • The Custodial Problem • The Translation Problem
Digital Preservation
Museus Imgagem Son, 26/6/09
6/25/09
6
Museus Imgagem Son, 26/6/09
The Viewing Problem
• Digital Info requires a whole infrastructure to view it
• Each piece of that infrastructure is changing at an incredibly rapid rate
• How can we ever hope to deal with all the permutations and combinations
Museus Imgagem Son, 26/6/09
Viewing Problem
• Requires new file formats and new physical strata at regular intervals
• Needs a serious Managed Environment • Main InterPARES finding--the need for
complete lifecycle management – archivist needs to be incolved when record is
created and throughout active life
Museus Imgagem Son, 26/6/09
The Scrambling Problem�
Dangers from: Compression to ease storage & delivery Encryption to enhance digital commerce
Museus Imgagem Son, 26/6/09
Save the Hardware & Software-
• A huge undertaking • Computer Museum • Broderbund
Besser-Pratt Gatewai, 5/2/08 35
Keep Old Video Equipment--NARA
Besser-Pratt Gatewai, 5/2/08 36
Keep Old Video Equipment--NARA
6/25/09
7
Besser-Pratt Gatewai, 5/2/08 37
Keep Old Audio Equipment-NARA
Museus Imgagem Son, 26/6/09
Old Video Formats
Museus Imgagem Son, 26/6/09
Old Digital Formats
Museus Imgagem Son, 26/6/09
Save the Hardware & Software
• A huge undertaking • Computer Museum • Broderbund
Museus Imgagem Son, 26/6/09
Conceptual Approaches to Digital Preservation
• Refreshing always necessary due to volatility of physical strata – Impact on evidential value
• Migration -- advantages & disadvantages • Emulation -- advantages & disadvantages
• And will need a long-term managed environment
Museus Imgagem Son, 26/6/09
Migration
• Wordstar to Word 1 to Word 3, … • -Tables and complex features often get
corrupted • -Need to repeat every 4-5 years (maybe
forever) • +We know how to do this ourselves • +If there’s a problem, we can catch it soon
6/25/09
8
Museus Imgagem Son, 26/6/09
Emulation • Keep the Wordstar file format, but write emulators to make
it work in newer environments • +A better chance of carrying over complexity • +Many more features can survive • -Problems may not be caught until it’s too late • -Specialists and a whole infrastructure of emulators
required • -Serious © problems (reverse engineering?)
Museus Imgagem Son, 26/6/09
Brittle Newspapers (Australia Battye Library)
Museus Imgagem Son, 26/6/09
Film Decay (LC Dayton)
Museus Imgagem Son, 26/6/09
Why do we Reformat?
• Because we cannot sustain the original object (its physical characteristics are deteriorating too fast)
• Because continued access and handling of the original object will rapidly decay its physical characteristics (so we create a surrogate for users and store the original in very good conditions, away from users)
• Because viewing the work requires some kind of technology, and we can’t keep that technology working very far into the future-
Besser-MIAP Intro, 9/4/07 47
Lost Tapes, Found SoundsExhibition�Harold Schellinex
Besser-MIAP Intro, 9/4/07 48
Obsolete or deteriorated Physical Carriers
6/25/09
9
Besser-MIAP Intro, 9/4/07 49
Obsolete Carriers & Info Techn
Museus Imgagem Son, 26/6/09
Metal sound recording Disks�Casa Rui Barbosa
Museus Imgagem Son, 26/6/09
Paper print (LC Dayton)
Museus Imgagem Son, 26/6/09
Record Turntables
Museus Imgagem Son, 26/6/09
Slide Projector
Museus Imgagem Son, 26/6/09
Limitations of Reformatting
• Authenticity issues • User behaviors (newspaper, book, video
game, …) • Users mistaking the reformatted work for
the original
6/25/09
10
Museus Imgagem Son, 26/6/09
Critiques of Reformatting
Mainly User Behaviors
• Can’t view outside the library
• Only sequential access • Viewing and studying
is awkward • …
Museus Imgagem Son, 26/6/09
But if we don’t Reformat, we totally lose some kinds of works�
(particularly audiovisual works like film) • 50% of all titles produced before 1950 have vanished (approximate
number as of late 1970s) • This reflects full-length features; survival rates are much lower for
other types (studio newsreels, shorts, docs, independent, …), and these “orphans” are particularly in peril
• Fewer than 20% of features from 1920s survive in complete form; survival rates of 1910s is <10% (& none of these are negatives)
-Film Preservation 1993: A Study of the Current State of American Film Preservation, Vol 1: Report, June 1993, Report of the Librarian of
Congress (http://www.loc.gov/film/study.html)
Museus Imgagem Son, 26/6/09
And sometimes we have to reformat because of technology changes
• We don’t have video players to play tapes made 25 years ago
• We don’t have 8-inch floppy disk drives, syquest drives, zip drives
• We don’t have Windows 3 operating systems
• But this is something that conservators have always dealt with…
Museus Imgagem Son, 26/6/09
We sometimes have no control over the technologies we use
• Environmental--Government mandates to discontinue use of halon gas as fire extinguisher
• Economic--Companies in economic trouble will cease manufacturing technologies that aren’t hugely profitable – Ilford (2004 bankruptcy) – Agfa (2005 bankruptcy) – Kodak-
Museus Imgagem Son, 26/6/09
Kodak stops making some films
Museus Imgagem Son, 26/6/09
Kodak stops making some papers
6/25/09
11
Museus Imgagem Son, 26/6/09
Basic Economics • The cultural community is not large enough a purchaser to
sustain many types of manufacturing and technological production
• Many of the things we use are based upon larger production runs for larger (and richer) communities
• Therefore, we need to be periodically monitoring the economic health of our suppliers, and be aware of long-term trends affecting their other customers
Museus Imgagem Son, 26/6/09
Storing on CDs becomes a big problem over time
Besser-Film/Video/Audio-ABRACOR, 2/9/06
63
Consumers replace their CDs with a hard disk (& so should you)
Museus Imgagem Son, 26/6/09
Possible endless need for reformatting implies
• Possible loss with each generation • Requires managed environment
Museus Imgagem Son, 26/6/09
Managed Environment
• More than temperature & humidity control • Periodic monitoring of the works • Periodic monitoring of the technical
environment for viewing the works (software, systems, hardware)
• Trusted repositories
Museus Imgagem Son, 26/6/09
Paradigms Shifts needed Old New
Physical preservation
atmospheric cntrl ongoing mgmt
What to save? artifact idea + ancillary material & documentation
Cataloging Individual work in hand
FRBR
Later access Artifact & documentation
Restaging, ancillary material & documentation
6/25/09
12
Digital Curation
Museus Imgagem Son, 26/6/09 Museus Imgagem Son, 26/6/09
The Inter-relation Problem
-Info is increasingly inter-related to other info
-How do we make our own Info persist when it points to and integrates with Info owned by others?
-What is the boundary of a set of information (or even of a digital object)?
Museus Imgagem Son, 26/6/09
The Custodial Problem
In the past, much of survival was due to redundancy
How do we decide what to save? Who should save it? Mellon-funded E-Journal Archives
How should they save it?-
Museus Imgagem Son, 26/6/09
The Custodial Problem:�How to save information?
Methods for later access Refreshing Migration Emulation
Issues of authenticity and evidence
Museus Imgagem Son, 26/6/09
The Translation Problem
Content translated into new delivery devices changes meaning – -A photo vs. a painting – -If Info is produced originally in digital form in
one encoded format, will it be the same when translated into another format?
– Behaviors
Museus Imgagem Son, 26/6/09
The Translation Problem�Thinking of the Future (1/2)
• Screens will be different resolutions and different aspect ratios
• CRTs won’t exist • A decade or 2 from now, today’s user
interfaces will look like arrow-key navigation looks like today
6/25/09
13
Museus Imgagem Son, 26/6/09
The Translation Problem�Thinking of the Future (2/2)
• Today’s streaming media are small windows, slow speeds
• As bandwidth increases, viewers will expect higher quality streams
• Creators may need to consider how they’ll be able to deliver higher-bandwidth streams – Delivery Derivatives vs. Masters encoded w/standards – May also want to re-edit the piece to take advantage of
changes in technology, viewer expectations, society- Museus Imgagem Son, 26/6/09
Screen Formats
Museus Imgagem Son, 26/6/09
Many quality questions
• Quality of playback? • Theater experience?
Museus Imgagem Son, 26/6/09
Standards, Metadata, & Best Practices to follow-
• Risk Management • Best Practices for Reformatting • Preservation Repositories & Metadata • Other Metadata & Standards
Museus Imgagem Son, 26/6/09
Risk Management • We can’t say definitively that we can make every digital work persist • What we CAN say is that the more a digital work conforms to
standards and best practices, the greater the likelihood that we can assure persistance
• Our preservation repositories can even accept deposits of non-conforming works, but the less they conform, the less likely that they’ll be salvageable
• Persistance is most likely for works that share standards, metadata, and best practices
Museus Imgagem Son, 26/6/09
Reformatting Best Practices (still images) • Think about users (and potential users),
uses, and type of material/collection • Scan at the highest quality that does not
exceed the likely potential users/uses/material
• Do not let today’s delivery limitations influence your scanning file sizes; understand the difference between digital masters and derivative files used for delivery
• Many documents which appear to be bitonal actually are better represented with greyscale scans
• Include color bar and ruler in the scan
• Use objective measurements to determine scanner settings (do NOT attempt to make the image good on your particular monitor or use image processing to color correct)
• Don’t use lossy compression • Store in a common (standardized)
file format • Capture as much metadata as is
reasonably possible (including metadata about the scanning process itself)
6/25/09
14
Museus Imgagem Son, 26/6/09
Preservation Repositories:�Open Archival Info System Model
Producer
Management
Consumer
OAIS Functional Entities�(AHDS, Moving Images and Sound Archiving Study, June 2006)
Museus Imgagem Son, 26/6/09
Museus Imgagem Son, 26/6/09
Preservation Repositories:�Open Archival Info System Model
High-level reference model describing submission, organization and management, and continuing access
Conceptual framework for different organizations to share discussions with a common language
Producers, consumers, management, actual repository SIP, DIP, AIP AIP consists of data objects plus representation info (Content,
Preservation Description, Packaging, Descriptive) Originally developed for Space Science community
Museus Imgagem Son, 26/6/09
OCLC/RLG�Digital Repository Attributes
• Administrative responsibility • Organizational viability • Financial sustainability • Technological suitability • System security • Procedural accountability • Certification-
Museus Imgagem Son, 26/6/09
�Trustworthy Repositories Audit &
Certification (TRAC): Criteria and Checklisthttp://www.crl.edu/content.asp?l1=13&l2=58&l3=162&l4=91
• Organizational Infrastructure • Digital Object Management • Technologies, Technical Infrastructure, & Security • Who can benefit? • The document benefits those who work in or are responsible for digital repositories and who want to be
certified against its requirements, as well as for those who execute the audit and certification process. However, the guidelines are also designed for a much broader audience—for organizations planning repositories and for producers of digital materials.
• When does it come in handy? • —For staff working within repositories: The checklist is useful in helping them to determine the criteria
they need to plan for and to meet for safely storing and migrating digital materials. • —For producers of digital materials: It also serves producers of digital materials by helping them know
what to expect from the repositories they work with and how to streamline interactions with the repositories.
• —For those considering outsourcing their digital materials storage: The checklist will help organizations considering an external agency to serve their digital preservation needs to ensure that contracted organizations are executing the task adequately.
Museus Imgagem Son, 26/6/09
OAIS-steps in the process
• Agreement • Ingest • Store and manage/maintain
– Refresh – Emulate/Migrate/others
• Disseminate
6/25/09
15
Museus Imgagem Son, 26/6/09
Metadata
• Containers/Packaging for SIP (METS) • AIP • Preservation (PREMIS)
Museus Imgagem Son, 26/6/09
AIP Metadata
• Preservation Description Info – reference info – context info – provenance info – fixity info
• Packaging Info • Descriptive Info • Content Info
Museus Imgagem Son, 26/6/09
OCLC/RLG Efforts�PREMIS Data Model
Museus Imgagem Son, 26/6/09
OCLC/RLG Efforts�PREMIS Data Dictionary Example
DCC Digital Curation Life-cycle�http://www.dcc.ac.uk/
Museus Imgagem Son, 26/6/09
Digital Curation
• Planning & risk management • Open flexible file formats • Format registries • OAIS repositories • Extensive Metadata • Individuals & organizations-
Museus Imgagem Son, 26/6/09
6/25/09
16
Organisation to Engage & Collaborate
Museus Imgagem Son, 26/6/09
Industry
research collaborators
standards bodies
testbeds�& tools
communities of practice: users
community support & outreach
research
development co-ordination
service definition & delivery
management & admin support
Associates Network
curation �organisations �eg DPC
a centre of expertise in data curation and preservation
OCLC October 2006 Museus Imgagem Son, 26/6/09
Digital Preservation Players
• Collection staff (need to reach agreement on SIP/DIP and acceptable AIP transformations) – preservation/conservation staff – metadata staff – access staff
• Repository staff • Agreement negotiators
Museus Imgagem Son, 26/6/09
Preservation Repositories:�too difficult for small institutions
• Too complex for small institutions to manage • Will be done through partnering (small museum or dance
company with University) or through consortia (museum association, state-wide organization, …) or through service bureaus (OCLC)
• Archive or museum will direct what is needed, but digital repository will carry out the actual work (as defined in SIP/DIP/AIP agreement)
Digital Selection
Museus Imgagem Son, 26/6/09
Museus Imgagem Son, 26/6/09
But who is preserving today’s “born-digital” works?
• In the past, we knew about history by finding written documents: – Changes between different drafts of a scientific or literary paper – Letters and correspondence between a scientist (or literary figure)
and colleagues (that both helps contextualize the work, and lets us see changes in thought processes or discovery)
• But today, these documents are not on paper! They are in the form of: – Email correspondence – Word processing files that do not show changes between drafts
/versions
• Who will take responsibility to save these works for future study? Museus Imgagem Son, 26/6/09
Selecting and managing digital works
• “born-digital” works – Web-based media (like
the first songs of a band that later becomes famous)
– Email, Documents, Architectural drawings, Photographs
– Inventory records & DBs
– CD ROMs – DVDs
• Works converted to digital form – Digitized documents – Digitized photos – Digitized maps – Digitized audio – Digitized video – …
6/25/09
17
Museus Imgagem Son, 26/6/09
We convert works to digital or need born-digital because:
• We want to make documents or photographs available to people without them having to come to our building
• We want to capture documentation of our intangible heritage (patrimonio imaterial) (music, dance, performance) – And most of today’s documentation methods
are digital (sound, photographs, movies) Museus Imgagem Son, 26/6/09
SP Supreme Court Justice Enio Santarelli Zuliani shuts down YouTube
Museus Imgagem Son, 26/6/09 Museus Imgagem Son, 26/6/09
Intro to Imaging
Museus Imgagem Son, 26/6/09
Out of Print Other interesting projects
• Digital Public Television Preservation • Disaster Prevention & Recovery
– Restoration and distribution of Helen Hill’s works
• …
Museus Imgagem Son, 26/6/09
6/25/09
18
Museus Imgagem Son, 26/6/09
NYU/Public Television Project • Preserve a broad set of elements (including ancillary
material) • Life-cycle mgmt (add metadata as soon as a clip comes in) • Establish a community of stakeholders, working together
for preservation (stations, university, librarians, journalists, historians, producers, scholars, …)
• Build an OAIS Server • Explore appropriate file formats, wrappers, METS
extensions • Develop sustainable business model
Museus Imgagem Son, 26/6/09
Public Television is home of
Local broadcasting is nearly all via digital systems �
Thirteen Master Control manages one high definition, two analog and three digital over-the-air broadcast channels.�
Monitors, monitors, monitors to keep an eye on everything…
Museus Imgagem Son, 26/6/09 Museus Imgagem Son, 26/6/09
The key is designing a preservation repository that the system can afford to maintain and use. �
To test this out, we are focusing on -- • Appraisal and Selection – developing criteria and standards for what to
preserve and by whom
• File Formats and Packages – determining the best formats for our various uses, plus testing the suitability of file “packaging” for long term preservation
• Metadata and Related Topics – specifying technical, descriptive and rights information
• Repository Design – technical architecture, administrative policies and potential business models
• Sharing Our Findings – Keeping the public broadcasting community involved and informed all along the way
Museus Imgagem Son, 26/6/09
Project activities include -- • Completing an inventory of at-risk materials to better quantify our
holdings and prepare for selection
• Reviewing best practices and most up-to-date developments in the field of video archiving
• Conducting facilitated discussions on key topics to guide setting standards and policies
• Establishing an Advisory Committee to assist with selection criteria
• Ingesting sample materials and testing the repository
• Presenting regular reports to public broadcasting and moving image archive community for ongoing feedback
Museus Imgagem Son, 26/6/09
http://www.nyu.edu/tisch/preservation/research/
6/25/09
19
Museus Imgagem Son, 26/6/09
New Orleans:�
Film Projector
Museus Imgagem Son, 26/6/09
New Orleans:�
Still Photo Damage
Museus Imgagem Son, 26/6/09 Museus Imgagem Son, 26/6/09
New Orleans Films projected�Helen Hill films
Museus Imgagem Son, 26/6/09
New Orleans Films projected�Helen Hill films
Museus Imgagem Son, 26/6/09
New Orleans Films projected�Helen Hill films
6/25/09
20
Museus Imgagem Son, 26/6/09
New Orleans Films projected�Helen Hill films
Museus Imgagem Son, 26/6/09
New Orleans Films projected�Helen Hill films
Museus Imgagem Son, 26/6/09
We looked at preventative techniques & their limitations
Museus Imgagem Son, 26/6/09
Conclusions for preserving all types of digital works:�
Digital Repository Traditions & Services require
Sustainability Interoperability Access
And all of these require Standards and Metadata
Museus Imgagem Son, 26/6/09
Conclusions for preserving all types of complex works:�
From the technological point of view�Standards offer the best hope of
overcoming Impediments
• Easier to maintain a single set of standards over long periods of time
• Puts your institution in the same large boat with lots of other institutions who will face obsolescence and migration problems periodically throughout the future
Museus Imgagem Son, 26/6/09
for artistic and other challenging works:�
How Best to save these works? • Use Standards wherever possible • Be aggressive about asset mgmt -- saving component parts and
ancillary materials • Both creator and Archive should develop an institution-wide plan for
saving electronic works – Refreshing and either migration or emulation – Standard encoding schemes – What is the work? And prioritize what needs to be saved – Save ancillary materials and records
6/25/09
21
Museus Imgagem Son, 26/6/09
Paradigms Shifts needed Old New
Physical preservation
atmospheric cntrl ongoing mgmt
What to save? artifact idea + ancillary material & documentation
Cataloging Individual work in hand
FRBR
Later access Artifact & documentation
Restaging, ancillary material & documentation Museus Imgagem Son, 26/6/09
So, with electronic works, the focus should be less on stable temperature�
(Helsinki underground vaults)
Museus Imgagem Son, 26/6/09
And less on the construction of Vaults (Helsinki underground vaults)
Museus Imgagem Son, 26/6/09
But more on ongoing management of a work without worrying so much about
physical embodiment
Producer
Management
Consumer
In a Digital World, Image & Sound Collections require Stewardship:�
Selection, Curation, Preservation, Presentation Howard Besser, NYU Moving Image Archiving & Preservation Program
• http://besser.tsoa.nyu.edu/howard/Talks/ • http://besser.tsoa.nyu.edu/howard/longevity/ • http://www.nyu.edu/ tisch/preservation • http://www.nedcc.org/resources/digitalhandbook/ix.htm
• http://www.dcc.ac.uk/
• http://www.digitalpresservation.gov/ • UC Libraries Systemwide Operations and Planning Advisory Group (SOPAG) Site
http://www.slp.ucop.edu/sopag/ for the UC Digital Preservation & Archiving Committee Final Report
• http://www.firstmonday.dk/issues/issue7_6/besser/ • http://www.ifla.org/II/metadata.htm • METS official site: http://www.loc.gov/standards/mets Museus Imgagem Son, 26/6/09