19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/talks/19dig-preserv-apoyo.pdf · how can we ever hope...

8
9/19/19 1 Why Managing Digital Preservation is different than managing Analog Preservation Association for Heritage Preservation of the Americas—APOYO 30 Howard Besser New York University http://besser.tsoa.nyu.edu/howard/ http://www.nyu.edu/tisch/preservation/ APOYO30, 23/9/2019 1 Why Managing Digital Preservation is different than managing Analog Preservation Cultural Ins.tu.ons must keep things longer than other organiza.ons How Digital Preserva.on is different than Analog Preserva.on paradigms need to shi@ to “ongoing management” in the digital age Refreshing, and Emula.on & Migra.on Tools and Processes for managing digital preserva.on APOYO30, 23/9/2019 2 Sorry that I can’t be there APOYO30, 23/9/2019 3 I haven’t been in Rio since January APOYO30, 23/9/2019 4 But I started visiting Casa Rui Barbosa about 15 years ago APOYO30, 23/9/2019 And I know Casa Rui Barbosa well APOYO30, 23/9/2019 6

Upload: others

Post on 16-Oct-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/Talks/19dig-preserv-apoyo.pdf · How can we ever hope to deal with all the permutations and combinations APOYO30,23/9/2019 17 ... 3.Launch

9/19/19

1

Why Managing Digital Preservation is different than managing Analog Preservation

Association for Heritage Preservation of the Americas—APOYO 30

Howard BesserNew York University

http://besser.tsoa.nyu.edu/howard/http://www.nyu.edu/tisch/preservation/

APOYO 30, 23/9/2019 1

Why Managing Digital Preservation is different than managing Analog

Preservation

•  CulturalIns.tu.onsmustkeepthingslongerthanotherorganiza.ons

•  HowDigitalPreserva.onisdifferentthanAnalog–  Preserva.onparadigmsneedtoshi@to“ongoingmanagement”inthedigitalage

•  Refreshing,andEmula.on&Migra.on•  ToolsandProcessesformanagingdigitalpreserva.on

APOYO 30, 23/9/2019 2

SorrythatIcan’tbethere

APOYO 30, 23/9/2019 3

Ihaven’tbeeninRiosinceJanuary

APOYO 30, 23/9/2019 4

But I started visiting Casa Rui Barbosa about 15 years ago

APOYO 30, 23/9/2019

AndIknowCasaRuiBarbosawell

APOYO 30, 23/9/2019 6

Page 2: 19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/Talks/19dig-preserv-apoyo.pdf · How can we ever hope to deal with all the permutations and combinations APOYO30,23/9/2019 17 ... 3.Launch

9/19/19

2

AndIwastherejust9monthsago

APOYO 30, 23/9/2019 7

How long we keep things•  Companies keep information for days, or even years•  Individuals keep things for years, or a lifetime•  Archives, Libraries, and museums keep things for hundreds of

years

Cultural Institutions have a much greater responsibility for preservation!

APOYO 30, 23/9/2019 8

Keeping works and documentation for the Future

•  requires Digital Preservation

•  Why is Digital Preservation a problem?

APOYO 30, 23/9/2019 9

AnalogWorks•  Analog Photos, manuscripts, books, paintings, sculpture•  Objective is to make object itself endure (temperature/

humidity control, chemicals/pigments/fibers/adhesives, …)•  Goal is to keep object as close as possible to original state

(though occasionally controversy arises over whether to let aging show)

APOYO 30, 23/9/2019 10

Electronic Works•  Video, audio, digital, new media•  Difficult to make the original object endure (magnetic particle

deterioration, warping, etc.)•  Even if we could make the original object endure, we wouldn’t have the

infrastructure to view it in the future•  Need to develop a paradigm shift from preserving the original object to

preserving info content•  Need to pay more attention to maintaining authenticity and replicating user

experience

APOYO 30, 23/9/2019 11

With Analog Photos, we focus on the Original

•  We repair and try to make the original negative or original print continue to survive

•  But where is the original of your digital photo?

APOYO 30, 23/9/2019 12

Page 3: 19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/Talks/19dig-preserv-apoyo.pdf · How can we ever hope to deal with all the permutations and combinations APOYO30,23/9/2019 17 ... 3.Launch

9/19/19

3

Longevidade Digital

APOYO 30, 23/9/2019

Paradigms Shifts neededOld New

Physical preservation

atmospheric cntrl ongoing mgmt

What to save? artifact idea + ancillary material & documentation

Cataloging Individual work in hand

FRBR

Later access Artifact & documentation

Restaging, ancillary material & documentation

APOYO 30, 23/9/2019

The Short Life of Digital Info: Digital Longevity Problems-

✿ Disappearing Information✿ The Viewing Problem✿ The Scrambling Problem✿ The Inter-relation Problem✿ The Custodial Problem✿ The Translation Problem

APOYO 30, 23/9/2019 15

Viewing Problem—electronic works need every part of infrastructure to work

APOYO 30, 23/9/2019

The Viewing Problem

✿ Digital Info requires a whole infrastructure to view it

✿ Each piece of that infrastructure is changing at an incredibly rapid rate (software versions, file formats, operating systems, storage devices)

✿ How can we ever hope to deal with all the permutations and combinations

APOYO 30, 23/9/2019 17

Viewing Problem

•  Requires new file formats and new physical strata at regular intervals

•  Needs a serious Managed Environment•  Main InterPARES finding--the need for

complete lifecycle management– archivist needs to be incolved when record is

created and throughout active life

APOYO 30, 23/9/2019 18

Page 4: 19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/Talks/19dig-preserv-apoyo.pdf · How can we ever hope to deal with all the permutations and combinations APOYO30,23/9/2019 17 ... 3.Launch

9/19/19

4

Responding to serious Longevity Problems

✿ Previous formats required little ongoing intervention (remote storage facilities, Iron Mtn); digital formats require intense ongoing management

✿ Need for:✿ Preservation Repositories✿ Preservation Metadata

APOYO 30, 23/9/2019 19

Conceptual Approaches to Digital Preservation

•  Refreshing always necessary due to volatility of physical strata–  Impact on evidential value

•  Migration -- advantages & disadvantages•  Emulation -- advantages & disadvantages

•  And will need a long-term managed environment

APOYO 30, 23/9/2019 20

Migration•  Wordstar to Word 1 to Word 3, …•  -Tables and complex features often get

corrupted•  -Need to repeat every 4-5 years (maybe

forever)•  +We know how to do this ourselves•  +If there’s a problem, we can catch it soon

APOYO 30, 23/9/2019 21

Emulation•  Keep the Wordstar file format, but write emulators to make it

work in newer environments•  +A better chance of carrying over complexity•  +Many more features can survive•  -Problems may not be caught until it’s too late•  -Specialists and a whole infrastructure of emulators required•  -Serious © problems (reverse engineering?)

APOYO 30, 23/9/2019 22

Possible endless need for reformatting implies

•  Possible loss with each generation•  Requires managed environment

APOYO 30, 23/9/2019 23

Managed Environment

•  More than temperature & humidity control•  Periodic monitoring of the works•  Periodic monitoring of the technical

environment for viewing the works (software, systems, hardware)

•  Trusted repositories

APOYO 30, 23/9/2019 24

Page 5: 19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/Talks/19dig-preserv-apoyo.pdf · How can we ever hope to deal with all the permutations and combinations APOYO30,23/9/2019 17 ... 3.Launch

9/19/19

5

Preservation Repositories:�Open Archival Info System Model

Producer

Management

Consumer

APOYO 30, 23/9/2019

Digital Preservation Players•  Collection staff (need to reach agreement on

SIP/DIP and acceptable AIP transformations)– preservation/conservation staff– metadata staff– access staff

•  Repository staff•  Agreement negotiators

APOYO 30, 23/9/2019

Tools&SkillsneededforDigitalPreserva.onManagement

•  BasicDigitalLibraryrepositoryknowledge(OAIS/SIP/DIP,AuditCer.fica.on)

•  FileFormatIden.fica.on/Valida.on(DROID/JHOVE,PRENOM)(explain)

•  PREMIS(explain)•  Alsolearningapplica.onsandcommand-lineskillsfor:–  Crea.ngandvalida.ngcheck-sums(explain)–  Extrac.ngmetadatafromfileheaders(explain)– Managingbulkingest,…(explain)

•  HereareexamplesfromNYU’scurriculumforteachingthese-

APOYO 30, 23/9/2019 27

Exercises/ToolsforFileIden.fica.on

File ID Exercise1. Unzip the DROID folder using unzip droid-6.01.zip -d droid

2. Change to the droid directory

3. Launch DROID using sh droid.sh

4. Add sample files by clicking on the green Add button, then to DROID by going to home/miap/Desktop/samples, click OK

5. Click Start

6. What does it display?

7. What doesn’t it display?

8. Is file ID enough?

APOYO 30, 23/9/2019 28

DroidFileIden.fica.on

APOYO 30, 23/9/2019 29 APOYO 30, 23/9/2019

OCLC/RLG Efforts �PREMIS Data Model

30

Page 6: 19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/Talks/19dig-preserv-apoyo.pdf · How can we ever hope to deal with all the permutations and combinations APOYO30,23/9/2019 17 ... 3.Launch

9/19/19

6

APOYO 30, 23/9/2019

OCLC/RLG Efforts �PREMIS Data Dictionary Example

31

DigitalPreserva.onToolsTraining

APOYO 30, 23/9/2019 32

DigitalPreserva.onTools

APOYO 30, 23/9/2019 33

ChecksumEduca.on

APOYO 30, 23/9/2019 34

ChecksumEduca.on

APOYO 30, 23/9/2019 35

BatchProcessingChecksums

APOYO 30, 23/9/2019 36

Page 7: 19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/Talks/19dig-preserv-apoyo.pdf · How can we ever hope to deal with all the permutations and combinations APOYO30,23/9/2019 17 ... 3.Launch

9/19/19

7

ExtractMetadatafromDigitalFileHeaders

APOYO 30, 23/9/2019 37

ExtractMetadatafromDigitalFileHeaders

APOYO 30, 23/9/2019 38

ExtractMetadatafromDigitalFileHeaders

APOYO 30, 23/9/2019 39

Checkingifwell-formedPDFs/XMLswerereceived

APOYO 30, 23/9/2019 40

OCLC/RLG �Digital Repository Attributes

•  Administrative responsibility•  Organizational viability•  Financial sustainability•  Technological suitability•  System security•  Procedural accountability•  Certification

APOYO 30, 23/9/2019 41

Preservation Repositories:�may be too difficult for small institutions

•  May be too complex for small institutions to manage•  May be done through partnering (small museum or dance

company with University) or through consortia (museum association, state-wide organization, …) or through service bureaus (OCLC, Cloud)

•  Archive or museum will direct what is needed, but digital repository will carry out the actual work (as defined in SIP/DIP/AIP agreement)

•  And many will choose to just outsource the storage part of the repository because today there are many vendors offering this

APOYO 30, 23/9/2019 42

Page 8: 19dig-preserv-apoyobesser.tsoa.nyu.edu/howard/Talks/19dig-preserv-apoyo.pdf · How can we ever hope to deal with all the permutations and combinations APOYO30,23/9/2019 17 ... 3.Launch

9/19/19

8

Paradigms Shifts neededOld New

Physical preservation

atmospheric cntrl ongoing mgmt

What to save? artifact idea + ancillary material & documentation

Cataloging Individual work in hand

FRBR

Later access Artifact & documentation

Restaging, ancillary material & documentation

APOYO 30, 23/9/2019

Why Managing Digital Preservation is different than managing Analog

Preservation

APOYO 30, 23/9/2019 44

•  besser.tsoa.nyu.edu/howard/Talks•  http://www.brapci.inf.br/index.php/res/v/107431•  http://revista.arquivonacional.gov.br/index.php/

revistaacervo/article/view/26/26

Imagesofpresidentstogether

APOYO 30, 23/9/2019 45