SCAPE Information Day at BL - Some of the SCAPE Outputs Available

Other SCAPE outputs available. William Palmer, Peter May & Carl Wilson. SCAPE Information Day, British Library, UK, 14th July 2014

Upload: scape-project

Post on 05-Dec-2014

DESCRIPTION

The British Library hosted a ‘SCAPE Information Day’ on 14 July 2014. The information day introduced participants to the EU-funded project SCAPE (Scalable Preservation Environments) and its tools and services. Some tools were presented and demonstrated in more detail (see the other presentations), and the day closed with a presentation by Will Palmer, Carl Wilson and Peter May on some of the other outputs that SCAPE has delivered.

TRANSCRIPT

Page 1: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

Other SCAPE outputs available

William Palmer, Peter May & Carl Wilson

SCAPE Information Day

British Library, UK, 14th July 2014

Page 2: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• Digital Repository

• RODA: Repository of Authentic Digital Objects

• Delivers functionality of the main units of the OAIS model

• Uses Fedora Commons (2/3) / EAD / PREMIS / …

• Fedora 4 (in development, beta status)

• Improved scalability

• Flexible storage options

• Better linked-data capabilities

• RODA & Fedora 4 implement standard SCAPE APIs for other tools/services to interact with a repository

RODA – Demo / Fedora 4

This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).

Page 3: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• Preservation Planning

• Walks you through making preservation plans

• Use components[1] from the Component Catalogue to test/design preservation workflows

• For example, automatically design an image migration workflow

[1] http://openplanets.github.io/scape-component-profiles/

Plato - Demo

Plan: Image migration to TIFF

Page 4: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• “Preservation Watch” service

• Ideally use a central instance, aggregating preservation watch information from several repositories

• Detect risks and opportunities

• Uses plugins for easy integration of new information sources

• Users can create triggers for when risks and opportunities occur

• Notifies users when events occur, such as format obsolescence or the availability of new preservation tools

• Compare repository information against that of other repositories

SCOUT - Demo
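The trigger idea on this slide can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the `Trigger` class, the `fired_triggers` function, and the format-count data are hypothetical and do not reflect Scout's actual data model or API.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Trigger:
    """Hypothetical trigger: a name plus a condition over watched data."""
    name: str
    condition: Callable[[Dict[str, int]], bool]


def format_share(counts: Dict[str, int], fmt: str) -> float:
    """Fraction of all objects that are in the given format."""
    total = sum(counts.values())
    return counts.get(fmt, 0) / total if total else 0.0


def fired_triggers(triggers: List[Trigger], counts: Dict[str, int]) -> List[str]:
    """Return the names of all triggers whose condition currently holds."""
    return [t.name for t in triggers if t.condition(counts)]


# Illustrative, made-up repository profile: object counts per format.
counts = {"tiff": 700, "jp2": 250, "bmp": 50}
triggers = [
    Trigger("bmp above 1%", lambda c: format_share(c, "bmp") > 0.01),
    Trigger("jp2 above 50%", lambda c: format_share(c, "jp2") > 0.50),
]
print(fired_triggers(triggers, counts))
```

In a real deployment the counts would come from the information sources the slide mentions, and a fired trigger would lead to a user notification rather than a printed list.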

Page 5: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• An open-source Workflow Management System

• Graphically design and execute workflows*

• *command line and server execution environments available

• Integrates with the Component Catalogue at MyExperiment.org

• …

Taverna

Page 6: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• Uses SCAPE tool specification files “toolspecs”

• Standardise tool invocation to a standard type:

• QA / Characterisation / Migration / etc.

• Create Debian packages for standardised invocation

• Create Taverna Components for upload to the Component Catalogue (on MyExperiment)

• Component Profiles: http://www.myexperiment.org/tags/3693.html

• Components: http://www.myexperiment.org/tags/3214.html

ToolWrapper / Component Catalogue

Page 7: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• “To MapReduce”: wrap command line applications for parallel execution in a Hadoop MapReduce job

• (Re)uses SCAPE tool specification “toolspecs” (as used by the ToolWrapper)

• Can parallelise standalone tools that cannot otherwise be integrated within Hadoop/MapReduce

• Can be used to run a step in a Taverna workflow (see demo)

ToMaR - Demo
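The wrapping idea described on this slide can be sketched in a few lines. This is not ToMaR and does not use Hadoop: a local thread pool stands in for the map phase, and the command template is a hypothetical stand-in for a toolspec (real toolspecs are separate specification files with their own format). The "tool" here is just a Python one-liner that upper-cases its argument, so the example runs anywhere.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor

# Hypothetical command template standing in for a toolspec entry.
# "{input}" is replaced by one input record per invocation.
TEMPLATE = [sys.executable, "-c",
            "import sys; print(sys.argv[1].upper())", "{input}"]


def build_command(template, input_value):
    """Fill the {input} placeholder in every argument of the template."""
    return [arg.replace("{input}", input_value) for arg in template]


def run_tool(input_value):
    """Map step: run the wrapped command-line tool for one input record."""
    result = subprocess.run(build_command(TEMPLATE, input_value),
                            capture_output=True, text=True, check=True)
    return input_value, result.stdout.strip()


def run_all(inputs, workers=4):
    """Run the map step over all inputs in parallel; collect outputs by input."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(run_tool, inputs))


if __name__ == "__main__":
    print(run_all(["a.tif", "b.tif"]))
```

The design point is that the tool itself stays an unmodified standalone program; only the invocation is templated, which is what allows the same description to drive many parallel invocations.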

Page 8: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• Hadoop-based web archive record processing

• ARC to WARC migration

• Identify files with DROID

• Identify files with Tika

• Different approach to identification/characterisation from Nanite

Hawarp

Page 9: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• Sits on top of the UK Web Archive

• Some [older] content may not be viewable with contemporary software/hardware

• Therefore, provide:

• Emulation-on-access/demand

• Migration-on-access/demand

• Rate the access: +1/-1 buttons

• Potentially use ratings to inform future preservation actions

Interject - Demo

Page 10: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• Compares web pages using two approaches:

• Visual

• Structural (HTML etc.)

• Similarity scores are produced

• Can be used for QA, comparing archive copy with live site

Pagelyzer - Demo
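As a rough illustration of what a structural similarity score might look like, the sketch below compares the sequences of HTML start tags in two pages. This is an assumed, simplified measure chosen for illustration, not Pagelyzer's actual algorithm, and it ignores the visual comparison entirely. The function names are hypothetical.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Collects the names of start tags in document order."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def tag_sequence(html):
    """Return the list of start-tag names found in the HTML string."""
    collector = TagCollector()
    collector.feed(html)
    collector.close()
    return collector.tags


def structural_similarity(html_a, html_b):
    """Score in [0, 1]: 1.0 means the two tag sequences are identical."""
    return SequenceMatcher(None, tag_sequence(html_a), tag_sequence(html_b)).ratio()


archived = "<html><body><p>x</p></body></html>"
live = "<html><body><p>x</p><div>y</div></body></html>"
print(structural_similarity(archived, live))
```

For QA, a score below some chosen threshold could flag an archived copy for manual inspection; the threshold would need to be tuned on real pages.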

Page 11: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

• Suite of tools for:

• Detecting overlap in two audio files

• Finding occurrences of a smaller WAV in a bigger one (and pre-index files for analysis)

• Finding the similarity between two audio files

xCorrSound - Demo
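Finding a shorter clip inside a longer recording is commonly done with cross-correlation, which the tool's name suggests. The sketch below is the direct, unoptimised form on plain lists of samples, assuming both signals are already decoded to the same sample rate. It is not xCorrSound's implementation, which may well use a faster method, and `best_offset` is a hypothetical name.

```python
import math


def best_offset(long_signal, short_signal):
    """Return (offset, score) where the normalised cross-correlation peaks.

    score is in [-1, 1]; 1.0 means the window matches the short signal
    up to a positive scale factor.
    """
    n = len(short_signal)
    if n == 0 or n > len(long_signal):
        raise ValueError("short_signal must be non-empty and no longer than long_signal")

    short_norm = math.sqrt(sum(x * x for x in short_signal))
    best = (0, -1.0)
    for offset in range(len(long_signal) - n + 1):
        window = long_signal[offset:offset + n]
        window_norm = math.sqrt(sum(x * x for x in window))
        if window_norm == 0 or short_norm == 0:
            continue  # silent window or silent clip: correlation undefined
        dot = sum(a * b for a, b in zip(window, short_signal))
        score = dot / (window_norm * short_norm)
        if score > best[1]:
            best = (offset, score)
    return best


recording = [0, 0, 1, 3, -2, 0, 0]
clip = [1, 3, -2]
offset, score = best_offset(recording, clip)
print(offset, round(score, 6))
```

The same score can serve as a crude similarity measure between two files of equal length, since in that case there is only one window to compare.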

Page 12: SCAPE Information Day at BL - Some of the SCAPE Outputs Available

Sustainability of Tools and Services

SCAPE tools are published as open source software.

Tools and services from SCAPE are sustained by:

• Open Planets Foundation - addresses core digital preservation challenges and engages with the community

• COPTR - Community Owned digital Preservation Tool Registry