major repositories using the arctos database:

18
Arctos A multi-institution, multi-collection museum database http:// arctos .database.museum http://code. google .com/p/ arctos /

Upload: cedric

Post on 01-Feb-2016

40 views

Category:

Documents


0 download

DESCRIPTION

Arctos A multi-institution, multi-collection museum database http://arctos.database.museum http://code.google.com/p/arctos/. Major repositories using the Arctos database: (34 collections of specimens or observations, 1.3M records). in partnership with. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Major repositories using the Arctos database:

ArctosA multi-institution, multi-

collection museum databasehttp://arctos.database.museum

http://code.google.com/p/arctos/

Page 2: Major repositories using the Arctos database:

Major repositories using the Arctos database:(34 collections of specimens or observations, 1.3M records)

Page 3: Major repositories using the Arctos database:

in partnership with

which is a member of

TeraGrid – A nationwide network of 11 supercomputing facilities

U. S. National Science Foundation’s

Office of Cyberinfrastructure

which is sponsored by

Page 4: Major repositories using the Arctos database:

Arctos: A 15 year history MVZ: 1995 - 1995 - Hired Stan Blum to develop relational data model (following

modeling by Assoc. Systematic Collections).

MVZ: 1997 - Hired John Wieczorek to implement model (desktop application) using Sybase and Versata. Partial implementation (e.g., no loans).

UAM: 1998-2000 - John W. migrated mammal data to Oracle, set up Versata.

UAM: 2002 - Dusty McDonald replaced Versata with ColdFusion, implemented full model (first web-based instance, aka Arctos).

MSB: 2003 – Joined Arctos at UAM (first multi-hosting instance).

MVZ and MCZ: 2005-2007 - Implemented separate instances of Arctos at Berkeley and Harvard (MVZ: first Postgres, then Oracle).

MVZ: 2009 - Moved hosting of data to Alaska (Virtual Private Database version).

Page 5: Major repositories using the Arctos database:

Arctos is an ongoing effort to integrate access to museum data, collection management tools, and external resources on the Web.

Specimen Catalog

label data (and more)

Projectscontribute and/or

use specimens

Accessions Loans,usage

Publicationscite specimens

GenBank

Federated portals

Arctos

BerkeleyMapper

“Media” in TeraGrid

The rest ofCyberspace

Citations

Page 6: Major repositories using the Arctos database:

Arctos Philosphy Maximize access to high quality information about Maximize access to high quality information about collections on the web.collections on the web.

Integrate different kinds of information (specimen data, Integrate different kinds of information (specimen data, DNA sequences, media, etc.) into easily and visually DNA sequences, media, etc.) into easily and visually retrievable records.retrievable records.

Facilitate management of data by curators and collection Facilitate management of data by curators and collection managers. managers. Collaboration (development, curatorial) is key Collaboration (development, curatorial) is key to success.to success.

Show value of specimens for scientific research to the Show value of specimens for scientific research to the public through web resources.public through web resources.

Page 7: Major repositories using the Arctos database:

Breadth of Data in Arctos Fish, amphibians, reptiles, mammals, birds and bird Fish, amphibians, reptiles, mammals, birds and bird eggs/nests, plants, arthropods, fossils, molluscseggs/nests, plants, arthropods, fossils, molluscs Specimens and observationsSpecimens and observations Media (images, audio)Media (images, audio) Publications, fieldnotesPublications, fieldnotes

Arctos constantly evolving to incorporate new kinds of Arctos constantly evolving to incorporate new kinds of data, e.g.,:data, e.g.,: Better representation of non-publication documents Better representation of non-publication documents (fieldnotes, correspondence)(fieldnotes, correspondence) Cultural collections (art, anthropology...)Cultural collections (art, anthropology...)

Nearly all that is known about an object (or observation) can be included in Arctos.

Page 8: Major repositories using the Arctos database:

Linking specimen records to archival documentation…

Page 9: Major repositories using the Arctos database:

Access and Security Virtual Private Database (Row-Level Security) allows Virtual Private Database (Row-Level Security) allows

collections to maintain control over data while sharing collections to maintain control over data while sharing certain nodes (taxonomy, agents, geography).certain nodes (taxonomy, agents, geography).

Access to data is controlled by user privileges (collections, Access to data is controlled by user privileges (collections, tables).tables).

Public can query without user account. More options if Public can query without user account. More options if logged in.logged in.

Business rules enforced at database (not application) Business rules enforced at database (not application) level.level.

Page 10: Major repositories using the Arctos database:

Arctos is…A versatile online collections management system

Cataloged Items (ID, attributes, parts, etc.; batch uploading, downloading, editing; encumbrances)

Localities & Collecting Events (mapping, media, history)

Transactions (loans, accessions, borrows, permits; email reminders)

Usage (publications, projects, sponsors, GenBank)

Curatorial (object tracking, parts, condition, relations, etc.)

Determination history (identification, georef, attributes)

Page 11: Major repositories using the Arctos database:

Arctos is…Modular and Pluggable

Projects and Publications

Media (local, external URI)

External Services (GenBank, TACC, MorphBank, CalPhotos, Ubio, etc.; ) – Collaborative

DiGIR (ORNIS, HerpNET, MaNIS, VertNET, etc.)

Page 12: Major repositories using the Arctos database:

Arctos is…Customizable

Specific features, e.g., Attributes (A cataloged item can A cataloged item can have any number of attributes, customized to collections. A have any number of attributes, customized to collections. A generic solution.)generic solution.)

Users (search, results, identifiers, download; save and email searches)

Managers (users, headers, CSS, reports)

Page 13: Major repositories using the Arctos database:

Arctos is…Open

Open source code

Built upon community needs

Compartmentalized (choose what you want to share)

Community input valued (user group, code issue list)

Responsive development model (bugs generally fixed within hours, requests within days to weeks unless major new feature) – community discussion drives development

Page 14: Major repositories using the Arctos database:

Arctos is…Economical

$15,300/year hosting (currently split by 3 institutions):• secure environment• dedicated web server and database server• server and database administrators• Oracle 11G• offsite backups in Alaska, TACC, San Diego

1 full-time developer

1 part-time DBA

Lots of input from users

Page 15: Major repositories using the Arctos database:

Arctos Multihosting Shared CodeShared Code Shared CostShared Cost Shared ExpertiseShared Expertise

CuratorialCuratorial TechnicalTechnical

Location not important

(electrons are fast!)

MCZ

?

Page 16: Major repositories using the Arctos database:

GoalRun Arctos as a single system from one or more mirrored Run Arctos as a single system from one or more mirrored locations, locations,

from which each participating institution can be from which each participating institution can be represented to its users and operators as an exclusive represented to its users and operators as an exclusive instance through row-level security, instance through row-level security,

but in which vocabulary, programming, and physical but in which vocabulary, programming, and physical resources are shared. resources are shared. 

(No technical issues to adding collections. Social issues (No technical issues to adding collections. Social issues in multihosting.) in multihosting.)

Page 17: Major repositories using the Arctos database:

Governance and Operating Support Steering Committee and Advisory Group with

representatives of each institution contributing to operational budget. Steering Committee (institutional Directors or designees):

budget, proposals for funding, proposals to connect to external resources, addition of collections, etc.

Advisory Group (curators): priorities for development - Chair of Advisory Group oversees activity of developer(s), reports to Steering Committee

Operating costs shared among institutions, funded by outside grants - identify developments that require outside funding vs. in-house support

Page 18: Major repositories using the Arctos database:

Summary Cost-effective access to a deeply relational, highly

expandable database capable of recording all aspects of specimen curation, running on Enterprise-caliber hardware and software, managed by professional IT staff.

Community of experienced users provide input into data structure and new development. Burden of development and operation shared among institutions.

All Arctos data are online. The public can query it, privileged users manage it through web browsers.

Demonstrates and documents the scientific context of specimens (projects, documents, media, etc.)…more than just label data.