producer questions 6 december 2005. 2005-12-06 producer questions 2 purpose the sip standard...
TRANSCRIPT
Producer Questions
6 December 2005
2005-12-06 Producer Questions 2
Purpose
The SIP standard envisions the development of a formal model of the data for use by the Producer and the Archive for each Archive Project.
In general, the Archive will need to guide the Producer to a framework under which the Producer can make clear what data are to be provided
Archive must understand dataArchive must be able to ensure that what was agreed to be provided is provided
The primary purpose of this paper is to attempt to bridge the gap between what the Producer has and what the formal model will require.
2005-12-06 Producer Questions 3
Assumptions
The Producer and the Archive need to use a common terminology and have a common understanding of the associated concepts.
OAIS terminology is used
Data Producer and the Archive have agreed to establish an Archive Project
i.e. a certain set of data, possibly not yet fully available, will be transferred to the Archive.
The primary data to be provided reside in files under the control of the Producer. The files are organized in some way by the producer. There is some amount of documentation, available somewhere, that describes these data files.
documentation should be adequate to allow effective use by Archive customers (i.e., Consumers).
2005-12-06 Producer Questions 4
Model of File DataData File = digital file containing scientific or technical data
A Data File has a file type.
A file type has a Representation Descriptioni.e., a file type is characterized by a definition of it internal structure and by the meanings associated with those structures.
• i.e., the format of the file and the meanings assigned to the format elements.
Primary Data File = data file that contains the primary information to be submitted to an Archive for preservation
Examples• a series of observations of magnetic field values taken by a
spacecraft• files containing images of stellar space taken by a telescope• files containing reading of blood pressure taken during an
astronaut centrifuge experiment.
2005-12-06 Producer Questions 5
Content Information Unit
2005-12-06 Producer Questions 6
Basic Information Collection
2005-12-06 Producer Questions 7
Basic Info Collection with PDI
2005-12-06 Producer Questions 8
Information Collection Type Hierarchy
2005-12-06 Producer Questions 9
Question Categories
Identify TypesIdentify Type RelationshipsIdentify the Units and CollectionsIdentify Processing History and ContextIdentify the Locations of the Processing History and Context
2005-12-06 Producer Questions 10
Questions
Identify Types
For this Archive Project, how many Content Information Unit Types are to be provided?
For each, provide a unique reference name for the type.
How many Basic Information Collection types are to be provided?
For each, provide a unique reference name for the type
How many other Collection types are to be provided?For each, provide a unique reference name for the type
2005-12-06 Producer Questions 11
Questions
Identify Type Relationships
For each CIUT, identify the BIC types and non-Basic Collection types to which it belongs.
This may involve rules for name formation.
For each non-Basic Collection type, identify the BIC types and non-BIC types that belong to it.
This may involve rules for name formation
At this point, a complete hierarchy of Collections and Units, in terms of types, is defined.
However the number of different Units and Collections is not.
2005-12-06 Producer Questions 12
Questions
Identify the Units and Collections
For each Unit type and Collection type, identify the location(s) of the Representation Description.
For each Collection, assign a unique Collection name or provide the naming rules.
At this point one has an identification of all the SIP POT Descriptors. Subsequent questions are asking for information to further populate the Descriptors.
For each Unit, assign a unique Unit name or provide the naming rules.
2005-12-06 Producer Questions 13
Questions
Identify the Processing History and Context
For each Unit and Collection, identify the Processing History and Context Information.
For each Unit that is composed of multiple files, identify the files by name or provide the naming rules.
2005-12-06 Producer Questions 14
Questions
Identify the Locations of the Processing History and Context
For each Unit and Collection, identify the location of the Processing History and Context Information.
For each Unit that is composed of multiple files, identify the location(s) of the Processing History and Context
Information.