entities & attributes overview by the end of this section participants will be able to discuss...
TRANSCRIPT
Entities & Attributes Overview
By the end of this section participants will be able to discuss the main components of Section 5, and be able to incorporate section 5 into the practice record at the end of the Workshop.
Metadata describes • Metadata is information about data• Metadata describes the characteristics of
the data– How it was produced– Data Sources– The currentness of the data– The spatial parameters of the data set– Accuracy of the lines– Plus much more
Section 5 is the component of metadata which describes the data table.
Environmental Sensitivity Index Data
RARNUM - unique combination of species, concentration, and seasonality
CONC (concentration) = Density species is found at location
Season_ID = seasonality code like to the seasonal table
Element - Biology group
Metadata
Entity and Attribute Section
Detailed_Description: Entity_Type:
Entity_Type_Label: Population Entity_Type_Definition: U.S. Population in 1,000 aggregated by census geography and year. Entity_Type_Definition_Source: NOS, Special Projects
Attribute: Attribute_Label: c_popul Attribute_Definition: Total Population (Units: persons) Attribute_Definition_Source: NOS, Special Projects Attribute_Domain_Values:
Unrepresentable_Domain: Variable by location and year
Attribute: Attribute_Label: c_urbn Attribute_Definition: Urban Population (Units: persons) Attribute_Definition_Source: NOS, Special Projects Attribute_Domain_Values:
Unrepresentable_Domain: Variable by location and year
Overview Description
Dataset covers the following general areas, population, age, structures, income distributions, employment, educational attainment, and residential mobility aggregated by geographic area.
http://stics.noaa.gov
Detailed DescriptionDetailed_Description:
Entity_Type: Entity_Type_Label: Population Entity_Type_Definition: U.S. Population in 1,000s aggregated by census geography and year. Entity_Type_Definition_Source: NOS, Special Projects
Attribute: Attribute_Label: c_popul Attribute_Definition: Total Population (Units: persons) Attribute_Definition_Source: NOS, Special Projects Attribute_Domain_Values:
Unrepresentable_Domain: Variable by location and year
Attribute: Attribute_Label: c_urbn Attribute_Definition: Urban Population (Units: persons) Attribute_Definition_Source: NOS, Special Projects Attribute_Domain_Values:
Unrepresentable_Domain: Variable by location and year
Definitions• Entity - the definition and description of
a set into which similar entity instances are classified.
• Attribute - a defined characteristic of an entity.
Deciding the Entities and Attributes
Entity Road TableAttribute Road Location Road Type Length
Road FIPS Code Road Type LengthAnderson Trail 11 1 1McTale Path 12 1 0.5Tally Drive 11 3 2Sundale Route 13 2 3
Breakdown
Deciding the Entities and Attributes
Road FIPS Code Road Type LengthAnderson Trail 11 1 1McTale Path 12 1 0.5Tally Drive 11 3 2Sundale Route 13 2 3
Entity Road Location Road Type Length Attribute of Road Type 1 2 3
Breakdown
Entity Type
Entity Type Label: c_popul
Entity Type Definition: Total Population (Units:
persons)
Entity Type Definition Source: NOAA
Required Fields
Attribute
Attribute Label: c_popul
Attribute Definition: Total Population (Units:
persons)
Attribute Definition Source: NOAA
Attribute Domain Values:
Range Domain:
Range Domain Minimum: 1000
Range Domain Maximum: 300000
Required Fields
Enumerated Domain – A fixed list of values
Road FIPS Code Road Type LengthAnderson Trail 11 1 1McTale Path 12 1 0.5Tally Drive 11 3 2Sundale Route 13 2 3
Attribute Domain Value: Enumerated Domain: Enumerated Domain Value: 1 Enumerated Domain Value Definition: Bike Trail Enumerated Domain Value Definition Source: Test Company
Codeset Domain – Standardized List of Values
Attribute Domain Value: Codeset Name: Federal Information Processing Standards Codes Codeset Source: National Institute of Standards and Technology
Road FIPS Code Road Type LengthAnderson Trail 11 1 1McTale Path 12 1 0.5Tally Drive 11 3 2Sundale Route 13 2 3
Range Domain – A series of measurements where you would provide the minimum and maximum values
Road FIPS Code Road Type LengthAnderson Trail 11 1 1McTale Path 12 1 0.5Tally Drive 11 3 2Sundale Route 13 2 3
Attribute Domain Value: Range Domain: Range Domain Minimum: 0.5 Range Domain Maximum: 3
Unrepresentable Domain – Unique values such as names which cannot be easily summarized
Attribute Domain Value: Unrepresentable Name: Unique Values, Road Names
Road FIPS Code Road Type LengthAnderson Trail 11 1 1McTale Path 12 1 0.5Tally Drive 11 3 2Sundale Route 13 2 3
Additional Things to Think About - Definitions
Entity Type Entity Type Label: YEAR Entity Type Definition: year Entity Type Definition Source: NOAA
Entity Type Entity Type Label: YEAR Entity Type Definition: Fiscal year Entity Type Definition Source: NOAA
Additional Things to Think About - Units
Attribute Attribute Label: Depth Attribute Definition: Water Depth Attribute Definition Source: NOAA Attribute Domain Values: Range Domain:
Range Domain Minimum: 0.5 Range Domain Maximum: 3
Additional Things to Think About - Completeness
Attribute Attribute Label: 2013 Attribute Definition: S Attribute Definition Source: NOAA Attribute Domain Values: Unrepresentable Domain: Fixed Value
Additional Things to Think About - Labels
Attribute Attribute Label: Anthrax Attribute Definition: Anthracene Attribute Definition Source: NOAA Attribute Domain Values: Unrepresentable Domain: Fixed Value
Finally...
Remember, dataset definitions are an
important part of ensuring long-term
data integrity.
Provide complete definitions that are
understood by others outside of the
project and when appropriate the units
of measure.
• Have someone else read it.
• If you’re the only reviewer, put it away and read it again later.
• Check for clarity and omissions.
Review your final product.
Writing Metadata
• Does the documentation present all the information needed to use or reuse the data?
• Are any pieces missing?
When you review your work, ask:
Writing Metadata
5. Entity_and_Attribute_Information: (p. 75)
Database design
• entities, attributes, domains, description of data values