classification systems
DESCRIPTION
TRANSCRIPT
![Page 1: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/1.jpg)
Classification Systems
Intro to Mapping & GIS
![Page 2: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/2.jpg)
Levels of Measurement• Nominal: “names” of items without
intercorrelation• Ordinal: implied order without inference to
the spaces between values• Interval: ranking considering values
between• Ratio: meaningful base and ratios between
values
![Page 3: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/3.jpg)
Measurement Scales• Nominal: Categorical measure [e.g., land
use map]• Lowest level [= only possible operation]
• Ordinal: Ranking Measure– Lowest quantitative level [=, >, <]– Strong ordinal: all objects placed in order, no
ties.– Weaker ordinal: all objects placed in order, but
ties exist.– Weakest ordinal: rating scales:
• Strongly agree, agree, disagree, strongly disagree.
![Page 4: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/4.jpg)
Interval Scale• Can measure the size of a difference, but not how
many times one observation is greater than another:– Temperature: Consider ratio of 80°F and 40°F
• 80°F / 40°F = 2.00
• 26.6°C / 4.44°C = 5.99– No true zero amount. At 0°F there is still some of the
thing called temperature. But an interval of 20° is two times as large as the interval of 10°.
– Operations: [=, >, <, +, -]
Temperature Scale Conversion fromDegrees F to C
![Page 5: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/5.jpg)
Ratio Scale• Can place numbers in ratio to each other:
– Edith makes three times as much money as Walter.
– Bill lives twice as far from here as Mary.• 20 miles / 10 miles = 2.0
• 32.3 km / 16.1 km = 2.0– There is a true zero. You can have zero money.
Ratio remains same through transformation of metric.
– Operations: [=, >, <, +, -, *, /, ^]
Distance Scale Conversion fromMiles to Kilometers
![Page 6: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/6.jpg)
Overview of Map Classing• What is classing? Why do we class?• Overriding principles.• Principles for deciding:
– Number of classes.– Method to use in classing.
• Methods of Classing:– Natural Breaks– Equal Interval– Quantile– Mean and Standard Deviation– Arithmetic Progression– Geometric Progression
![Page 7: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/7.jpg)
What Is Classing?• Classification process to reduce a large
number of individual quantitative values to:– A smaller number of ordered categories, each
of which encompasses a portion of the original data value range.
• Various methods divide the data value range in different ways.
• Varying the method can have a very large impact on the look of the map.
![Page 8: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/8.jpg)
Fundamental Principles• Each of the original (unclassed) data values must
fall into one of the classes. Each data value has a class home.
• None of the original data values falls into more than one class.
• These two rules are supreme: if any method results in classes that violate these rules, the resulting classes must be altered to conform to the two fundamental principles. Always.
![Page 9: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/9.jpg)
Shorthand Way of Saying This
• Classes must be:
–Mutually exclusive
&–Exhaustive
![Page 10: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/10.jpg)
Principles for Deciding on Number of Classes
• Rules of thumb:– Monochrome color schemes: no more than 5
to 7 classes.– Multi-hue map: no more than 9.
• it depends on other things such as ….
![Page 11: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/11.jpg)
• Communication goal– What are you trying to prove or disprove with
this map? Is it to communicate complex data or prove a simple point. Consider your audience.
• Complexity of Spatial Pattern– Complex data may require specific
classification methods.
• Available Symbol Types
Principles for Deciding on Number of Classes
![Page 12: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/12.jpg)
Principles for Deciding on Number of Classes
• Communication goal: quantitative precision: use larger number of class intervals. Each class will encompass a relatively small range of the original data values and will therefore represent those values more precisely.
• Trade offs:– Too much data to enable the information to
show through.– Indistinct symbols.
![Page 13: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/13.jpg)
![Page 14: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/14.jpg)
Principles for Deciding on Number of Classes
• Communication goal: immediate graphic impact: use smaller number of class intervals. Each class will be graphically clear, but will be imprecise quantitatively.
• Trade offs:– Potential for oversimplification– One class may include wildly varying data
values
![Page 15: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/15.jpg)
![Page 16: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/16.jpg)
Principles for Deciding on Number of Classes
• Complexity of Spatial Pattern– Highly ordered spatial distribution can have
more classes.
– Complex pattern of highly interspersed data values requires fewer classes.
![Page 17: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/17.jpg)
![Page 18: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/18.jpg)
![Page 19: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/19.jpg)
Classing Methods
![Page 20: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/20.jpg)
Natural BreaksAttempts to create class breaks such that
there is minimum variation in value within classes and maximum variation in value between classes.
Default classification method in ArcMap.
![Page 21: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/21.jpg)
Natural Breaks
![Page 22: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/22.jpg)
Natural Breaks
![Page 23: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/23.jpg)
• Advantages– Maximizes the similarity of values within each
class.– Increases the precision of the map given the
number of classes.• Disadvantages
– Class breaks often look arbitrary.– Need to explain the method.– Method will be difficult to grasp for those
lacking a background in statistical methods.
Natural Breaks
![Page 24: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/24.jpg)
Equal Interval• Each class encompasses an equal portion of
the original data range. Also called equal size or equal width.
• Calculation:– Determine range of original data values:
• Range = Maximum - Minimum– Decide on number of classes, N.– Calculate class width:
• CW = Range / N
![Page 25: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/25.jpg)
• Class Lower Limit Upper Limit1 Min Min + CW2 Min + CW Min +
2CW3 Min + 2CW Min +
3CW. . . . . .
N Min + (N-1) CW Max
Equal Interval
![Page 26: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/26.jpg)
• Example: Min = 0; Max = 100; N = 5• Range = 100 - 0 or 100• Class Width = 100 / 5 or 20• Class Lower Limit Upper Limit
1 0 to 202 20 to 403 40 to 604 60 to 805 80 to 100
Equal Interval
![Page 27: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/27.jpg)
• Example: Min = 0; Max = 100; N = 5• Range = 100 - 0 or 100• Class Width = 100 / 5 or 20• Class Lower Limit Upper Limit
1 0 to 202 21 to 403 41 to 604 61 to 805 81 to 100
Accepted Equal Interval
Must beMutually Exclusive
![Page 28: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/28.jpg)
Equal Interval
![Page 29: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/29.jpg)
Equal Interval
![Page 30: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/30.jpg)
• Advantages:– Easy to understand, intuitive appeal– Each class represents an equal range or
amount of the original data range.– Good for rectangular data distributions
• Disadvantages:– Not good for skewed data distributions—nearly
all values appear in one class.
Equal Interval
![Page 31: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/31.jpg)
Five Classes Equal Interval
![Page 32: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/32.jpg)
![Page 33: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/33.jpg)
Defined Interval• Each class is of a size defined by the map
author.• Intervals may need to be altered to fit the
range of the data.• Calculation:
– Set interval size.– Determine range of original data values:
• Range = Maximum – Minimum– Calculate number of classes:
• N = Range / CW
![Page 34: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/34.jpg)
• Example: Min = 40; Max = 165; CW = 25• Range = 165 - 40 = 125• N = 125 / 25 = 5 classes • Class Lower Limit Upper Limit
1 40 to 652 66 to 903 91 to 1154 116 to 1405 141 to 165
Defined Interval
Must beMutually Exclusive
![Page 35: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/35.jpg)
Defined Interval
![Page 36: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/36.jpg)
Defined Interval
![Page 37: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/37.jpg)
• Advantages:– Easy to understand, intuitive appeal– Each class represents an specified amount– Good for rectangular data distributions– Good for data with “assumed” breaks
• Decades for years, 1,000s for money, etc.
• Disadvantages:– Not good for skewed data distributions—many
classes will be empty and not mapped.
Defined Interval
![Page 38: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/38.jpg)
Quantile Classes• Places an equal number of cases in each class.• Sets class break points wherever they need to be in
order to accomplish this.• May not always be possible to get exact quantiles:
– Number of geounits may not be equally divisible by number of classes. [21 / 5].
– Putting same number of cases in each class might violate mutually exclusive classes rule.• 12 values in 4 classes:
0 0 0 |0 0 3| 4 4 5| 6 7 7 NO• 0 0 0 0 0 |3| 4 4 5| 6 7 7 YES
![Page 39: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/39.jpg)
Quantile
![Page 40: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/40.jpg)
Quantile
![Page 41: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/41.jpg)
Quantile Classes• Advantages:
– Each class has equal representation on the map.
– Intuitive appeal: map readers like to be able to identify the “top 20%” or the “bottom 20%”
• Disadvantages:– Very irregular break points unless data have
rectangular distribution.– Break points often seem arbitrary. Remedy this
with approximate quantiles.
![Page 42: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/42.jpg)
Mean & Standard Deviation• Places break points at the Mean and at
various Standard Deviation intervals above and below the mean.
• Mean: measure of central tendency.
• Standard Deviation: measure of variability.
![Page 43: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/43.jpg)
Mean & Standard Deviation • Class Lower Limit Upper Limit
1 Min Mean – 1.5*SD
2 Mean – 1.5*SD Mean – 0.5*SD
3 Mean – 0.5*SD Mean + 0.5*SD
4 Mean + 0.5*SD Mean + 1.5*SD
5 Mean + 1.5*SD Mean + 2.5*SD
6 Mean + 2.5*SD Max
![Page 44: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/44.jpg)
Mean & Standard Deviation
Mean
![Page 45: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/45.jpg)
Mean & Standard Deviation
Clearly showswhat’s “average”.
![Page 46: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/46.jpg)
Mean & Standard Deviation• Advantages:
– Statistically oriented people like it.– Allows easier comparison of maps of variables
measured in different metrics. Income and education levels.
• Disadvantages:– Many map readers are not familiar with the
concept of the standard deviation.– Not good for skewed data.
![Page 47: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/47.jpg)
Geometric Interval• The width of each succeeding class interval
is larger than the previous interval by a constant amount.
• Calculating the constant amount, CW:– Decide on number of classes, N.– Calculate the range: R = Max - Min– Solve: R = CW + 2CW + . . . + NCW
for CW
![Page 48: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/48.jpg)
Geometric: Class Widths
Class Width1 CW2 2CW3 3CW4 4CW5 5CW
![Page 49: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/49.jpg)
Geometric Progression Classes• The width of each succeeding class interval
is larger than the previous interval by a exponentially varying amount.
• Calculating the BASE amount, CW:– Decide on number of classes, N.– Calculate the range: R = Max - Min– Solve:
![Page 50: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/50.jpg)
Geometric: Class Example
• Max = 160; Min = 10; R = 150; CW = 10• Class Lower Limit + CLASS* CW = Upper Limit
1 10 + 10 = 202 20 + 20 = 403 40 + 30 = 704 70 + 40 = 110 5 110 + 50 = 160
![Page 51: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/51.jpg)
Geometric Interval
![Page 52: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/52.jpg)
Geometric Interval
![Page 53: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/53.jpg)
Geometric Interval• Advantages:
– Uneven, but regular class breaks.– Tends to even out class frequencies for skewed
distributions while making class widths relatively small in areas where there is high frequency.
• Disadvantages:– Uncommon.– Unequal width classes.
![Page 54: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/54.jpg)
Making a sensible map• Remember, class breaks should make sense
to both you and the audience.• Most of the time, you should change what
ArcMap produces by default.• Median Year Built
![Page 55: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/55.jpg)
Years shouldn’t be demical!
![Page 56: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/56.jpg)
Number formatting• ArcGIS assumes by default that numbers
not explicitly set as integers are floating point (decimal) numbers.
• ArcGIS also assumes all floating point numbers should show 6 places after the decimal point.
![Page 57: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/57.jpg)
Layer Properties• Under the Fields tab in
the Layer Properties, you can set aliases for the field names, as well as define the format of the field.
• Set numbers to have no zeroes after the decimal point.
![Page 58: Classification Systems](https://reader036.vdocument.in/reader036/viewer/2022081602/54809e61b4af9ff2518b488c/html5/thumbnails/58.jpg)