chapter 5 knowledge representation cios / pedrycz / swiniarski / kurgan

45
Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan Cios / Pedrycz / Swiniarski / Kurgan

Upload: derek-williams

Post on 17-Jan-2016

223 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

Chapter 5

KNOWLEDGE REPRESENTATION

Cios / Pedrycz / Swiniarski / KurganCios / Pedrycz / Swiniarski / Kurgan

Page 2: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Outline

• Introduction

• Main categories of data representation schemes

• Granularity of data and its taxonomy • Design aspects

Page 3: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Knowledge Representation

• Sources of knowledge are highly diverse and need to be represented in different ways

• Representation schemes are essential for processing of data and revealing relationships

• Granularity of information is a vehicle of abstraction, which is essential in description of relationships formed through data mining

Page 4: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Types of Data: Continuous Quantitative Data

• Continuous variables, such as pressure, temperature, and height

• They often have some relationship to the physical phenomena that generated them

• Linear order is quite common

Page 5: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Types of Data: Continuous Quantitative Data

Linear order of real numbers

Page 6: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Types of Data: Qualitative Data

Typically assume some limited number of values. They could be either organized in a linear fashion (ordinal qualitative data) or no order could be established (nominal qualitative data).

Page 7: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Nominal Qualitative Data

Page 8: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Ordinal Qualitative Data

Page 9: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Structured Data

They form some structure that leads to a hierarchy of concepts

More specialized concepts occur at lower level of hierarchy

Tree structure is commonly used for their representation

car

SUV luxury truck

bus

vehicle

van

Page 10: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Categories of Models of Knowledge Representation

• Rules

• Graphs and directed graphs

• Trees

• Networks

Page 11: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Rules and Their Taxonomy

Generic format

IF condition THEN conclusion (action)

In general, we encounter a finite collection of rules

IF condition is Ai THEN conclusion is Bi

Often the rules are multivariable, namely, they consist a number of variables (conditions)

- IF condition1 and condition2 and …. and conditionn THEN conclusion

Page 12: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Gradual Rules

“The higher the values of condition, the higher the values of conclusion” or“the lower the values of condition, the higher the values of conclusion”

They capture notion of graduality between the concepts occurring in the conditions and conclusions

IF (Ai) THEN (Bi)

Page 13: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Quantified Rules

Quantification of confidence of the rules

the likelihood that high fluctuations in real estate prices lead to a significant migration of population within the province

is quite moderate.

Confidence of rule

Page 14: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Analogical Rules

Focus on analogy (similarity, closeness, resemblance…)between conditions and conclusions

IF similarity (Ai, Aj) THEN similarity (Bi, Bj)

express analogy

Page 15: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Rules with Regression Local Models

More advanced and functionally augmented conclusion part of the rule. It involves some regression model

IF condition is Ai THEN y = fi(x, ai)

local regression model

Page 16: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Graphs and Directed Graphs

Concepts and links express relationships between the concepts.Two main categories of graphs: (a) undirected graphs (b) directed graphs

A

B

C D

A

B

C D E

0.7

1.0

0.4

0.6

1.0

Page 17: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Hierarchy of Graphs

Refinement of relationships starting from the most general relationships (with general nodes) and expansion of the nodes

`

Page 18: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Trees

An important category of graphs with (a) Single root(b) No loops(c) Terminal nodes

root

Terminal nodes

Page 19: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Decision Trees

A

a

b c

w z

B C

k l

IF A is c and B is w THEN IF A is c and B is z THEN IF A is a and C is k THEN IF A is a and C is l THEN

Decision tree rules

Page 20: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Networks

Generalized graphs with nodes endowed with some processing capabilitiesFor instance: (a) Each node computes some logic formula using input variables and logic operators (conjunction, disjunction, complement)(b) Node could be a local neural network

z1 = (x1, x2, x3)

z2 = (x1, x2, x3)

y = (z1, z2, x4)

Page 21: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Information Granulesand Information Granularity

Information granules support human-centric computing

By human-centricity we mean characteristics of computing systems that facilitate interaction with humans either by improving the quality of communication of findings or by accepting inputs from users in a flexible and friendly manner, say, in a

linguistic form.

Information granules permeate human endeavors. Any given task can be cast into a certain conceptual framework of relevant generic entities; this is the framework in which we:

(a) formulate generic concepts at some level of abstraction (b) carry out processing (c) communicate the results to the user/external environment

Page 22: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Information Granules:Examples

Image processing

Humans do not focus on individual pixels but group them together into semantically meaningful constructs such as: regions that consist of groups of pixels drawn together owing to their proximity in the image, similar texture, color, level of brightness, etc.

Signal processing

We describe signals in a semi-qualitative manner by identifying specific regions of the signals in time or frequency domain. E.g., specialists easily interpret ECG signals by identifying some segments and their combinations.

Page 23: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Granular Computing: an Emerging Paradigm

Identifies essential commonalities between diverse problems and technologies, cast into a unified framework we refer to as a granular world.

With granular processing we better understand the role of interaction between various formalisms and might visualize a way in which they communicate.

It brings together the formalisms of sets, fuzzy sets and rough sets, by visualizing that in spite of their distinct underpinnings, the granular computing establishes an environment for building synergy between different individual approaches.

Page 24: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Granular Computing:Key Formal Frameworks

Set theory (interval analysis)

Fuzzy Sets

Rough Sets

Shadowed Sets

Page 25: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Sets

• Notion of Membership

belongs to excluded from

Page 26: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Characteristic Functions

Concept of dichotomy

1A(x) Ax

0A(x) Ax

Page 27: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Description of Sets

• Membership– enumerate elements belonging to the set

• Characteristic function

1

0

A(x)=0

A(x)=1

}1,0{:A X

Page 28: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Basic Operations on Sets

A B = B A Commutativity A B = B A

A (B C) = (A B) C Associativity

A (B C) = (A B) C A (B C) = (A B) (A C)

Distributivity A (B C) = (A B) (A C) A A = A

Idempotency A A = A

A = A and A X = X Boundary Conditions

A = and A X = A

Involution AA

Transitivity (in terms of inclusion) if A B and B C then A C

Page 29: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Challenge: Three-valued Logic

Lukasiewicz (~1920) true (0) false (1) don’t know (1/2)

Three valued logic

Page 30: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Fuzzy set - Definition

Fuzzy set A is described by its membership function A(x)

A(x) =1: complete membership

A(x) =0: complete exclusion

]1,0[:A X

Page 31: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Fuzzy Sets:Membership Functions

• Partial membership of element to the set – membership degree A(x)

• The higher the value of A(x), the more typical the element “x” (as a representative of A)

Page 32: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Membership Functions:Examples

membership

1

0

km/h 60 90 120 150

Page 33: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Principle of the Least Commitment

Page 34: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Characteristics of Fuzzy Sets

Notion and definition Description

-cut A = {x | A(x) } Set induced by some threshold consisting of elements belonging to A to an extent not lower than. By choosing a certain threshold, we convert A into the corresponding set representative. -cuts provide important links between fuzzy sets and sets

Height of A, hgt(A) = supxA(x) Supremum of the membership grades; A is normal if hgt(A) =1. Core of A is formed by all elements of the universe for which A(x) attains 1.

Support of A, supp(A) ={x| A(x) >0} Set induced by all elements of A belonging to it with nonzero membership grades

Cardinality of A, card(A) = XA(x)dx

(assumed that the integral does exist) Counts the number of elements

belonging to A; characterizes the granularity of A. Higher card(A) implies higher granularity(specificity) or, equivalently, lower generality

Page 35: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Shadowed Sets

Granular constructs in which we allow for regions of space of complete ignorance

A~ : X { 0, 1, [0,1]}

Exclusion Full membership Ignorance

A~

[0,1] [0,1]

Page 36: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Shadowed Sets:Logic Operations

]1,0[1 0

[0,1]1[0,1]

111

[0,1]10

]1,0[

1

0

Union Intersection

[0,1] 1 0

[0,1][0,1]0

[0,1]10

000

]1,0[

1

0

]1,0[

0

1

]1,0[

1

0complement

Page 37: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Shadowed Sets:Estimation Method

Shadowed sets are generated (induced) on a basis of fuzzy sets:Re-allocation of membership grades (a) low membership grades reduced to zero (b) high membership grades elevated to 1 (c) membership grades resulting from the reduction and elevation are used in the formation of shadows

1

1-

a b

1

2

3 A(x)

x

a1 a2

1 + 2 = 3

2

1

1

2

a

a

a

a

b

adxA(x))dx(1A(x)dx

Linear membership function:

4142.02

22β

2/3

Page 38: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Rough Sets

Defining concept X with the use of a finite vocabulary of information granules:

• Lower bound (approximation)• Upper bound (approximation)

Page 39: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Rough Sets :Upper and Lower Approximations

temperature

pres

sure

X

temperature

pres

sure

lower approximation

upper approximationAi

Page 40: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Rough Sets :Upper and Lower Approximations

}XA|{AX

ionapproximatupper

X}A|{AX

ionapproximatower

ii*

ii*

l

Page 41: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Rough Sets: The Principle of Data Description

Given some information granules they are used to characterize other granular view at data

C1

Data

C2

information granules (sets) formed by C1

Rough set representation

Page 42: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Characterization of Knowledge Representation Schemes

Expressive power of the scheme Computational complexity and associated tradeoffs :

- Flexibility of knowledge representation – familiarity of the users with a specific scheme of knowledge representation

- Effectiveness of forming models on the basis of domain knowledge and experimental data

- Character of information granulation and the level of specificity of information granules

Page 43: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Information Granularity Information granules are defined at different levels of specificity (abstraction) which is reflective of their “size”

We define measures that characterize the size of information granules

XA(x)dx

Information granules Granularity

Fuzzy sets In the case of a finite space X, the integral is replaced by a sum of the membership grades

Rough sets Cardinality of the lower and upper bound,card (A+), card (A-);

the difference between these two describes roughness of A

Page 44: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

Information Granularity in Rule-based Systems

granularity of condition

gran

ular

ity

of c

oncl

usio

n increased usefulness of rule

acceptable rules

unacceptable rules

Page 45: Chapter 5 KNOWLEDGE REPRESENTATION Cios / Pedrycz / Swiniarski / Kurgan

© 2007 Cios / Pedrycz / Swiniarski /

Kurgan

References Bargiela A and Pedrycz W. 2003. Granular Computing: An Introduction,

Kluwer Academic Publishers

Giarratano J and Riley G. 1994. Expert Systems: Principles and Programming, 2nd Ed., PWS Publishing

Moore R. 1966. Interval Analysis, Prentice Hall

Pal SK and Skowron A (eds.) 1999. Rough Fuzzy Hybridization. A New trend in Decision-Making, Springer Verlag

Pawlak Z. 1991. Rough Sets. Theoretical Aspects of Reasoning About Data, Kluwer Academic Publishers

Pedrycz W and Gomide F. 1998. An Introduction to Fuzzy Sets; Analysis and Design.

MIT Press, 1998.

Pedrycz W (ed.). 2001. Granular Computing: An Emerging Paradigm, Physica Verlag

Zadeh LA. 1965. Fuzzy sets, Information & Control, 8, 1965, 338-353.