lecture 30 - auckland

51
Data Mining and Machine Learning CompSci 760 Learning Hierarchical Skills in a Cognitive Architecture Lecture 30

Upload: others

Post on 07-Jan-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Lecture 30 - Auckland

Data Mining and Machine LearningCompSci 760

Learning Hierarchical Skillsin a Cognitive Architecture

Lecture 30

Page 2: Lecture 30 - Auckland

Synthetic Agents for Urban Driving

Insert demo here

Page 3: Lecture 30 - Auckland

What is a Cognitive Architecture?

A cognitive architecture (Newell, 1990) is an infrastructurefor intelligent systems that:

• Specifies the facets of cognition that hold constant acrossdifferent domains;

• Including memories and the representations of elements inthose memories;

• But not the content of those memories, which can changeacross domains and over time.

A cognitive architecture is analogous to a building architecture,which describes its fixed structure but not its contents.

Page 4: Lecture 30 - Auckland

What is a Cognitive Architecture?

A cognitive architecture provides more than an integrationframework for intelligent systems, in that it:

• makes strong theoretical assumptions about the representationsand mechanisms underlying cognition;

• incorporates many ideas from psychology about the natureof human cognition;

• contains distinct modules, but these access and alter the samememories and representations;

• comes with a programming language with a high-level syntaxthat reflects the theoretical assumptions.

A cognitive architecture is all about mutual constraints, as it aimsto provide a unified theory of the mind.

Page 5: Lecture 30 - Auckland

The ICARUS Architecture

ICARUS (Langley & Choi, 2006) is a computational theory of thehuman cognitive architecture that posits:

These assumptions are not novel; it shares them with architectureslike Soar (Laird et al., 1987) and ACT-R (Anderson, 1993).

1. Short-term memories are distinct from long-term stores2. Memories contain modular elements cast as symbolic structures3. Long-term structures are accessed through pattern matching4. Cognitive processing occurs in retrieval/selection/action cycles5. Cognition involves dynamic composition of mental structures

Page 6: Lecture 30 - Auckland

Distinctive Features of ICARUS

However, ICARUS also makes assumptions that differentiate itfrom other architectures:

Some of these tenets also appear elsewhere, but only ICARUScombines them into a unified cognitive theory.

1. Cognition is grounded in perception and action2. Categories and skills are separate cognitive entities3. Short-term elements are instances of long-term structures4. Long-term knowledge is organized in a hierarchical manner5. Inference and execution are more basic than problem solving

Page 7: Lecture 30 - Auckland

ICARUS’ Memories and Processes

ConceptualConceptualMemoryMemory

BeliefBeliefMemoryMemory

ProblemProblem Memory Memory

ConceptualConceptualInferenceInference

ActionAction

PerceptionPerception

EnvironmentEnvironment

PerceptualPerceptualBufferBuffer

Problem SolvingProblem SolvingSkill LearningSkill Learning

MotorMotorBufferBuffer

Skill ExecutionSkill Execution

Skill MemorySkill Memory

Page 8: Lecture 30 - Auckland

Cascaded Integration in ICARUS

ICARUS adopts a cascaded approach to integration in whichlower-level modules produce results for higher-level ones.

conceptual inference

skill execution

problem solving

learning

Like other unified cognitive architectures, ICARUS incorporates anumber of distinct modules.

Page 9: Lecture 30 - Auckland

Research Goals for ICARUS

Our current objectives in developing ICARUS are to produce:

• a computational theory of high-level cognition in humans

• that is qualitatively consistent with results from psychology

• that exhibits as many distinct cognitive functions as possible

• that supports creation of intelligent agents in many settings

Modeling quantitative experimental results has its place but candelay achieving broad coverage (Cassimatis et al., 2009).

Page 10: Lecture 30 - Auckland

Intellectual Precursors

Like many other architectures, ICARUS builds on four key ideasthat are basic to AI and cognitive science:

• Physical symbol systems

• List structures and list processing

• Symbolic patterns and pattern matching

• Relational rules and dynamic composition

These insights date back to the field’s earliest days, but theyremain central to accounts of high-level cognition.

Page 11: Lecture 30 - Auckland

List Structures and List Processing

In 1956, Newell, Shaw, and Simon introduced list processing.This new framework became important to cognitive sciencebecause it could:

• Encode arbitrarily complex structural descriptions using• Symbols such as on, A, and B• Lists such as (on A B) and (eats agent John object soup)• List structures such as (goal me (not (on ?any B)))

• Create new structural descriptions dynamically• Use such structures to designate other structures• Interpret these structures to produce behavior

These abilities play crucial roles in developing computationalmodels of high-level processing.

Page 12: Lecture 30 - Auckland

Symbolic Patterns and Pattern Matching

Most human knowledge is generic in that it applies to distinctinstances of the same class of situations.We can state such knowledge as symbolic patterns that:

• Describe the structures held in common by these instances• Designate the class of situations with these structures• Omit structures that instances do not hold in common• Use variables to indicate subelements that may vary

Many symbolic systems interpret patterns by matching themagainst other symbol structures that describe a situation.

This opens the door to ‘declarative’ knowledge representationsthat support very flexible processing.

Page 13: Lecture 30 - Auckland

Symbolic Patterns as List Structures

We can encode symbolic patterns as lists or list structures thatcontain pattern-match variables. For instance,

(taller-than ?person1 ?person2) and (on ?block1 ?block2)

include variables ?person1, ?person2, ?block1, and ?block2.

We can encode complex patterns as lists of simple patterns thatmay share variables. For example,

((taller-than ?person1 ?person2) (taller-than ?person2 ?person3))

((on ?b1 ?b2) (on ?b2 ?b3) (wider ?b1 ?b2) (wider ?b2 ?b3))

are two complex patterns that have two and four subpatterns,respectively.

Page 14: Lecture 30 - Auckland

Relational Rules and Dynamic Composition

Symbolic patterns are often used to support rule-based reasoning.

For example, given the relational rule

If ((on ?x ?y) (on ?y z)) then (above ?x ?z)

and the working memory ((on B C) (on C D)) , one can infer theliteral (above B D) by substituting bindings from the condition.

Moreover, relational rules can be composed dynamically toproduce chains of reasoning. E.g., given the rule above and

If ((above ?x ?y) (on ?y ?z)) then (above ?x ?z))

and the working memory ((on A B) (on B C) (on C D)) , one caninfer the literals (above A C), (above B D), and (above A D).

This generative capacity gives rule-based systems great power.

Page 15: Lecture 30 - Auckland

Cascaded Integration in ICARUS

ICARUS adopts a cascaded approach to integration in whichlower-level modules produce results for higher-level ones.

conceptual inference

skill execution

problem solving

learning

Like other unified cognitive architectures, ICARUS incorporates anumber of distinct modules.

Page 16: Lecture 30 - Auckland

• Concepts are symbolic predicates that the agent can use forthis cognitive purpose.

• Conceptual rules / clauses provide definitions for thesesymbols that the agent uses for recognition and inference.

• Beliefs are instances of concepts that, taken together,describe the agent’s situation.

Concepts and Beliefs in ICARUS

ICARUS relies on three types of symbol structures to supportits interpretation of the environment.

Together, these three types of structures support conceptualinference, which lets ICARUS characterize its situation.

Page 17: Lecture 30 - Auckland

ICARUS Concepts for Urban Driving

((driving-well-in-rightmost-lane ?self ?line1 ?line2) :percepts ((self ?self) (segment ?seg) (line ?line1 segment ?seg))

(line ?line2 segment ?seg)) :relations ((driving-well-in-segment ?self ?seg ?line1 ?line2)

(not (lane-to-right ?line1 ?line2 ?anyline))))

((driving-well-in-segment ?self ?seg ?line1 ?line2) :percepts ((self ?self) (segment ?seg) (line ?line1 segment ?seg))

(line ?line2 segment ?seg)) :relations ((in-segment ?self ?seg)

(aligned-and-centered-in-lane ?self ?line1 ?line2) (steering-wheel-straight ?self)))

((aligned-and-centered-in-lane ?self ?line1 ?line2) :percepts ( (self ?self segment ?seg)

(line ?lane1 segment ?seg dist ?dist1 angle ?ang1) (line ?lane2 segment ?seg dist ?dist2 angle ?ang2))

:tests ((*nearly-equal ?dist1 ?dist2) (*nearly-equal ?ang1 ?ang2)))

Page 18: Lecture 30 - Auckland

Hierarchical Organization of Concepts

ICARUS organizes conceptual memory in a hierarchical manner.

The same conceptual predicate can appear in multiple clausesto specify disjunctive and recursive concepts.

conceptconcept clausepercept

Page 19: Lecture 30 - Auckland

ICARUS Beliefs / Percepts for Urban DrivingInferred beliefs:(current-street me A) (current-segment me g550)(lane-to-right g599 g601) (first-lane g599)(last-lane g599) (last-lane g601)(under-speed-limit me) (slow-for-right-turn me)(steering-wheel-not-straight me) (centered-in-lane me g550 g599)(in-lane me g599) (in-segment me g550)(on-right-side-in-segment me) (intersection-behind g550 g522)(building-on-left g288) (building-on-left g425)(building-on-left g427) (building-on-left g429)(building-on-left g431) (building-on-left g433)(building-on-right g287) (building-on-right g279)(increasing-direction me) (near-pedestrian me g567)Observed percepts:(lane-line line7 segment segment23 color white dist 2.3 angle 49 …)(building building15 address 756 dist-to-corner 21.2 angle-to-corner 49 …)(self segment segment23 speed 33 wheel-angle 10 throttle 80 …))

Page 20: Lecture 30 - Auckland

Conceptual Inference in ICARUS

Conceptual inference in ICARUS occurs from the bottom up.

Starting with observed percepts, this process produces high-levelbeliefs about the current state.

conceptconcept clausepercept

Page 21: Lecture 30 - Auckland

Cascaded Integration in ICARUS

ICARUS adopts a cascaded approach to integration in whichlower-level modules produce results for higher-level ones.

conceptual inference

skill execution

problem solving

learning

Like other unified cognitive architectures, ICARUS incorporates anumber of distinct modules.

Page 22: Lecture 30 - Auckland

• Skills are symbolic predicates that the agent can use forthis cognitive purpose.

• Skill clauses provide definitions for these symbols that theagent can use for retrieval and control.

• Intentions are instances of skills that, taken together,describe the agent’s planned environmental activity.

Skills and Intentions in ICARUS

ICARUS relies on three types of symbol structures to supportits interaction with the environment.

Together, these three types of structures support skill execution,which lets ICARUS carry out extended activities.

Page 23: Lecture 30 - Auckland

((driving-well-in-rightmost-lane ?self ?line1 ?line2) :percepts ((self ?self) (segment ?seg) (line ?line1 segment ?seg))

(line ?line2 segment ?seg)) :start ((not (lane-to-right ?line1 ?line2 ?anyline)) :subgoals ((driving-well-in-segment ?self ?seg ?line1 ?line2)))

((driving-well-in-segment ?self ?seg ?line1 ?line2) :percepts ((self ?self) (segment ?seg) (line ?line1 segment ?seg))

(line ?line2 segment ?seg)) :start ((steering-wheel-straight ?self)) :subgoals ((in-segment ?self ?seg)

(aligned-and-centered-in-lane ?self ?line1 ?line2) (steering-wheel-straight ?self)))

((aligned-and-centered-in-lane ?self ?line1 ?line2) :percepts ((self ?self)) :start ((misaligned-to-left-in-lane ?self ?line1 ?line2)) :requires ((not (steering-to-right ?self))) :actions ((*steer 20)))

ICARUS Skills for Urban Driving

Page 24: Lecture 30 - Auckland

Hierarchical Organization of SkillsICARUS organizes skills in a hierarchical manner, which eachskill clause referring to its component subskills.

skillskill clauseoperator

The same skill can involve multiple clauses to allow disjunctive,conditional, and recursive procedures.

Page 25: Lecture 30 - Auckland

Skill Execution in ICARUS

skillskill clauseoperator

Execution operates from the top down, starting with high-levelintentions, to find paths through the skill hierarchy.

A skill clause is applicable if its conditions hold and its effectsare unsatisfied, given bindings from above.

Page 26: Lecture 30 - Auckland

Skill Execution in ICARUS

However, ICARUS prefers to continue ongoing skills when theymatch, giving it a bias toward persistence over reactivity.

This process repeats on later cycles to produce hierarchical butreactive behavior (Nilsson, 1994).

skillskill clauseoperator

Page 27: Lecture 30 - Auckland

Cascaded Integration in ICARUS

ICARUS adopts a cascaded approach to integration in whichlower-level modules produce results for higher-level ones.

conceptual inference

skill execution

problem solving

learning

Like other unified cognitive architectures, ICARUS incorporates anumber of distinct modules.

Page 28: Lecture 30 - Auckland

• Problems are symbolic structures that specify desiredsituations in the environment.

• Each problem comprises a set of goals that describedesired or undesired beliefs.

• Problem stacks are chains of problems and intentions thatenable progress toward problem solution.

Problems, Goals, and Stacks

ICARUS relies on three types of symbol structures to respondto unfamiliar tasks that it encounters.

Together, these three types of structures let ICARUS engage inproblem solving to generate novel behavior.

The architecture uses a version of means-ends analysis (Newell,Shaw, & Simon, 1959) for this purpose.

Page 29: Lecture 30 - Auckland

Problem Solving in ICARUS

The architecture invokes problem solving when it encountersan unsolved problem (set of goals).

ABC

Page 30: Lecture 30 - Auckland

Problem Solving in ICARUS

ICARUS creates an intention based on this skill instance with theoriginal problem as its parent.

The architecture retrieves relevant skill instances and uses itsheuristics to select one of them.

ABC

Page 31: Lecture 30 - Auckland

Problem Solving in ICARUS

When such an impasse occurs, ICARUS chains backward off theintention’s instantiated conditions to create a subproblem.

In some cases, the generated intention is not applicable becauseits conditions do not match current beliefs.

ABC

Page 32: Lecture 30 - Auckland

Problem Solving in ICARUS

This leads it to select a skill instance and generate an intentionthat should lead toward the subproblem’s solution.

The architecture applies the same machinery recursively to thesubproblem in an attempt to solve it.

ABC

Page 33: Lecture 30 - Auckland

Problem Solving in ICARUS

The new situation in turn produces revised beliefs that shouldsatisfy at least some of the problem’s goals.

Once it has generated an applicable intention, ICARUS executesit in the environment, changing the situation.

AB

C

Page 34: Lecture 30 - Auckland

Problem Solving in ICARUS

In such cases, ICARUS reactivates the parent problem and looksfor a skill that would achieve additional goals.

But the resulting beliefs may not yet satisfy all of the problem’sgoals, in which case it continues the process.

AB

C

Page 35: Lecture 30 - Auckland

Problem Solving in ICARUS

When this effort produces another applicable intention, ICARUSexecutes it, changing the situation further.

This leads the architecture to generate another intention, checkits application conditions, and so on.

AB

C

Page 36: Lecture 30 - Auckland

Problem Solving in ICARUS

When this occurs, ICARUS realizes that it has solved the problemand pops it from the problem stack.

With luck, this process eventually leads to a situation and tobeliefs that satisfy all of the problem’s goals.

AB

C

Page 37: Lecture 30 - Auckland

Problem Solving in ICARUS

In response, ICARUS calls on the execution module to carry outthis intention in the environment.

Because the subproblem was linked to the conditions of theinitial intention, this means the latter is applicable.

AB

C

Page 38: Lecture 30 - Auckland

Problem Solving in ICARUS

Accordingly, ICARUS may generate another intention, which maylead to new subproblems or may be applicable.

But, again, this intention may not by itself solve the originalproblem, which involves multiple goals.

AB C

Page 39: Lecture 30 - Auckland

Problem Solving in ICARUS

This variant of means-ends analysis can make incorrect guesses,in which case the module backtracks and tries other choices.

With luck, ICARUS will finally achieve all goals for the initialproblem, at which point it turns to other tasks.

AB C

Page 40: Lecture 30 - Auckland

1. Skill acquisition involves monotonic addition to memory of newsymbolic structures.

2. Learning is driven by experience but draws on prior knowledge.

3. Skill learning operates in an incremental and cumulative manner.

4. The learning process is interleaved with problem solving, whichprovides it with experience.

5. Skills are generalized traces of successful means-ends analysis.

Skill Acquisition in ICARUS

Humans acquire skills that store results of experience to improvetheir future performance; in ICARUS:

ICARUS shares the first four assumptions with other cognitivearchitectures like Soar and ACT-R, but the fifth is novel.

Page 41: Lecture 30 - Auckland

Learning from Problem Solutions

• operates whenever problem solving overcomes an impasse• incorporates only information stored locally with goals• generalizes beyond the specific objects concerned• depends on whether chaining involved skills or concepts• supports cumulative learning and within-problem transfer

ICARUS incorporates a mechanism for learning new skills that:

This skill creation process is fully interleaved with means-endsanalysis and execution.

Learned skills carry out forward execution in the environmentrather than backward chaining in the mind.

Page 42: Lecture 30 - Auckland

Execution and Problem Solving in ICARUS

Skill Hierarchy

ReactiveExecution

impasse?

ProblemSolving

yes

no

Problem solving involves means-ends analysis that chains backward over skillsand concept definitions, executing skills whenever they become applicable.

Primitive Skills

Problemgoal

beliefs

Generated Plan

Page 43: Lecture 30 - Auckland

ICARUS Learns Skills from Problem Solving

Skill Hierarchy

ReactiveExecution

impasse?

ProblemSolving

no

Primitive Skills

Problemgoal

beliefs

Generated Plan

SkillLearning

yes

Page 44: Lecture 30 - Auckland

Skill Learning in ICARUS

Each trace includes details about the goal, skill, and the initiallysatisfied and unsatisfied subgoals.

goalskill clauseoperator

As the architecture carries out means-ends analysis, it retainstraces of successful decompositions.

Page 45: Lecture 30 - Auckland

Skill Learning in ICARUS

Thus, problem solving operates from the top down, but ICARUSacquires skills from the bottom up.

goalskill clauseoperator

As the system solves achieves each subgoal, it generalizes theassociated trace and stores it as a new skill.

Page 46: Lecture 30 - Auckland

Skill Learning in ICARUS

Each newly learned skill is available for use on future problems,thus supporting structural transfer.

goalskill clauseoperator

When the architecture achieves new goals, or the same goal in adifferent way, it gradually expands the hierarchy.

Page 47: Lecture 30 - Auckland

Skill Learning in ICARUS

Over time, the architecture acquires a broad set of skills thatminimize the need for problem solving.

goalskill clauseoperator

These new acquisitions occur with both top-level goals and withones internal to the skill hierarchy.

Page 48: Lecture 30 - Auckland

Creating Synthetic Agents with ICARUSU

rban

Com

bat

Urb

an D

rivin

gRu

sh 2

008

Mad

RTS

Page 49: Lecture 30 - Auckland

Robin Hood in the Twig Environment

Page 50: Lecture 30 - Auckland

Concluding Remarks

• Grounds high-level cognition in perception and action• Treats categories and skills as separate cognitive entities• Organizes skills and concepts in a hierarchical manner• Combines teleoreactive control with means-ends analysis• Acquires new skills from successful problem solving

In this lecture, we covered ICARUS, a theory of the cognitivearchitecture that:

ICARUS combines ideas from many different traditions in aunified account of high-level cognition.

The framework also offers a high-level programming languagefor developing cognitive models and intelligent agents.

Page 51: Lecture 30 - Auckland

Selected ReferencesChoi, D., Konik, T., Nejati, N., Park, C., & Langley, P. (2007). Structural transfer of

cognitive skills. Proceedings of the Eighth International Conference on CognitiveModeling. Ann Arbor, MI.

Choi, D., Konik, T., Nejati, N., Park, C., & Langley, P. (2007). A believable agent forfirst-person shooter games. Proceedings of the Third Annual Artificial Intelligenceand Interactive Digital Entertainment Conference. Stanford, CA: AAAI Press.

Langley, P. (2006). Cognitive architectures and general intelligent systems. AIMagazine, 27, 33-44.

Langley, P., & Choi, D. (2006). A unified cognitive architecture for physical agents.Proceedings of the Twenty-First National Conference on Artificial Intelligence.Boston: AAAI Press.

Langley, P., Choi, D., & Rogers, S. (2009). Acquisition of hierarchical reactive skills ina unified cognitive architecture. Cognitive Systems Research, 10, 316-332.

Langley, P., Laird, J. E., & Rogers, S. (2009). Cognitive architectures: Research issuesand challenges. Cognitive Systems Research, 10, 141-160.

Li, N., Stracuzzi, D. J., Langley, P., & Nejati, N. (2009). Learning hierarchical skillsfrom problem solutions using means-ends analysis. Proceedings of the Thirty-FirstAnnual Meeting of the Cognitive Science Society. Amsterdam.

For downloadable papers, documentation, and software, see http://www.isle.org/icarus/