giving agents artificial intuition - trinity universityyzhang/reu/2008/program/reports/final... ·...
TRANSCRIPT
Giving Agents Artificial Intuition
Jason LeezerComputer Science REU
What is an Agent?- Perceives it's environment- Reasons- Acts upon it's environment
What is Deliberation?- Slow- Effortful- Deliberately Controlled
What is Intuition?- Fast- Associative- Automatic
A bat and a ball cost $1.10 in total. The bat costs $1 more than the ball.How much does the ball cost?
Shane Frederick (personal communication, April 2003)
Ball Costs $0.10Bat Costs $1.00
Right?
A+
Wrong!
(Ball) (Bat) (Total) $0.05 + $0.05 + $1.00 = $1.10
Motivation
Reproducing Human Behaviour is Difficult- Can't use utility theory- Humans don't always act rational
Human Simulations Could Aid Social Sciences- Allow situations/scenarios they would be
unable to create in the natural world- Allow experiments to be performed on a
massive scale in a much shorter time frame
How to Reproduce IntuitionIntuition is based strongly on accessibility – Khaneman
Accessibility - “the ease with which particular mental contents come to
mind”
Our Method:If an agent's current state is close to a state in
memory, then the best action can be inferred from the past experience
Intuitive Decisions:- Restrict amount and type of information used- Information accessible from current state
Deliberative Decisions:- No restriction- All information is accessible
Q – Learning:- Assigns a “Q-Value” to every state action pair- In each state the action that corresponds to the
highest value is chosen.- Q-Value's are updated after each state transition- α is the learning rate, r is the reward, γ is the
discount factor
Intuition Deliberation
Yes No
Am I Familiar With This?
Satisfying Choice
The Environment
Actions
Perception
I'll take x, you get 10 - x
Accept:Person A get x,
Person B gets 10 - xReject:
Both get $0
Person A
Person B
The Ultimatum Game
How people play the game:
Human Players
Rational Players
Rational Players
0 1 2 3 4 5 6 7 8 9 100
20
40
60
80
100
120
140
BothAll IntuitionAll Deliberation
Split Value
Num
ber t
ime
s ac
cept
ed
ResultsIntuition Deliberation Agents
Human Players
Rational Players
Human Players
ResultsIntuition Deliberation Agents
Rational Players
Human Players
ResultsIntuition Deliberation Agents
Conclusion & Future Work- While difficult, human behavior can be reproduced-Both Intuition and Deliberation decision making needed-- The learning algorithm plays a big role
What's Next?
-Investigate the effectiveness of other learning algorithms
-Restrict other types of information
-Reproduce human social behavior
Thanks to
Dr. Zhang
Dr. Lewis Dr. Drennon