learning unit vi. how do we learn? learning- the process of acquiring new and relatively enduring...

LearningUnit VI

How do we learn?

• Learning- the process of acquiring new and relatively enduring information or behaviors

• By learning, we are able to adapt to our environments

• John Locke agreed with the ideas of Aristotle and said that we learn by association

• This means that our minds naturally connect events that occur in sequence *

How do we learn?

• An example of learning by association:– You see and smell freshly baked bread, eat some,

and find it satisfying. The next time you see and smell fresh bread, you will expect that eating it will again be satisfying

– Give some one a red pen and ask them to correct someone’s essay, they will find more errors than if you had given them a black pen *

How do we learn?

• Habituation- an organism’s decreasing response to a stimulus with repeated exposure to it– A dog and a dog whistle

• Associative learning- learning that certain events occur together– The events may be two stimuli or a response and its

consequences– The process of learning associations is called

conditioning *

How do we learn?

• In classical conditioning we learn to associate two stimuli and thus to anticipate events– We learn that a flash of lightning signals an impending crack

of thunder so when lightning strikes we begin to brace ourselves

• Stimulus- any event or situation that evokes a response

• In operant conditioning we learn to associate a response and its consequence– We learn to repeat acts followed by good results and avoid

acts followed by bad results *

How do we learn?

• Cognitive learning- the acquisition of mental information, whether by observing events, by watching others, or through language– Observational learning is one form of cognitive

learning• it lets us learn from others’ experiences *

Classical Conditioning

• Ivan Pavlov’s studies on classical conditioning are considered to be one of if not the most famous psychological research experiments in history

• Classical conditioning- a type of learning in which one learns to link two or more stimuli and anticipate events

• Pavlov’s work laid the foundation for much of John B. Watson’s work *


• Watson urged his colleagues to discard reference to inner thoughts, feelings, and motives

• Watson believed the goal of psychology was to predict and control behavior

• Introspection was worthless according to Watson

• Watson believed psychology should be an objective science based on observable behaviors *


• Watson’s ideas became known as behaviorism– Behaviorism- the view that psychology should be an

objective science and that psychology should study behavior without reference to mental processes

– Most psychologists today agree with part one of behaviorism but not part two

• Watson and Pavlov believed that the basic laws of learning were the same for all animals *

Classical ConditioningPavlov’s experiment

• Pavlov was a doctor that first studied the digestive system

• Pavlov’s turned towards learning accidently• When conducting his digestive system studies he

realized that the dogs would salivate without fail each time food was put in their mouth

• He then realized that the dog would even start to salivate at the mere sight of the food, at the sight of the food dish, or even at the sound of a person’s footsteps

• He started to realize that this salivation was showing a form of learning *


• Pavlov started to experiment on why the dog started to salivate even before they saw the food

• To eliminate extraneous variables they isolated the dog in a small room, secured it in a harness, and attached a device to divert its saliva to a measuring instrument *

Classical ConditioningPavlov’s Experiment

• From the next room, they presented food to the dog• They then paired various neutral stimuli with food in

the dog’s mouth– A stimulus that elicits no response before conditioning– Events the dog could see or hear but didn’t associate them

with food• What they wanted to find out was if a sound or sight

was regularly signaled with the arrival of food, would the dog learn the link between the two *


• If the dog did learn the link between the two, would they salivate anticipating food

• They found the answer to both questions to be yes

• Just before placing food in the dog’s mouth to produce salivation, Pavlov sounded a tone

• After a few times of pairing the tone with the presentation of food, the dog began to salivating to the tone alone *


• Pavlov later found that a pairing with a buzzer, a light, a touch on the leg, and the sight of a circle all led to the dog salivating anticipating food

• A dog does not learn to salivate in response to food in its mouth

• Food in the mouth automatically, unconditionally, triggers a dog’s salivary reflex *


• Pavlov called the natural drooling an unconditioned response(UR)– Unconditioned response- an unlearned, naturally

occurring response to an unconditioned stimulus• The food was an unconditioned stimulus– A stimulus that unconditionally- naturally and

automatically- triggers a response(US) *


• Salivation in response to the tone is learned so it is called a conditioned response(CR)– A learned response to a previously neutral stimulus– The response is conditioned upon the dog’s

associating the tone and the food• The stimulus that used to be neutral like the tone

becomes the conditioned stimulus(CS)– An originally irrelevant stimulus that after association

with an unconditioned stimulus comes to trigger a conditioned response *


• Conditioned=learned• Unconditioned=unlearned• Try the example of the tone and a puff of air in

your eye– Identify the NS, US, UR, CS, CR

• Pavlov would spend the next three decades looking into his conditioning studies even further *


• Acquisition- in classical conditioning, the initial stage, when one links a neutral stimulus and an unconditioned stimulus so that the neutral stimulus begins triggering the conditioned response

• What happens if the US appeared before the NS?– Conditioning usually does not happen when the

NS follows the US *


• Higher-order conditioning- a procedure in which the conditioned stimulus in one conditioning experience is paired with a new neutral stimulus, creating a second conditioned stimulus– Example: an animal that has learned that a tone

predicts food might then learn that a light predicts the tone and being responding to the light alone *


• Higher-order conditioning is often called second order conditioning

• High-order conditioning tends to be weaker than first-order conditioning *


• What happens if after conditioning, the CS occurred repeatedly without the US?

• If the tone sounded again and again but no food appeared what would happen?

• The dogs would start to salivate less and less• Extinction- the diminishing of a conditioned

response; occurs in classical conditioning when an unconditioned stimulus does not follow a conditioned stimulus *


• Spontaneous recovery- the reappearance, after a pause, of an extinguished conditioned response

• This made Pavlov think that extinction was suppressing the CR rather than eliminating it *


• Pavlov noticed that a dog conditioned to the sound of one tone also responded somewhat to the sound of a new and different tone

• Generalization- the tendency, once a response has been conditioned, for stimuli similar to the conditioned stimulus to elicit similar responses *


• Generalization can be adaptive• Toddlers taught to fear moving cars can also

become afraid of moving trucks and motorcycles

• Generalized fears can linger for awhile• We tend to like unfamiliar people more if they

look some what like someone we’ve learned to like rather than dislike


• Discrimination- the learned ability to distinguish between a conditioned stimulus and stimuli that do not signal an unconditioned stimulus

• In Pavlov’s experiment, the dogs learned to salivate to a particular tone and not to other tones

• Being able to recognize these differences is adaptive• Confronted by a guard dog, your response may be

different than when you confronted by a guide dog *


• Most psychologists today agree that classical conditioning is a basic form of learning

• According to current psychologists, Pavlov’s ideas were incomplete, because modern psychology is a combination of biology, psychology, and cultural environment

• The main thing learned from his studies is that classical conditioning is one way that virtually all organisms learn to adapt to their environment *


• Pavlov also showed us how learning can be studied objectively

• Pavlov’s work involved very little if any subjective judgments or guesses about what went on in a dog’s mind

• Pavlov presented a way to study psychology scientifically *


• Pavlov’s work provided a basis for Watson’s idea that human emotions and behaviors are mainly a bundle of conditioned responses

• Watson’s Little Albert experiment is an example of classical conditioning *

Operant conditioning

• Classical conditioning and operant conditioning are both examples of associate learning but they are very different

• Classical conditioning forms associations between stimuli and it involves respondent behavior– Actions that are automatic responses to a stimulus

• Operant conditioning- a type of learning in which behavior is strengthened if followed by a reinforcer or diminished if followed by a punisher *

Operant conditioning

• In operant conditioning actions followed by reinforcers increase; those followed by punishers often decrease

• Behavior that operates on the environment to produce rewarding or punishing stimuli is called operant behavior *

Operant conditioningSkinner’s experiments

• BF Skinner would become behaviorism’s most influential and controversial figure

• Skinner’s work elaborated on what psychologist Edward L. Thorndike called the law of effect– Behaviors followed by favorable consequences

become more likely, and that behaviors followed by unfavorable consequences become less likely

• Skinner would use Thorndike’s ideas as a starting point


• Skinner would develop a behavioral technology that revealed principles of behavior control

• For his studies, he designed the operant chamber– A chamber containing a bar or key hat an animal can

manipulate to obtain a food or water reinforcer– Attached devices record the animal’s rate of bar

pressing or key pecking– Also called the Skinner Box *


• The skinner box allowed Skinner to study his idea of reinforcement

• Reinforcement- any event that strengthens the behavior it follows

• What is reinforcing depends on the animal and the conditions

• For people, it could be praise, attention, or a paycheck

• For rats it could be food or water *


• Shaping- an operant conditioning procedure in which reinforcers guide behavior toward closer and closer approximations of the desired behavior

• Shaping works on building on natural behaviors• With the rat, you would give food to the rat once it

starts toward the bar, then only give it to the rat when it gets even closer, and lastly only give the rat the food once it touches the bar *


• In shaping you are hoping for successive approximations– You reward responses that are ever closer to the

final desired behavior, and you ignore all other responses

• By making rewards contingent on desired behaviors, researchers and animal trainers gradually shape complex behaviors *


• Discriminative stimulus- a stimulus that elicits a response after association with reinforcement

• Example: when experimenters reinforced pigeons for pecking after seeing a human face, but not after seeing other images, the pigeon’s behavior showed that it could recognize human faces. The human face was a discriminative stimulus. *

Operant conditioningTypes of Reinforcers

• Positive reinforcement- increasing behaviors by presenting positive reinforcers– Any stimulus that, when presented after a

response, strengthens the response• Negative reinforcement- increasing behaviors

by stopping or reducing negative stimuli– Any stimulus that, when removed after a

response, strengthens the response– Negative reinforcement is not punishment *


• Negative reinforcement can be a reason why drug users go back to using after they have tried stopping. Why?

• Negative reinforcement is something that provides relief

• Just remember, reinforcement is something that strengthens a certain behavior *


• Primary reinforcer- an innately reinforcing stimulus, such as one that satisfies a biological need– food

• Conditioned reinforcer(Secondary reinforcer)- a stimulus that gains its reinforcing power through its association with a primary reinforcer– In the Skinner Box, if a rat figures out that a light signals

food coming, they will work to turn the light on– Money, good grades *


• When rewarding behavior you have to be careful to only present the reward after the desired behavior

• For the rat, do not present the reward immediately following the rat scratching, sniffing, or just moving around

• What happens if there is a delay for the presentation of the reward after the desire behavior is performed? *


• For the rat, if the delay lasts more than 30 seconds, they will not link the reward with the desired behavior

• Humans do respond to delayed reinforcers• Paychecks at the end of the week, the good grade

at the end of the term, the trophy at the end of the season

• Often times small but immediate consequences are sometimes more attractive than big but delayed consequences *

Operant conditioningTypes of reinforcement schedules

• Reinforcement schedules- a pattern that defines how often a desired response will be reinforced

• Continuous reinforcement- reinforcing the desired response every time it occurs– Learning occurs rapidly, which makes this the best

choice for mastering a behavior– Extinction occurs rapidly– When reinforcement stops, the desired behavior stops

soon as well– Real life rarely provides continuous reinforcement *


• Partial (intermittent) reinforcement- reinforcing a response only part of the time– Slower acquisition of a response but much greater

resistance to extinction• Types of reinforcement schedules– Fixed-ratio– Variable-ratio– Fixed-interval– Variable-interval *


• Fixed ratio schedule- a reinforcement schedule that reinforces a response only after a specified number of responses– Free drink after 10 purchases– Once conditioned, animals will pause only briefly

after a reinforcer before returning to a high rate of responding *


• Variable-ratio- a reinforcement schedule that reinforces a response after an unpredictable number of responses– Slot machines, fishing, and playing golf– These conditioned behaviors are hard to

extinguish– Reinforcers increase as the number of responses

increase, producing high rates of responding *


• Fixed-interval- a reinforcement schedule that reinforces a response only after a specified time has elapsed– Animals on this type of schedule tend to respond

more frequently as the anticipated time for reward draws near

– Checking the mail– This produces a choppy stop-start pattern rather

than a steady rate of response *


• Variable-interval- a reinforcement schedule that reinforces a response at unpredictable time intervals– Produce slow, steady responding– Rechecking email or facebook – It is unknown when the waiting will end *


• Response rates are usually higher when reinforcement is linked to the number of responses(ratio) rather than to time (interval)

• Responding is more consistent when reinforcement is unpredictable (variable) than when it is predictable (fixed)

• Skinner believed the reinforcement principles were universal for all animals *

Operant conditioningPunishment

• Punishment- an event that tends to decrease the behavior that it follows

• Swift and sure punishers can powerfully restrain unwanted behavior

• A child who is burned by touching a hot stove will learn not to repeat those behaviors

• Criminal behavior is influenced by swift and sure punishers than by the threat of severe sentences *


• Punished behavior is suppressed not forgotten. This suppression of the behavior may negatively reinforce parents’ punishing behavior

• Punishing teaches discrimination among situations– Just don’t swear around the house

• Punishment can teach fear *


• Physical punishment may increase aggression by modeling aggression as a way to cope with problems

• Punishment tells you what not to do, reinforcement tells you what to do *

Operant conditioningSkinner’s Legacy

• Skinner repeatedly how he thought external influences not internal thoughts and feelings shape behavior

• Skinner encouraged people to use operant principles to influence the behavior of others at work, school, and home

• Skinner’s critics said that he dehumanized people by neglecting their personal freedom and by seeking to control their actions *

Operant conditioningSkinner’s Legacy

• Skinner said that he was using operant conditioning to better people’s lives

• He stated that using reinforcers would be more humane than using punishment *

Biofeedback

• Biofeedback- a system for electronically recording, amplifying, and feeding back information regarding subtle physiological state, such as blood pressure or muscle tension

• Helps in training people to counteract harmful things like stress

• It allows people to learn techniques of controlling a particular physiological response

• Further research found the early results of biofeedback to be overblown and oversold– It does work in tension headaches *

Contrasting Classical and Operant Conditioning

• Both are associative learning• Both involve acquisition, extinction, spontaneous

recovery, generalization, and discrimination• Classical conditioning involves an association of two

stimuli we do not control, and we respond automatically(respondent behaviors)

• Operant conditioning we associate our own behaviors that act on our environment to produce rewarding or punishing stimuli(operant behaviors) *

Cognitive influences on conditioning

• Watson and Pavlov underestimated the importance of biological constraints on an organism’s capacity to learn and the effects of cognitive processes

• Early behaviorists believed animals could be reduced to mindless mechanisms responding to rewards or punishment with not thinking

• Later studies found that animals could learn to expect a certain event– The more predictable the association, the stronger the

conditioned response *


• Cognitive map- a mental representation of the layout of one’s environment– Rats exploring a map, given no obvious reward will do

this– After exploring a maze, rats act as if they have learned

a cognitive map of it– The learning is usually not displayed until food is

placed in the maze’s goal box– Most of the rats completed the task as quickly as those

that had learned the maze through reinforcement *


• Latent learning- learning that occurs but is not apparent until there is an incentive to demonstrate it

• The point you need to remember with latent learning and cognitive maps is that there is more to learning than associating a response with a consequence; there is also cognition *


• Some learning occurs after little or no systematic interaction with our environment

• We may puzzle over a problem, and suddenly, the pieces fall together as we perceive the solution in a sudden flash of insight– A sudden realization of a problem’s solution *


• Promising people a reward for a task they already enjoy can backfire

• Excessive rewards can destroy intrinsic motivation– A desire to perform a behavior effectively for its

own sake• Overjustification- the overuse of bribes leading

people to see their actions as externally controlled rather than internally appealing *


• Extrinsic motivation- a desire to perform a behavior to receive promised rewards or avoid threatened punishment

• Rewards used to signal a job well done instead of ones that are meant to bribe or control someone are more effective

• These kind of rewards can help boost intrinsic motivation *

Learning and personal control

• Problems in life are unavoidable• We have to learn how to cope with these problems– Coping- alleviating stress using emotional, cognitive, or

behavioral methods• These problems are called stressors• Some of them we deal with directly with problem-

focused coping– Attempting to alleviate stress directly- by changing the

stressor or the way we interact with that stressor *


• If we have a family fight, we may go directly to that family member to work things out

• We tend to use problem-focused strategies when we feel a sense of control over a situation and think we can change the circumstances, or at least change ourselves to deal with the circumstances more capably *


• Emotion-focused coping- attempting to alleviate stress by avoiding or ignoring a stressor and attending to emotional needs related to one’s stress reaction

• We turn to this form of coping when we cannot or believe we cannot change a situation

• If we cannot get along with the family member, we may search for stress relief by reaching out to friends for support and comfort *


• Emotion-focused coping can be adaptive• We exercise or keep busy with hobbies to

avoid thinking about an old addiction• It can also be maladaptive as when students

go out and party to cope with stress from needing to read for class *


• Often times, we are presented with stressors that we seem to not be able to get past or cope with

• Feeling helpless and oppressed may lead to a state of passive resignation called learned helplessness– The hopelessness and passive resignation an

animal or human learns when unable to avoid repeated aversive events *


• Martin Seligman discovered the idea of learned helplessness by studying dogs

• The dogs were strapped in a harness and given repeated shocks, with no opportunity to avoid them

• Later, when placed in another situation where they could escape the punishment by simply leaping a hurdle, the dogs cowered as if without hope

• Animals able to escape the first shocks learned personal control and easily escaped the shocks in the new situation *


• Humans who are faced with repeated traumatic events over which they have no control, may start to feel helpless, hopeless, and depressed

• Perceiving a loss of control, we become more vulnerable to stress and ill health

• Elderly placed in a nursing home have been found to decline faster and died sooner if they perceived they have no control over their activities *


• Increasing self control noticeably improves health and morale

• Losing control provokes an outpouring of stress hormones

• Captive animals experience more stress and are more vulnerable to disease than are wild animals *


• Hundreds of studies have compared people who differ in their perceptions of control

• On one side are those that have an external locus of control– The perception that chance or outside forces

beyond our personal control determine our fate• On the other side are those that have an

internal locus of control– The perception that you control your own fate *


• Internals have achieved more in school and work, acted more independently, enjoyed better health, and felt less depressed that did externals

• Internals are better at delaying gratification and coping with various stressors

• Studies have shown that people that believe in free will, or that willpower is controllable, learn better, perform better at work, and are more helpful *


• Self-control- the ability to control impulses and delay short-term gratification for greater long-term rewards

• This ability has predicted good adjustment, better grades, and social success

• Using self-control will usually deplete the mental energy needed for self-control but with rest it will recover *

Learning by observation

• Observational learning- learning by observing others- also called social learning

• This usually happens in higher animals, especially humans, because it involves learning without direct experience

• By watching someone burn their hand on a hot stove, we are able to learn to not touch the stove *


• We learn our native languages and various other specific behaviors by observing and imitating others

• this process is called modeling– The process of observing and imitating a specific

behavior• Albert Bandura was the pioneering researcher

for observational learning *


• Bandura’s Bobo the Clown experiment studies observational learning

• Bandura determined from his studies that we experience vicarious reinforcement or vicarious punishment, and we learn to anticipate a behavior’s consequences in situations like those we are observing

• We are more likely to learn from people we perceive to be similar to ourselves *


• Mirror neurons- frontal lobe neurons that some scientists believe fire when performing certain actions or when observing another doing so

• The brain’s mirroring of another’s action may enable imitation and empathy

• Studies have found that neurons fire when a monkey grasps, holds, or tears something and that the same neurons fire when the monkey watches someone else doing so

• When one monkey sees, its neurons mirror what another monkey does *


• Prosocial behavior- positive, constructive, helpful behavior

• This is the opposite of antisocial behavior• Many business organizations effectively use

behavior modeling to help new employees learn communications, sales, and customer service skills

• Models are most effective when they are consistent *

Extra terms or ideas to know• Interval schedules are more resistant to

extinction than ratio• Variable schedules are more resistant to

extinction than fixed• Ration schedules produce the highest rates of

response• Desensitization- the diminished emotional

responsiveness to a negative or aversive stimulus after repeated exposure to it

• Tokens-using plastic poker chips or similar tokens instead of money *

Extra terms or ideas to know

• Vicarious- It refers to the process of learning behaviors through observation of reward and punishment, rather than through direct experience. For example, children who have grown up with older brothers and sisters often learn about behavior and expectations through watching their siblings. When they see their sibling get rewarded for a certain action, they learn that they should also do the behavior. *

learning unit vi. how do we learn? learning- the process of acquiring new and relatively enduring...

Documents

type of learning

example of learning

learning unit vi slide

history classical conditioning

form of cognitive learning

operant conditioning

watson watson

basic laws of learning