an integrative review of cognitive learning theories

An integrative Review of Cognitive Learning Theories

Olena Chambers

Introduction

Learning is a lifelong process which begins in embryonic stage and continues throughout

life. Obviously pre-school learning and post-academic self-study (like learning a hobby)

are very informal and are less important in terms of efficiency of learning for a number of

reasons. Also the tasks in acquiring competency rarely match the complexity of academic

tasks. For these reasons, this assignment is focused on the formal learning only – from

school age to University and other tertiary study. This essay-integrative review is based

on the last four of the five given theories where the researchers focus on effective

learning: Schema development, Cognitive load theory, Skill learning and the

development of expertise and Metacognition.

While students try to make their learning more efficient by using their preferred learning

styles and adopting more appropriate learning techniques (e.g. memorisation), it is

arguably their instructors’ job to make their learning more efficient by reducing the effort

they spend on the acquisition of the new material and on placing this new material in the

appropriate niche, linking it with the existing concepts and enriching their understanding

of how things work.

Throughout years of practice, teachers, guided by their experience, would improve the

design of the material they give to their students based on the students’ performance, time

and effort the students spend to acquire the new knowledge, e.g. regardless of prior

studying of Cognitive Load Theory, an expert teacher, before giving the class a text with

a diagram over the page, would most likely photocopy the diagram, especially, if the

students need to constantly flip the page to answer questions on the diagram. However,

knowledge and understanding of effective learning theories gives a huge advantage to

those teachers who really want to get to the bottom line of how not only expertise in

learning develops, but how we, teachers, can optimise this process the best possible way

– not driven by trial and error and just using common sense, like in case with the diagram

over the page, but equipped with sound knowledge of the theoretical principles and

understanding of how they work..

Even if we possess excellent knowledge of our subject and its teaching methodologies,

very often these methodologies, anecdotally, do not fully take into account the very

closely linked process of students’ learning. When a teacher knows, for example, that our

short-term memory can only hold 7 +-2 units of information, s/he will be able to design a

list of new vocabulary chunking it better; knowledge and understanding of

Metacognition, for example, will help to structure revision and evaluation sessions.

In this assignment the interrelationships between the four abovementioned theories are

analysed in the form of explanation of one theory, merged with an explanation of another

theory, focusing on the similarities of approaches and the parts of the Information

Processing Model they focus on.

The original version of the assignment had to be significantly shortened as it exceeded

the recommended volume.

Research in Cognitive Learning Theories

Before we proceed with the development of an integrated whole theory of how cognitive

learning theories can be used to create more effective learning, we need to look at the

way the brain works in regard to the construction of knowledge from both auditory and

visual sources. To do this we need to review the research on human cognitive architecture

and cognitive load theory. This review is necessary for 2 reasons:

1. The recognition of auditory information takes place within the brain. This decoding

and integration of audio information into existing schemata is extremely complex and

ultimately has an impact on the way students assimilate and integrate both audio and

visual information. This assimilation is known to be a complex process. It is also known

(through research) that this processing can be enhanced by manipulating the instructional

design methodology.

2. Cognitive learning theories play a part in redefining learning and the process of

Instructional Design. As Cooper (1998) states in his review of cognitive load theory “The

fundamental tenet of cognitive load theory is that the quality of instructional design will

be raised if greater consideration is given to the role and limitations, of working

memory.” If we can enhance the instructional design methods used will we then see a

follow on enhancement of student learning outcomes and the promotion of more

effective learning?

Human Cognitive Architecture

According to the modal model of human memory (attributed to Atkinson & Shiffrin,

1968), human memory can be divided into 3 memory sub-systems. These are the sensory

store, short term memory and long term memory. As will be seen each of these types of

memory systems have attributes that can and do affect the way material to be learned is

perceived and processed.

The Modal Model. (Cooper, 1998)

As can be seen from the modal model diagram, there are 3 forms of memory with

separate processors for audio and visual information. As will be discussed these

processing systems and their associated limitations directly influence learning.

Sensory memory

The first memory that receives information from the senses is called sensory memory.

Information in sensory memory is essentially uninterpreted however it:

a. holds information for basic processing of features

b. can hold many items at once

c. has a large capacity

d. allows for very brief retention of images.

Testing carried out by researchers has revealed retention of information in this

memory system to be about .3 seconds and about 3 seconds for hearing.

It has been theorized that there are several types of sensory memories:

iconic memory – for visual and textual information

echoic memory – auditory – Sperling, 1960

Haptic memory – touch

Transfer of information from sensory memory to working memory occurs via attention.

Attention is a selective selection process that takes place in working memory. Once

working memory has determined what requires attention the information is moved from

the sensory memory to the Working memory.

Students learning in lectures for example will use both iconic (PowerPoint and/or notes)

and echoic memory (Lecturers voice) and will need to selectively attend to appropriate

information from both information streams. Enhancing the students’ ability to attend and

process this information can be theorized to positively impact on the students’ integration

and retention of material to be learned.

Cues used by instructors to assist in integrating the auditory and visual information may

form a valuable attention directing device during this process.

Working Memory

Working memory has often been equated to consciousness by cognitive researchers (see

Cooper, Tindall-Ford et al, 2001). That is the conscious processing of information occurs

within this memory system.

Unfortunately working memory has been found to have limitations. These limitations

have in turn been shown to greatly affect the process of learning. The first limit is a limit

on information capacity. In a review of research at the time Miller (1956) found that for

simple one dimensional stimuli that our “span of absolute judgement” (essentially what

one can recall after being presented information) was equal to about 7 +- 2 elements.

The second limit is a limited duration for the holding of information. Research has shown

that this limit is approximately 30 seconds.

If a person cannot use rehearsal or if there is interference, for instance from a competing

task, then information in short term memory decays very quickly. For example counting

backwards while memorizing something will cause interference and thus an inability to

rehearse. This problem was first identified by Peterson & Peterson in 1959 where they

asked learners to count backwards in 3’s while attempting to memorise a ‘Trigram” of 3

consonents. Interestingly this was a test of the working memory auditory processor. The

so called Brown-Peterson task is still used as a standard test of working memory.

What happens if we want to hold more than 7 bits of information? In this case simple

memory expansion techniques such as chunking can be used. This expands working

memory. E.g. phone numbers are often broken into clumps, Miller (1956)

Maintenance rehearsal can also be used to extend the time that items can be held in

memory and will aide “memorization” that is encoding and transferal of the information

into long term memory.

Working memory capacity may also be expanded by presenting some information

visually and the remainder of the information auditorily than it is when all of the

information is presented through a single sense (either all visually or all auditorily).

In the Baddely and Hitch (1974) working memory model, working memory is seen as

having two processing systems: One for visual information and the other for auditory

information. This separation of the processing systems is similar to that in the sensory

memory. In addition a central executive controls the handling and processing of

information in working memory.

Baddeley's working memory model. Baddeley and Hitch, 1974, Cited in Bruning et al (2004)

Let us look at each component in turn:

Visuospatial sketchpad – processes and holds visual and spatial information brought in

from sensory visual memory. Once the information has been stored it will then undergo

processing in order for the processor of the information to reach the necessary outcome.

For example a pilot will make use of this memory for estimating distance from the

runway when landing. His eyes will perceive the runway, and in addition he will attend to

Central Executive

Visuospatial Sketchpad

Phonological Loop

the height and speed measurement from the controls and then his attention system/central

executive will direct this information to Working Memory where it will be processed to

determine an expected touchdown time, plus allow him/her to make judgements on

required speed, direction and height. Information will be pulled from Long Term

Memory to permit the calculation of the distance and time till touchdown. This form of 3

dimensional processing is the specialty area of the visuospatial sketchpad.

In face to face teaching the use of this memory subsystem will depend on the

information/subject area to be learned. Students may need to attend to a visual

presentation of information such as a 2 dimensional chart, a worked example, a graphic

all while integrating information from the audio stream where the lecturer is describing

the graphic image. This is a complex cognitive processing task. Any interference while

attempting this task will have an impact on the learning.

Phonological loop – holds and processes verbal information

Relating to our pilot example above our pilot may at the same time as watching the

approaching runway receive auditory information from instruments or a co-pilot in

regards to the state of the landing. This information will be held in the phonological loop

long enough for integration with the visuospatial information.

Likewise students listening to the lecturer will make use of the phonological loop. While

the loop will hold the lecturers words it may at times also be used for memorisation.

Memorisation in the phonological loop is usually carried out by deliberate repeated

repetition of key information. This repetition can take the form of deliberate repeating by

way of internal verbalization or by repeatedly revising material. Repeated internal

verbalization can usually not occur during live lectures due to the pace of the lectures and

the volume of information to be attended to. Hence the need for a revision process by

most students.

Students should be advised that revision i.e. repeatedly listening to the material (by

means of digital recording) OR listening to the presentation followed by reading of the

transcript will lead to improved retention of material.

Central executive – coordinates activities of Working Memory. Brings information in

from the sensory store and encodes it for storage into Long Term Memory (LTM). The

central executive is theorized to be involved in planning, focusing attention and switching

between tasks.

Two advanced mechanisms to overcome the limits or working memory are:

Schema acquisition, which allows us to work with ever larger more meaningful units of

information and, automation of procedural knowledge i.e. automation of skills.

The first mechanism, schema acquisition, deals primarily with processing and

understanding information; the second deals with the acquisition of skills. Each

mechanism helps us overcome the limits of working memory by allowing us to draw

down the contents of our long-term memories, which contain detailed schemas. Schemas

when loaded from long term memory into working memory are loaded in total and form

1 element. Hence schemas can greatly expand working memory by allowing the learner

to work with large chunks of information.

Long Term Memory

Long Term Memory (LTM) organizes and stores information received from working

memory. The relationship between long term and working memory can be defined in

terms of 2 main processes: Encoding and Retrieval.

Encoding - the process that controls movement of information from WM to LTM

Retrieval – the process that controls flow of information from LTM to WM

Control processes of Attention, maintenance rehearsal, encoding and retrieval govern

movement of information between stores. The central executive is theorized to be one of

the controlling structures involved in this processing.

General theoretical characteristics of LTM include that it is a more passive form of

storage than working memory, has unlimited capacity and information can be retained in

it almost indefinitely.

Defining Learning

For information to be stored in long term memory there needs to be a structure in which

to store the information. One theory is that Schemas are used to store information long

term. A schema can be defined as a mental representation of information, concepts or

anything stored in long term memory. Schemas are thought to be related to each other by

means of a hierarchical structure. This structure is also thought to aide in the retrieval of

information from LTM.

Learning in turn may be defined as the encoding (storage) of knowledge and/or skills into

long term memory in such a way that information can be recalled from the learner's long-

term memory and utilized in practice (applied as a skill).

Mayer (2005) also importantly outlines 3 other major requirements of effective learning:

Knowledge construction, Understanding, Meaningful learning.

Understanding and meaningful learning both relate to the ability of the user to transfer

learning to new situations. This is one of the ultimate measures of effective learning.

Schemas

First attributed to Bartlett in 1932, schemas are now a widely accepted model for long

term memory organization. Rumelhart (1977) defines schema as “generalized knowledge

about a sequence of events”. Rumelhart suggests that comprehension relies on finding a

schema in memory that accounts for perceived information. At the time of writing this

there was no extended modal model. In light of the modal model this equates to finding a

schema in long term memory that relates to the information passed into working memory

from short term memory.

Once this related schema is found in long term memory and retrieved into working

memory a comparison can occur and the information or concept can either be assimilated

into the existing schema if the two are related or accommodated as a new schema if the

new concept does not fit into the existing schema. For example:

Automation and the Novice vs Expert Debate

Recalling information and applying it on demand is called automation. Automation is the

ability to apply learned skills, information and processing models automatically. Ulric

Neisser (Quoted in Bruning et al, pg 25) first conceived of automaticity. Automation is

only believed to be obtained if there is repeated practice. While automaticity was

originally thought to be only related to skill acquisition recent research suggests that

automaticity can also be applied to cognitive skills. (Cooper and Sweller, 1987)

Automation of schemas along with schema acquisition are believed to form the

cornerstones of learning. An instructional designers role is to develop learning materials

that enhance the ability of the learner to acquire schemas and to automate these schemas.

Repeated practice and revision can lead to schema automation. (REFERENCE). Once

schemas have been acquired and automation is evident then a learner can be expected to

have graduated to expert status.

Repeated practice is often aided in the form of examples, worked examples, diagrams or

simulations. Additionally the use of followup tests and self tests aide in repeated practice

and revision.

If one has enough repeated practice and hold automated schemas then one may be

considered an expert. Holding advanced schemas allows experts to move in a forward

direction when solving problems as they can pull schemas from LTM and use this to

apply to the seek a solution. Novices instead must use a means-end strategy and work

using subgoals and often working from goal state back to initial state. This is an

inefficient strategy.

Cognitive theorists postulate that the only two distinguishing features of expertise in a

subject area are:

1 . the schemas that experts hold, and

2 . the high level of automation that experts demonstrate.

(REFERENCE)

Recommendation for iLecture: Ensure students understand how learning can occur using

these new mediums. A student understanding the metacognitive processes involved in

learning and how simple learning can be will be at an advantage to a student who does

not understand the metacognitive learning processes.

Effects on Learning

As has been shown learning occurs in the memory systems of the learner. If these

memory systems are affected in any way by the type of material to be learned or its

delivery there may be negative consequences for the learning processes occurring in the

brain. One of the major effects that can occur is cognitive load.

Cognitive Load

Learners studying any new material are constantly processing information and integrating

it into schemas using the mechanisms outlined earlier in this paper. This information can

often be 1. poorly organized 2. in too large a quantity or simply 3. irrelevant. 4. A

duplicate of information already provided. Cognitive researchers have defined several

theories outlining more exactly how such problems with information and learning

materials can impact on learners. One of these theories is cognitive load theory.

Cognitive load theory promoted by Sweller (1999) is based on the limited cognitive

capacity available in working memory during the learning process. If any of the above

instructional design issues are present then a learners working memory may become

overloaded. This overload is defined according to Sweller (date) to be caused by the level

of interactivity between elements in working memory. High levels of interactivity equal

high levels of cognitive load. As defined by Mein (2005) cognitive load is “the amount of

effort expended by a learner when he or she is participating in instruction”.

Intrinsic, Extrinsic, Germane cognitive load

There are 3 types of cognitive load that can be placed on learners:

Intrinsic

Extraneous

Germaine

These loads all relate to the effect learning on working memory

Intrinsic cognitive load comes from the complexity of the material to be learned. Some

material is easy to comprehend, some is more complex.

Word pairs learned as part of learning vocabulary in a foreign language is easy compared

with learning word order in a foreign language.

Element Interactivity

One of the things that leads to intrinsic cognitive load is element interactivity. For

example when learning vocabulary it is easy as we can learn elements/words individually

and can relate these to existing words in our native language but when learning word

order many elements/words must be thought of together and manipulated in working

memory to determine correct order. Combined with needing to consider other rules of the

language simultaneously our working memory is stretched.

Working memory is known to be limited to about 7 elements (+- 2) Miller (1956). This is

not a particularly large number and in a large number of learning situations the number of

elements and their combinations can easily exceed this number and thus working memory

is overloaded.

Extraneous cognitive load is any extra unnecessary information or material presented to

the learner while they are trying to solve a problem/task/memorisation. Extraneous load

often comes from the way the learning material has been designed. E.g. it may contain

duplicate/repeated information or it may have some material reworded to hopefully offer

an alternative explanation. This may not offer anything new to the learner and may

simply form another element (or set of elements) requiring comparison with the existing

elements in working memory.

Germaine cognitive load is the load on working memory from the material that also

needs to be learned in order to solve a problem/complete an exercise. A learner may need

a definition of a term, a suitable formulae pertinent to a task etc. It is related to “time on

task”.

In order for a learner to avoid cognitive overload it is important that the combined

intrinsic and extraneous cognitive load do not exceed the capacity of working memory –

hence the combination of these two loads must be minimised. Also it is important that the

germane load is maximised to ensure cognitive resources are only used on the necessary

tasks to aide schema acquisition.

Cognitive load for an expert will be less as they pull schema and information/solutions

from Long Term Memory (LTM) that novices do not have. This allows information to be

interpreted in larger chunks in working memory.

Anyone designing instruction needs to minimise extraneous cognitive load and maximise

germane cognitive load. Nothing can be done in regard to intrinsic cognitive load as this

load is inherent to the complexity of the material.

The Split Attention Effect

Sweller (1999) outlines a number of examples where presentation of related information

used during the learning process is unnecessarily separated. For example in worked

examples for kinematics problems an example is used to illustrate how separating out

variables from the problem statement produces increased cognitive load.

Figure 4 Split-attention and integrated kinematics problems

a.

A car moving from rest reaches a speed of 20 m/s after 10 seconds. What is the

acceleration of the car?

u = 0 m/s

v = 20 m/s

t = 10 s

v = u =at

a = (v-u)/t

a = (20-0)/10

a = 2 m/s

b.

A car moving from rest (u) reaches a speed of 20 m/s (v) after 10 seconds (t): [v=u+at,

a=(v-u)/t=(20-0)/10=2 m/s]. What is the acceleration of the car?

Taken from Sweller (1999, pg. 51)

Figure 4 illustrates the kinds of problems that can require learners to mentally integrate

disparate information. In this example the equations and variables being separate can

only be understood by mentally moving back and forth between the problem and the

variables. Only in this way can the variables be associated with the problem statement.

This causes increased cognitive load. The reworked example at b. illustrates an integrated

approach which is much easier to handle cognitively as no mental gymnastics are

required.

Research has shown that experts do not have a problem with the split attention effect

(Kalyuga et. al, 2003). Thus this effect is only applicable to novices. Of course at the start

of any new course a large number of the student body are likely to fit into the category of

novice regardless of level of the course i.e. year 1 verse post grad.

Integration of All 4 theories and research areas to form a meta theory

Conclusion

References

Bruning, H. Roger, Schraw, Norby, M. Monica, J. Gregory, Ronning, R. Royce, (1990). Cognitive Psychology aand Instruction. Prentice Hall, Inc.

Cooper, G. Research into Cognitive Load Theory and Instructional Design at UNSW,

1998 Web: http://education.arts.unsw.edu.au/CLT_NET_Aug_97.HTML

Kalyuga, S. (2000) When using sound with a text or picture is not beneficial for learning

Australian Journal of Educational Technology, 16(2), 161-172.

Kalyuga, S., Ayres, P., Chandler, P., & Sweller, J. (2003). Expertise reversal effect.

Educational Psychologist, 38, 23-31.

Mayer, R. E. Multimedia Learning; PowerPoint presentation

Miller G. A. The Magical Number Seven, Plus or Minus Two: Some Limits on Our

Capacity for Processing Information Web: http://www.well.com/user/smalin/miller.html

Rumelhart, D.E., and Ortony, A. (1977). The representation of knowledge in memory. In

R.C. Anderson, R.J. Spiro and W.E. Montague (Eds.), Schooling and the Acquisition of

Knowledge. (pp 99-136). Hillsdale, N.J.: Erlbaum.

Sweller, J. (1999). Instructional Design in Technical Areas, ACER Press

http://www.well.com/user/smalin/miller.html

an integrative review of cognitive learning theories

Documents