july psychological bulletin - rutgers...

13
VOL. 80, No. 1 Psychological JULY 1973 Bulletin Copyright © 1973 by the American Psychological Association, Inc. WHAT THE MIND'S EYE TELLS THE MIND'S BRAIN: A CRITIQUE OF MENTAL IMAGERY 1 ZENON W. PYLYSHYN2 University of Western Ontario This paper presents a critique of contemporary research which uses the notion of a mental image as a theoretical construct to describe one form of memory representation. It is argued that an adequate characterization of "what we know" requires that we posit abstract mental structures to which we do not have conscious access and which are essentially conceptual and propositional, rather than sensory or pictorial, in nature. Such representations are more accurately referred to as symbolic descriptions than as images in the usual sense. Implications of using an imagery vocabulary are examined, and it is argued that the picture metaphor underlying recent theoretical discussions is seriously misleading-especially as it suggests that the image is an entity to be perceived. The relative merits of several alternative modes of representation (propositions, data structures, and procedures) are discussed. The final section is a more speculative discussion of the nature of the representation which may be involved when people "use" visual images. Cognitive psychology is concerned with two types of questions: What do we know? and How do we acquire and use this knowledge? The first type of question, to which this paper is primarily addressed, concerns itself with what might be called the problem of cognitive representation. It attempts to an- swer the question What is stored? by de- scribing the form in which our knowledge or model of the world is represented in the mind. The second type of question attempts to deal more directly with certain limited-capacity psychological processes which create and ma- nipulate the representations and generate 1 Several of the arguments appearing in this paper were first developed in the course of discussions held with Marvin Minsky and Michael Arbib. Also, the careful critical reading given an earlier version of this paper by Allan Paivio has hopefully led to a more careful and balanced presentation. I am grate- ful to these people for their help but am not so optimistic as to expect that they will agree with all the arguments appearing in the final draft. 2 Requests for reprints should be sent to Zenon W. Pylyshyn, Department of Psychology, University of Western Ontario, London N6A 3K7, Ontario, Canada. 1 appropriate behavior from them (such as questions of limited attention and memory and "psychological cost" factors limiting performance) . The problem of cognitive representation has been approached by a wide variety of paths in the last half century. It is not the purpose of this paper to review this spectrum of alternative 'representations. We concern ourselves with one particular proposal-that cognitive representations take the form of mental images-s-which has recently exploded into fashion and with a number of alterna- tives which derive primarily from computer- simulation work. MENTAL IMAGES After almost 50 years of dedicated avoid- ance, mental imagery appears to be once again at the center of interest in many areas of psychology (Arnheim, 1969; Bower, 1972; Bugelski, 1970; Hebb, 1968; Holt, 1964; Horowitz, 1970; Paivio, 1969, 1971; Reese (1970); Richardson, 1969; Segal, 1971; Sheehan, 1972), although it is not without its

Upload: others

Post on 24-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

VOL. 80, No. 1

PsychologicalJULY 1973

BulletinCopyright © 1973 by the American Psychological Association, Inc.

WHAT THE MIND'S EYE TELLS THE MIND'S BRAIN:

A CRITIQUE OF MENTAL IMAGERY 1

ZENON W. PYLYSHYN2

University of Western Ontario

This paper presents a critique of contemporary research which uses the notionof a mental image as a theoretical construct to describe one form of memoryrepresentation. It is argued that an adequate characterization of "what weknow" requires that we posit abstract mental structures to which we do nothave conscious access and which are essentially conceptual and propositional,rather than sensory or pictorial, in nature. Such representations are moreaccurately referred to as symbolic descriptions than as images in the usualsense. Implications of using an imagery vocabulary are examined, and it isargued that the picture metaphor underlying recent theoretical discussions isseriously misleading-especially as it suggests that the image is an entity tobe perceived. The relative merits of several alternative modes of representation(propositions, data structures, and procedures) are discussed. The final sectionis a more speculative discussion of the nature of the representation which maybe involved when people "use" visual images.

Cognitive psychology is concerned with twotypes of questions: What do we know? andHow do we acquire and use this knowledge?The first type of question, to which thispaper is primarily addressed, concerns itselfwith what might be called the problem ofcognitive representation. It attempts to an­swer the question What is stored? by de­scribing the form in which our knowledge ormodel of the world is represented in the mind.The second type of question attempts to dealmore directly with certain limited-capacitypsychological processes which create and ma­nipulate the representations and generate

1 Several of the arguments appearing in this paperwere first developed in the course of discussions heldwith Marvin Minsky and Michael Arbib. Also, thecareful critical reading given an earlier version ofthis paper by Allan Paivio has hopefully led to amore careful and balanced presentation. I am grate­ful to these people for their help but am not sooptimistic as to expect that they will agree with allthe arguments appearing in the final draft.

2 Requests for reprints should be sent to ZenonW. Pylyshyn, Department of Psychology, Universityof Western Ontario, London N6A 3K7, Ontario,Canada.

1

appropriate behavior from them (such asquestions of limited attention and memoryand "psychological cost" factors limitingperformance) .

The problem of cognitive representationhas been approached by a wide variety ofpaths in the last half century. It is not thepurpose of this paper to review this spectrumof alternative 'representations. We concernourselves with one particular proposal-thatcognitive representations take the form ofmental images-s-which has recently explodedinto fashion and with a number of alterna­tives which derive primarily from computer­simulation work.

MENTAL IMAGES

After almost 50 years of dedicated avoid­ance, mental imagery appears to be onceagain at the center of interest in many areasof psychology (Arnheim, 1969; Bower, 1972;Bugelski, 1970; Hebb, 1968; Holt, 1964;Horowitz, 1970; Paivio, 1969, 1971; Reese(1970); Richardson, 1969; Segal, 1971;Sheehan, 1972), although it is not without its

Page 2: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

2 ZENON W. PYLYSHYN

detractors (Brainerd, 1971; Brown, 1958;Gibson, 1966; Neisser, 1972). It has returnednot only as a phenomenon to be investigatedbut as an explanatory construct in cognitivepsychology. It is of some interest, therefore, toask whether anything new has been learnedabout mental images in the last decade andwhether the critiques widespread early in thecentury have been satisfaotorily countered.

Any analysis in the nature and role ofimagery is fraught with difficulty. The con­cept itself proves to be difficult to pin down.Is a visual image like some conceivable pic­ture? If not, then in what way must it differ?If it is like a picture in some ways, then mustit always be a picture of some specific in­stance,or can it be generic (if such a notionis intelligible)? Could it, for example, repre­sent abstract relations or must the relationsin the image be of an iconic or geometricvariety? Is an entire image available atonoe~as a spatially parellel static picture­or do parts of it come and go? If parts canbe added and .deleted at will, must such partsbe pictorial segments (e.g., geometrically de­finable pieces or sensory attributes such ascolor) or can they be more abstract aspects?Could one, for example, conceive of twoimages of the identical chessboard with oneimage containing the relation "is attackedby" and the other not containing it? If so,then in what sense could such a relation besaid to be "in the image"? Must images insome important sense be modality specific, asimplied by such phrases as visual image, audi­tory image, etc.? And finally, must imagesalways be conscious? Can one, for example,make intelligible the notion of an unconsciousvisual image?

The tumultuous history of the concept ofimagery in both philosophy and psychologyattests to the difficulties which such questionshave raised in the minds of centuries ofscholars. While contemporary psychologistshave attempted to narrow the scope of theconcept by operational definitions and multi­ple empirical and theoretical underpinnings,it is not dear that they have resolved themajor conceptual ambiguities, circularities,and Rylean "category mistakes" which haveplagued the notion in the past. In this paperwe do not attempt a philosophical analysis

of the concept since that would take us toofar afield from our primary objective, whichis to analyze the role of imagery as an ex­planatory construct in cognitive psychology.In attempting this, however, it is impossibleto avoid pointing out some of the conceptualproblems implicit in contemporary uses of theterm "imagery" to explain certain findingsin psychology.

For the sake of avoiding any misinterpreta­tions of the remarks in the remainder of thispaper, it should be stressed that the existenceof the experience of images cannot be ques­tioned. Imagery is a pervasive form of experi­ence and is clearly of utmost, ~mportancetohumans. We cannot speak of consciousnesswithout, at the same time, implicating theexistence of images. Such experiences are notin question here. Nor, in fact, is the statusof imagery either as object of study (i.e., asdependent variable) or as scientific evidencebeing challenged. There are many areas wherethe alternative to cautious acceptance of re­ports of imagery is the rejection of a wholearea of inquiry (e.g., Hebb, 1960; Holt,1964). Furthermore, the extensive experi­mental investigations of imagery in the lastdecade (exhaustively surveyed in Pavio, 197<1)have been of unquestionable value in break­ing through the earlier oppressive structureson what phenomena ought to be studied. Theempirical regularities demonstrated are ofhigh reliability and wide interest, both froma scientific and practical point of view. Noneof these empirical results are questioned. Themain question that is raised is whether theconcept of image can be used as a primitiveexplanatory construct (i.e., one not requiringfurther reduction) in psychological theoriesof cognition. A second and equally importantissue is whether the commonsense under­standing of this term contains misleading im­plications which carryover undetected intopsychological theories.

Information and Experience

While most psychologists are willing toconcede that not all important psychologicalprocesses and structures, are available to' con­scious inspection, it is not generally recog­nized that the converse may also hold: thatwhat is available to conscious inspection may

Page 3: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

A CRITIQUE OF MENTAL IMAGERY 3

not be what plays the important causal rolein psychological processes. This is not toimply that when a subject says that he doessuch-and-such a mental operation that hemay be dishonest in his reports nor even thathe himself is misled as to what he "actually"does. This is not the issue. If someone asksme how many windows there are in my houseand then asks me to report how I went aboutanswering the question, this report (subjectto the usual methodological precautions) maybe taken as an accurate report of what Iexperienced doing. The only trouble is thatI must give such a report in the only lan­guage I have available for describing myawarenesses. A description in such a languagemay be entirely inadequate in meeting whatChomsky (1964) has referred to as "explana­tory adequacy." The description may evenfail to be mechanistic insofar as it may useterms such as "want," "guess," "notice," or"mind's eye," which, while they may faith­fully reflect ones experience of doing. the task,are inadequate as primitive constructs sincethey themselves cry out for a reduction tomechanistic terms.

Even if the description does not containinadmissible terms, there is no guarantee thatit will make sense as an explanatory scien­tific theory. An explanatory theory must meetdifferent criteria of adequacy than would bedemanded of an informal descriptive account.It must first be free of conceptual difficultiesand internal contradictions. Then it must beshown to be capable of providing a mecha­nistic explanation for the widest possibledomain of empirical evidence in a mannerwhich reveals the most general principles in­volved. An explanatory account must ulti­mately appeal to universal mechanistic prin­ciples. Just because we know that we usecertain mnemonic strategies, or that we saycertain things to ourselves, or that we "see"certain objects in our "mind's eye" or "hear"ourselves rehearsing a series of numbers, etc.,we cannot assume that the contents of suchsubjective knowledge can be identified withthe kind of information-processing procedureswhich will go into an explanatory theory.

Perhaps these remarks can be made clearerif we sharply distinguish two senses of cogni­tive verbs like "see," "hear," or "image." In

one sense, they refer to information-process­ing functions-to the reception of informationthrough the visual or auditory systems andthe subsequent transformation or encoding ofthis information in interaction with storedinformation. In the case of imagery it mayperhaps refer to the activation, retrieval, orreorganization of such information. Anothervery different sense of these terms is implied,however, when they are used to designate theconscious experience which may accompanysuch functions, that is, the conscious experi­ence of seeing, hearing, or imaging (or exam­ining an image in the "mind's eye"). In thissense of the word "image" one might claimthat the image can be examined throughintrospection. Clearly, however, the informa­tion-processing function itself cannot. It isimportant to inquire whether the experienceof imaging can reveal important properties ofthe information-processing functions or of themental representation of information on whichthese processes operate. But we must notassume in advance that such observation willreveal the content of the mental representa­tion. Not only does such observation presentserious methodological hazards, it is not primafacie an observation of the functional repre­sentation (i.e., one that figures in the humaninformation-processing function).

In discussing the question of introspectiveknowledge, Natsoulas ('1970) warns,

It may turn out that, even though a useful informa­tional relation (concomitant variation) exists betweenthe contents of our awareness and the properties ofmental episodes, they do not have the intrinsicproperties which we take them to have [po 91].

The recent literature on imagery aboundsin examples which reveal that the investigatortacitly assumes that what is funotional incognition is available to introspection. Con­sider, for example, the widely held view (theso-called dual-code model) that the form ofmental representations is either verbal orimaginal. This partition between two concretemodes has its roots in the persuasive fact thatthe only way in which we clearly experienceour memory of cognitive events is throughsome form of sensory-motor image (includingarticulatory and acoustical images of words).Thus, for example, in a revival of a positionassociated with Berkeley and Hume, Bugelski

Page 4: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

4 ZENON W.PYLYSHYN

(1970) questions "whether there is such athing as an abstract thought or abstraction[po 1006]." The basis of his doubts are hisexperiments, using the Kent-Rosanoff WordAssociation Test, in which he finds:

If you say FLOWER, a categorical term, the sub­jects think of daisies or roses, and highly specificdaisies or roses .... If you say DEMOCRACY,they report a variety of imagery, practically none ofwhich refers to governmental operations. Govern­ment by the people becomes an image of a crowdat a political rally [p. 1006J.

Drawing conclusions about the nature ofcognitive representations from reports of ex­periences evoked by words may appear some­what far-fetched (after all, what else coulda subject report having experienced-otherthan images of objects or of other words?)until we examine the context in which Bugel­ski and his colleagues are working. The thrustof Bugelski's paper is to show the inadequacyof theories of learning and memory whichrely exclusively on postulating associationsamong words. From this excess he adoptsanother equally untena:ble position (which isnowhere stated explicitly): tha:t all learningand memory-and indeed all of cognition­takes place exdusively through the mediumof either words or images. In fact, it appearsthat most modern psychologists working onimagery and learning have succumbed to theassumption that there are no forms of mentalrepresentation other than words and images.Thus Bugelski (1970) argues that the use ofyoung deaf subjects who "have no language"provides an ideal test for the existence ofimagery. He asserts, "If they truly have nospeech or verbal capacity and can lea:rn cer­tain kinds of materials, for example, picturepaired-associates, the conclusion that imagerywas. being used seems logically determined[po 1004]." It is logically determined, ofcourse, only if we accept that images aridwords exhaust the available forms of mentalrepresentation.

Similarly, Paivio (1969) pits his defense ofimagery against the word-assooiation ap­proach, arguing that ". . . one can respondverbally to pictures as well as to words andso, by analogy, one's verbal response couldjust as logically be mediated by a 'mental

picture' as by 'mental words' [po 242]." Theparallel does indeed hold: whatever argu­ments may be marshalled in favor of mentalwords as mediators can be used equally wellto support mental pictures. Thus addingimages to the repertoire of mediators repre­sents a logical extension of mediational ac­counts. What is unsatisfactory about thisextension, however, is that no considerationis given to the possibility that cognition maybe "mediated" by something quite differentfrom either pictures or words, different infact from anything which can be observedeither from within or from without.

In spite of the prevalence of such views,a number of theorists, particularly thoseworking in the information-processing tradi­tion, have found it necessary to postulateforms of representation which differ radicallyfrom the form in which such information ispresented to the senses or the way in whichit .is subjectively experienced. For example,many models of attention (e.g., Triesman,1964) and memory (e.g., Norman, 1968) aswell as all analysis-by-synthesis models (e.g.,Neisser, 1967) require that representationsdiffering from both the sensory pattern andthe name or verbal description be available atsome stage in the process. In the same spirit,the author argued some years ago (Pylyshyn,1963) against the view that representationsin short-term memory went directly from avisual to an auditory form as the informationwas "read" off the image. Instead, a codingcontinuum was proposed. Sperling (1963)also found evidence against the two-stageview. He showed that while information suf­ficient to identify a letter could be. extractedfrom a display in about 10 milliseconds, ittook over 300 milliseconds to name a letter.Sperling developed a model in which informa­tion between these two stages (i.e., visual andverbal) was held in a "recognition buffermemory." Not only is the information in thisbuffer neither in a visual nor an auditoryform, it is not in any form which could bemade conscious. Sperling (1967) comments,

This makes it indeed a mysterious component; itcannot be observed directly from within or fromwithout! However, this inaccessibility should notsurprise us. It is axiomatic that in any system whichexamines itself there ultimately must be some

Page 5: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

A CRITIQUE OF MENTAL IMAGERY 5

part of the mechanism which is inaccessible toexamination from within Ip. 292].

This is indeed true; only perhaps the surprise .ought to be that any of it should be acces­sible from within. One scarcely expects brainprocesses to be available to introspectiveexamination, so why should one expectfunctional information to be thus accessible?

But the need to postulate a more abstractrepresentation-s-one which resembles neitherpictures nor words and is not accessibleto subjective experience-is unavoidable. Aslong as we recognize that people can go frommental pictures to mental words or viceversa, we are forced to conclude that theremust be a representation (which is more ab­stract and not available to conscious experi­ence) which encompasses both. There must,in other words, be some common format orinterlingua. The problem is dramatized if. wepersist in using the common but utterly mis­leading metaphor of the "mind's eye," forthen we have to account for the form of rep­resentation in the "mind's eye's mind" whichclearly is not accessible to introspection.

Any attempt to bypass this difficulty bypositing a direct associative link between amental picture and a mental word meetswith other difficulties. There are an infinitenumber of pictures to which a particular wordapplies. For example, there are an infinitenumber of rectangles of various shapes, sizes,colors, orientations, etc. When the mentalword "rectangle" is elicited by the mentalpicture of a rectangle, it cannot be by virtueof an associative link between the two, sincethis would require that we postulate an in­finite number of such links (one for eachpossible picture). The mental word "rec­tangle" is at best a response to what all thepictures of rectangles have in common,namely, their "rectangleness." The problemarises 'because cognition must deal withpattern types and not tokens, and there is nolimit to the variety of tokens correspondingto each type. Thus the relationship cannotbe described by a finite list of associatedpicture-word tokens.

Propositions and Appearances

Although Bower (1972) appears to recog­nize the existence of such problems to the

extent of a passing admission that peopledo have concepts-and that these are differentfrom either words or pictures-he neverthe­less proceeds to develop an argument for adual-code model of memory of the type earlieradvocated by Paivio (1969). His argumentsare of interest because they shed some lighton the nature of some of the conceptual dif­ficulties to which such models give rise. Inhis account, Bower (1972) make a distinc­tion between memory for appearances (howthings look) and memory for we what callfacts (what things resemble). From this heargues,

This distinction between how something lookedand what it looked like runs parallel to the distinc­tion between cognitive memories, namely, imagesversus propositional memory. That is, we rememberappearances in imagery, and we also rememberpropositions . . . the difference . . . is the same asthe difference between a sighted versus a blind per­son's knowledge of the visual world. In the auditorydomain, it is the difference between my knowledge ofhow an orchestra symphony sounds and how I mighttry to describe it to a deaf person [po 52J.

Not only is this dichotomy not exhaustive,it also gives a misleading account of what itmeans to "know" such things as "how theorchestra sounds." To make this clear, weneed to draw some further distinctions ..First,we must distinguish between. the subjectiveexperience of recalling the sound and thefunctional information which enables us to,say, make judgments of relative similarity oftwo instruments. The term appearance issurely meant to refer only to the former. Anappropriately programmed computer could,no doubt, be made to produce similarityjudgments among sounds but we might stillbe reluctant to describe it as "experiencing"the sound and therefore of being able torecall its appearance. It would, in general,even be unreasonable to require the computerto store the equivalent of a sound-recordingtrace of the stimulus, since this would soontax the storage capacity of any machine. Soeven in this nonexperiential sense of appear­ance, it is still the case that the computerwould not have access to the original patternof stimulation when it made its similarityjudgment. Thus there is no reasonablesense in which the computer would use theappearance of the original sound to make its

Page 6: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

6 ZENON W. PYLYSHYN

judgment. While a person might experiencethe appearance of the sound, it would simi­larly be unreasonable to suppose that he canmake the similarity judgment because he hasstored the original pattern of sensory stimula­tion (we shall return to this point later.)

Second, when Bower and others speak ofnonimaginal memory as propositional, theyimply the storage of actual utterances. Butone must be careful not to equate a proposi­tion with a string of words. A proposition iswhat a string of words may assert. A proposi­tion is either true or false; the string ofwords is neither. A proposition may beasserted by any number of strings of words,in any language and in any modality. Fur­thermore, in the sense in which we use itwhen we speak of "proposltional knowledge,"it may involve no words of any kind. Thuswhen I look at the table in front of me, I seethat there is a vase on it. I do not "see"patches of light or only an array of objects.My knowledge is enriched by (among other'things) the proposition asserted by a sentencesuch as "The vase is on the table," eventhough I did not utter (audibly or otherwise)this or any other sentence. As Hanson (1958)argued "Knowledge of the world is not amontage of sticks, stones, color patches andnoises, but a system of propositions [po 26]."When we use the word "see," we refer to abridge between a pattern of sensory stimula­tion and knowledge which is propositional.This is not to deny that there are suchthings as appearances, only that if they havea role to play in cognition, the nature of sucha role is at present a complete mystery. Wecannot even talk about appearances without,in fact, talking about the propositional con­tent of the appearances. And as Wittgensteinwisely reminded us, in such circumstances itis best to remain silent on the subject.

Failure to grasp the difference between theappearance of visual images' and knowledgeleads to various logical confusions. For in­stance, it leads Bugelski to conclude thatthe only reason deaf children should recallvisual patterns is because they used visualimages which, judging by his use of the term,

.means that they could recreate the appear­ance of the pattern by reactivating somethinglike the original sensory stimulation. By such

an account each time the image is recalled,it must be "seen" anew since deaf childrenpresumably do not possess the appropriatecode (I,e., an auditory image of words) withwhich to represent the proposition as a sub­vocal sentence. If this were so, deaf children(who, incidentally, very likely have a lan­guage of some kind) would be left wallowingin appearances without a single item ofknowledge to which the ascription "true" or"false" was even applicable!

There are even more serious difficultiesinvolved in drawing the distinction betweenappearances and propositions if we carry thisdichotomy through to the area of thinking.The role of experienced images (i.e., appear­ances) in thinking is by no means clear sinceeven if we make the assumption that thecontents of our experiences reveal theoreti­cally useful psychological processes, it stillremains true that very little (if any) of thethinking is carried by such processes. Thus,as Humphrey (1951) points out, while theprocess of thinking ". . . may involve sense­resembling processes of a particular modal­ity . . . this is the cart, not the horse. Theprimary 'work' when one thinks a proposi­tion such as 'Russia is East of Britain' isimageless ... [po 106]." Natsoulas (1970)concurs with this view, arguing that eventhough thinking can involve a succession ofimaginal awarenesses, HIn undergoing theimage, however, one does not have thethought. One has it in noticing somethingabout what is imaginally presented. Suchnoticings are, or course, propositional . . .[po 99J."

The same could also be said about therole of imaging in other cognitive tasks,including the learning of paired associates.For suppose one maintains that a subjectlearns a pair such as boy-play by forming animage of a small boy throwing a ball. Laterwhen the stimulus word boy is presented, thesame image is retrieved. By examining thisimage, so it is argued,the subject is able toproduce the correct response play. Theprob­lem remains, however, to explain why thesubject in this case chooses to respond playand not throw or ball or catch or any of anunlimited number of words equally appropri­ate to that image. Presumably it is because

Page 7: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

A CRITIQUE OF MENTAL IMAGERY 7

he remembers more than is contained in theimage. In fact, this shows that the bulk ofthe work of learning and recalling the pairof words is carried out by a process to whichwe do not have conscious access, but whichmay, in some unspecified manner, make useof the propositional information that boy andplay are related by predication.

Before leaving this discussion of knowledgeas propositional, we must pause to reempha­size the difference among pictures, sentences,and propositions. Both pictures and sentencesmust be interpreted before they become con­ceptual contents. This is because there arean indefinite number of both pictures and ofsentences which are cognitively equivalent.This is not true of propositions as logiciansuse this term. For example, the philosopherFrege (1960) in his seminal work on predi­cate logic (first published in 1879) cites theexample of two sentences which are para­phrases of one another and comments,

... I call the part of the content that is the samein both the conceptual content. Only this has signifi­cance for our symbolic language; we need thereforemake no distinction between propositions that havethe same conceptual content lp. 3],

Thus propositions are to be found in the deepstructure of language and not in its surfaceform. But this is still not sufficiently abstractfor our purposes since it might be taken toimply that each proposition is expressable bysome sentence in a natural language. This isnot, however, a necessary condition for ouruse of the term. We claim that it is stilluseful to think of propositional knowledgeeven when the concepts and predicates insuch propositions do not correspond to avail­able words in our vocabulary. Such conceptsand predicates may be perceptually well de­fined without having any explicit natural lan­guage label. Thus we may have a conceptcorresponding to the equivalence class ofcertain sounds or visual patterns without anexplicit verbal label for it. Such a view impliesthat we can have mental concepts or ways ofabstracting from our sense data which arebeyond the reach of our current stock ofwor:ds, but for which we could develop avocabulary if communicating such conceptsbecame important (e.g., for a professionalmusician, painter, or wine taster). This is not

an unreasonable position to hold in view ofthe fact that conceptual categories are neces­sary not only for communicating but also foracting on the environment. Thus perceptualor motor events which are functionallyequivalent with respect to indicating or lead­ing to functionally similar changes in theorganism's environment might become repre­sented as unique nonverbal mental concepts(for a discussion of such an action-orientedview, see Arbib, 1972). Such a view is inagreement with Newell and Simon's (1972)position that postulating a single set of inter­nal symbol structures provides the mostparsimonious account for both thought andthe deep structure of language. It also receivessupport from evidence (e.g., Macnamara,1972) that children develop conceptual orsemantic structures prior to learning therelated linguistic signs.

In spite of the inexpressibility (for a par­ticular individual at a particular time) ofpropositions containing such nonverbal con­cepts, there are nevertheless some good rea­sons for still referring to such knowledge aspropositional or descriptive. Just as cognitionrequires propositions which stand in a type­token relation to sentences, so also does itrequire something which stands in a type­token relation to pictures or sensory patterns.This something is best characterized as adescriptive symbol structure containing per­ceptual concepts and relations, but having theabstract qualities of propositions rather thanthe partioular qualities of pictorial images.Furthermore, to refer to a representationarising from sensory stimulation as beingpropositional, as we have been advocating,is to imply (a) that it does not correspondto a raw sensory pattern but, rather, is al­ready highly abstracted and interpreted, (b)that it is not different in principle from. thekind of knowledge asserted by a sentence, orpotentially assertable by some sentence, (c)that it depends on the classification of sensoryevents into a finite set of concepts and rela­tions, so that what we know about some eventor object is formally equivalent to (Le., canbe reduced to) a. finite (and, in fact, rela­tively small) number of logically independentdescriptive propositions. The above implica­tions, as we shall see in the next section, are

Page 8: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

8 ZENON W. PYLYSHYN

desirable and yet difficult (if not impossible)to convey using the picture vocabulary of'the imagery literature.

Picture Metaphor

In this section we try to make explicitsome of the implications of using the imageryvocabulary. To begin, oonsider what theterms "image" or "imagery" mean to mostpsychologists who write on the subject. Somewriters have suggested that images are relatedto conditioned sensations (Staats, 1968), to"indireot reactivations of former sensoryor perceptual activity [Bugelski, 1970, p.1002]," or that they are "a faint subjectiverepresentation of a sensation or perceptionwithout an adequate sensory input [Holt,1964, p. 255]," or "the occurrence of per­ceptual processes in the absence of stimula­tion which normally gives rise to perception[Hebb, 1966, p. 41] ," or imagery is definedas "the ability of a subject to generate orsynthesize a sensory-like datum in the absenceof physical stimulation [Weber & Bach, 1969,p.199!]."

Such definitions, however, are not used di­rectly In the empirical researoh.As Paivio(1969) rightly points out, "Both images andverbal processes are operationally defined andthe concern. is with their functional signifi­cance ... [po 243]." However, the impor­tance of the informal notions of imagery inpsychological theories should not be under­estimated. What makes it possible to give aconsistent and systematic .interpretation ofthe empirical findings is not the individualpredictions (e.g., high-imagery sentences arerecalled more easily than low-imagery sen­tences) nor the operationally defined terms(imagery as the rating assigned to a stimu­Ius), but the highly persuasive intuitive no­tions of what images are, what causal effectsthey may exert; and how we can manipulatethem in our mind;

This can be seen clearly if we consider thatvarious different experimental paradigms re­quire different operational definitions of theconstruct image. Thus in research in which theeffects of various mnemonic strategies arecompared (e.g., instructions to use images asopposed to other methods), there is one defini­tion of image (image 1). In experiments in-

vestigating the influence of different stimulusattributes (e.g., high- versus low-imagerywords) there is another operational definition(image 2). Other research procedures involvethe adoption of still other definitions of thetheoretical construct image. The identity ofthese various constructs (image 1 = image 2 =... ) does not, however, follow from any of theoperational definitions nor from the results ofthe experiments (although, of course, similarpatterns among empirical correlates of thevarious manipulations of imagery gives onesome grounds for believing that they arerelated) .

The unity of these constructs, and conse­quently the coherence of the notion ofimagery rests on a meta theoretical assump­tion, This assumption, in turn, rests on thepersuasiveness of subjective experience andon the ordinary informal meaning of theword image. In this context the term reliesheavily on a picture metaphor. The wholevocabulary of imagery uses a language ap­propriate for describing pictures and the pro­cessof perceiving pictures. We speak of clar­ity and vividness of images, of scanningimages, of seeing new patterns in images, andof naming objects or properties depicted inimages.

There is, of course, nothing wrong Withusing metaphors: Virtually all theoreticalideas in science derive hom some relativelyfamiliar metaphor. However, not all meta­phors are equally appropriate and some mayeven be harmful by discouraging certain kindsof fundamental issues being raised and bycarrying too many misleading implications.

For example, one misleading implication in­volved in using the imagery vocabulary isthat what we retrieve from memory when weimage, like what we receive from our sensorysystems, is some sort of undifferentiated (orat least not fully interpreted) signal or pat­tern, a major part of which. (although per­haps not all) is simultaneously available. Thispattern is subsequently scanned perceptuallyin order to obtain meaningful informationre­garding the presence of objects, attributes,relations, etc. This "image retrieval beforeperception" view is phenomenally very power­ful and is implicit in the everyday sense ofthe word "image." It is also present in all

Page 9: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

A CRITIQUE OF MENTAL IMAGERY 9

the illustrative examples used by psycholo­gists to persuade their colleagues of the realityof images. For example, in discussing the useof the "one-bun" rhyming mnemonic usedby his subjects, Bugelski (1968) states,

The most convincing evidence regarding imagerycomes from the reports of many Ss who expressedthe belief that they did not know some or any ofthe words when either the original learning or recalltest began. They would then mumble the numeral,state the rhyme word, and then report "oh, yes,hen-ski." They asserted that the "little hen on skis"had to appear before they could report "ski" lp,332].

Atwood (1971) is quite right when hestates, "The most elementary question whichcan be asked about mnemonic visualizationis the following: does the mnemonic imageactually involve the visual system [po 291]?"Using a method of selective interference, hegathers evidence which leads him to concludethat to a large extent it does. He writes,

Verbal material may be recoded into a visual image(e.g., during application of a mnemonic device) andencoded into memory as a primarily visual schema.During recall, the schema is decoded visually andthen recoded once again into verbal symbols Ip. 297,italics added],

Similarly Bahrick and Boucher (1968) arguein favor of an "image retrieval before per­ception" view. They write,

if one is asked to recall the color of a couch in theliving room of a friend's home, however, it is likelythat the verbal transformation occurs at the timeof recall, and is based upon stored visual information[p. 417],

This is exactly the same type of argumentwhich Shepard (1966) used,

. . . if I am now asked about the number of win­dows in my house, I find that I must picture thehouse, as viewed from different sides or from withindifferent rooms, and then count the windows pre­sented in these various mental images Io. 203],

The view depicted in the above quotations,though phenomenally quite sound, presentsserious problems if it is taken as an explana­tory account of the process of retrieving pic­torial information (Le., information initiallyacquired visually) from memory. This is be­cause however metaphorically one interpretsthe notion of picturing a recalled scene inone's mind, the implication is always that

whatever is retrieved must be perceptuallyinterpreted (or reperceived) before it becomesmeaningful. In other words, the appearanceof a memory image precedes its interpreta­tion by the usual perceptual processes, suchas those resulting in figure-ground distinc­tions, object individuation and identification,and the abstraction of attributes and relation­ships among elements of the scene. But whatcan serve as the input to such a perceptualprocess? Whatever it is, it must be very muchlike the pattern of sensory activity whichtakes place at various levels of the nervoussystem when some sensory event token occurs.

Such a position, however, runs into manydifficulties. First, in supposing that informa­tion received through the senses is stored inmemory and retrieved at a later date in anuninterpreted form, we place an incredibleburden on the storage capacity of the brain.In fact, since there is no limit to the varietyof sensory patterns which are possible (sinceno two sensory events are objectively identi­cal), it would require an unlimited storagecapacity," Second, such a view creates severedifficulties for the retrieval process. Since thesensory events are stored in "raw" form, re­trieval can occur by one of two means. Eitherone retrieves an image by some sort of scan­ning process in which putative candidates areplaced before the mind's eye to determinewhether they are appropriate, or else theimages are tagged by some gross labels andassociatively retrieved by a multiple-sort key.The first of these is unreasonable on severalgrounds. Perceptual processes in the "mind'seye" appear to be no faster than the usualperceptual recognition processes, so the timefor an exhaustive search would be prohibitive.

3 This claim does not depend on any assumptionsabout how information is encoded, so long as wehold that what is stored in some encoding of theparticular stimulus token. Thus it holds for all typesof encoding of the sensory pattern token includinganalogical ones such as holograms. It does not, how­ever, apply to a view. which has humans storingprocedures which construct a representation anewfrom a finite set of primitive symbols each time astimulus is encountered. Thus we are able todiscriminate an unlimited number of stimulus pat­terns (e.g., numbers) even though we cannot storean unlimited number of such (encoded) patterns.This issue is discussed in Pylyshyn (1973) and inFodor, Bever, and Garrett (in press).

Page 10: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

10 ZENON W. PYLYSHYN

Furthermore, the conscious awareness, whichsuggested image storage in the first place,also reveals that we directly retrieve thecorrect information without a series of falseattempts.

The second alternative is implausible onthe grounds that we can retrieve informationabout a whole scene or any part of it byaddressing aspects of the perceptually inter­preted content of the scene. Even if we con­fine ourselves to the retrieval of phenomenalimages, we can argue that the content of suchimages must be already intepreted-in spiteof the fact that we seem to be "perceiving"them as we would novel stimuli. This must beso because retrieval of such images is dearlyhierarchical to an unlimited degree of detailand in the widest range of aspects. Thus, forexample, I might image a certain sequenceof events at a party as 1 recall what happenedat a certain time. Such images may be quiteglobal and could involve a whole scene in aroom over a period of time. But I might alsoimage someone's facial expression or the jewelin their ring or the aroma of some particularitem of food without first calling up the entirescene. Such perceptual attributes must there­fore be available as interpreted integral unitsin my representation of the whole scene. Notonly can such recollections be of fine detail

. but they can also be of rather abstract quali­ties, such as whether some people were angry.Furthermore, when there are parts missingfrom one's recollections, these are never arbi­trary pieces of a visual scene. We do not, forexample, recall a scene wIth some arbitrarysegment missing like a torn photograph. Whatis missing is invariably some integral per­ceptual attribute or relation, for example,colors, patterns, events, or spatial relations(we might, for example, recall who was at theparty without recalling exactly where theywere standing). When our recollections arevague, it is always in the sense that certainperceptual qualities or attributes are absentor uncertain-not that there are geometricallydefinable pieces of a picture missing. All ofthe above suggest that one's representationof a scene must contain already differentiatedand interpreted perceptual aspects. In otherwords, the representation is far from beingraw and, so to speak, in need of "perceptual"

interpretation. The argument is not simplythat retrieval of images would involve a be­wildering cross-classification system while re­trieval in other forms of representation wouldnot. The point is that because retrieval mustbe able to address perceptually interpretedcontent, the network of cross-classified rela­tions must have interpreted objects (i.e.,concepts) at its nodes. Thus storing imagesat these nodes as well is functionally redun­dant. This does not mean, of course, thatwhat we retrieve cannot be further processed.We shall examine several ways in which suchrepresentations can reasonably be thought ofas being processed further after retrieval(e.g., by the application of operations suchas counting). The argument is simply thatthey are not subject to perceptual interpreta­tion the way pictures are interpreted.

Our attack against the notion of an imagebeing an entity to be perceived need not, ofcourse, appeal to phenomenal observations.Consider, for example, the following argu­ment. There are denumerably many logicallyindependent propositions true of any sceneor of any physical object (including a realpicture). Since the brain can store only afinite (in fact, relatively small) amount ofinformation about anyone scene, we mightask about the nature of this selected finitesubset. One possible answer is that the storedrepresentation is a pictorial image of limitedresolution (Le., one which can effectively bereplaced by a finite two-dimensional grid,each element of which contains a selectionfrom a finite set of attributes). But this isunsatisfactory not only because it still leavestoo much information (we can easily show,because of the fineness of some of the detailsrecalled, that the overall resolution of anypictorial representation would still have to berather high), but because such an approachresults in the wrong kind of information beingselected. Thus, as we argued in the previousparagraph, we are more likely to recall suchthings as which objects were present withoutrecalling their exact relations than we areto recall all the detailed information but withlow precision.

We may assume, then, that the representa­tion differs from any conceivable picture-likeentity at least by virtue of containing only

Page 11: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

A CRITIQUE OF MENTAL IMAGERY 11

as much information as can be described bya finite number of propositions. Furthermore,this reduction is not reasonably accounted forby a simple physical reduction such as thatof limited resolution. What type of represen­tation meets such requirements? A numberof alternative forms of representations arediscussed in a subsequent section. For thepresent, it suffices to point out that any repre­sentation having the properties .mentionedabove is much Glaser to being a descriptionof the scene th.an a picture of it. A descriptionis propositional, it contains a finite amountof information, it may contain abstract aswell as concrete aspects and, especially rele­vant to the present discussion, it containsterms (symbols for objects, attributes, andrelations) which are the results of-not inputsto-perceptual processes. Of course, to saythat the representation was a descriptionwithout being more specific about the natureof such descriptions still leaves it vulnerableto some of the types of criticisms which wehave directed against images. Both imagesand descriptions carry too much undesirableexcess meaning; for example, the lattermay imply a fixed-order of aocess, as inreading, which is certainly unwarranted."Descriptions" of the type we have in mindare never accessed in a fixed serial order inany of the systems which we will examinelater. Apart from the arguments made aboveand those which we mentioned earlier in dis­cussing knowledge as being propositional, thenotion of a description gains its greatest ad­vantage from the fact that it has been for­malized in a number of areas (e.g., incomputer-simulation models). In such con­texts the representations provide a formallyadequate amount of certain types of cognitiveactivity while, at the same time, correspond­ing closely 'to what we intuitively mean bythe term "description."

The mental representation differs fromwhat is inferred from the conscious image inmany ways. For example, to use an illustra­tion cited earlier, while two visual images ofa chessboard may be pictorially identical, themental representation of one might containthe relation between two chess pieces whichcould be described by the phrase "being at­tacked by" while the representation under-

lying the second image might not (d. Simon& Barenfeld, 1969). For this reason, it wouldbe reasonable to expect that the mentalrepresentation of a configuration of pieceson a chessboard would be much richer andhighly structured for a chess master than foran inexperienced chess player. This view issupported by de Groot (1966) who foundthat chess masters could recall an authenticboard position much better than inexperi­enced players (after viewing it for 5-10seconds) in spite of the fact that their visualmemories were no better (as measured, say,by their ability to recall chessmen randomlyplaced on a board).

As another example, it would be quite per­missible, according to the view which we havebeen presenting, to have a mental representa­tion of two objects with a relationship be­tween them such as "beside." Such a repre­sentation need not contain a more specificspatial-relation term such as "to the left of"or "to the right of." It would seem to be anunreasonable use of the word "image," how­ever, to speak of an image of two objectsside by side without one of the relationsbetween them being either "to the left of" or"to the right of." (The fact that children,who are especially adept at "visual imagery,"frequently have difficulty in discriminatinga figure such as a letter from its mirrorimage, suggests that their mental representa­tion of such figures suffers precisely from sucha lack of explicit differentiation of the rela­tions "to the left of" or "to {he right of"in favor of a more general relation such as"adjacent to" or "away from the center.")Similarly, we could have a mental representa­tion ofa triangle which might consist of astructure in which the symbol "triangle" washierarchically linked (by the relation "has asparts") to three representations of lines whichwere, in turn, linked to each other via rela­tions labeled "connected to." Such a network(which resembles many of the artificial intel­ligence data structures-see below) need notcontain relations of the type "at an angle ofn degrees to." On the other hand, there isconsiderable uncertainty (as dramatized bythe debate between Locke and Berkely)regarding the possibility of having an imagecorresponding to the above representation,

Page 12: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

12 ZENONW.PYLYSHYN

namely, of a triangle which is "neitheroblique nor rectangular, neither equilateral,equicrural, nor scalenon; but all and none ofthese at once."

To summarize, then, we have argued thatthe functional mental representation is notto be identified with the input to a perceptualstage but rather. with the output of suoh astage, inasmuch as it must already contain,in some e~plicit~anner, those cognitiveproducts which perception normally provides.If we could think of functional (rather thanphenomenal) "images" in this sense, wewould have removed the disturbing dualityof "image" and "mind's eye," while, at thesame time, we would have answered some ofthe puzzling classical questions referred toearlier: An "image" qua representation inour sense can certainly be selective, generic,abstract, and even unconscious inasmuch asthe cognitive products of perception can beall of these.

ALTERNATIVE FORMS OF REPRESENTATION

In .this section we briefly examine threeapproaches which have been used in theo­retical studies of the representation problemfor cognition. The approaches are closelyrelated and are distinguished primarily by theresearch areas in which they are developedand by the descriptive formalisms which theyemploy, although there are one or two moresignificant differences among them which weshall try to draw out. The first approachinvolves the use of propositions and usuallyrelies on deductive proof procedures forprocessing them. The second approach derivesprimarily from work in computer simulationof cognition and in artificial intelligence. Theform of the representation is called an infor­mation or data structure and is frequentlydescribed in terms of directed graphs.The third approach represents conoepts interms of procedures. These three types ofapproaches are described below.

Propositional Representations

Because of the availability of the predicatecalculus as a formal language for expressingthe contents of knowledge, propositions havebeen widely used-especially by students ofartificial intelligence--as an explicit form of

representation. This form has the great ad­vantage that well-known mathematical sys­tems for manipulating formal sentences canbe applied to the representations to derivetheir logical entailments.

In its simplest form, such an approachassumes that what a person knows can berepresented by a finite list of propositions oraxioms (although, to repeat again an earlierpoint, this must not be taken to imply thattokens of actual sentences in some na:turallanguage are stored). Rules of deductivereasoning can then be applied to this list togenerate all the logically valid propositionswhich follow from the initial· "premises."Herein lies one of the attractions of thisapproach: It is generative in the sense thatan unlimited number of "beliefs" can bededuced by a straightforward mechanical pro­cedure from the initial representation. Thusit ought to allow an indefinite number ofquestions to be answered about the knowledgerepresented.

Question-answering systems, in fact, havebeen developed which represent their database in the predicate calculus and which usea theorem-proving procedure for retrievinginformation (e.g., Green & Raphael, 1968).The question to be answered is converted toa proposition to be proven in the system. Aconstructive proof of this proposition thenprovides the answer. For example, a con­structive proof that there exists an object (inthe formal sense of this term, includingmathematical objects such as numbers) whichsatisfies certain 'conditions would actuallyidentify suoh an object. Slagle's question­answering system DEDUCOM and the morecomprehensive MULTIPLE (Slagle, 1971),which can be applied to a wide spectrum ofproblems from playing chess to solving prob­lems in logic, both use an explicit proposi­.tional data representation. The Stanford Re­search Institute robot "Shakey"(Raphael,1968) operates by storing its "knowledge ofthe world" in propositional form and using atheorem prover to respond to commands.For example, the command to push two largeblocks together is transformed into a proposi­tion to the effect that there exists a path oftravel which leads to the desired state withtwo large blocks together, given the present

Page 13: JULY Psychological Bulletin - Rutgers Universityruccs.rutgers.edu/.../class-info/ImageryClass2011/ZP_MindsEye73_Segment.pdf"mind's eye," which, while they may faith fully reflect ones

164

“This was a much more painful paper to writethan might appear. It started off as a criticism ofsome of the claims I heard graduate studentsmaking about the theoretical significance ofexperimental results involving mental imagery. Ihad the impression that they really did believethat there were pictures in the head that weexamined when we created mental images. Ofcourse everyone denied that this is what theyactually believed.

“As I discussed this problem with various peopleat colloquia and conferences I discovered that theequivocation about the nature of mental imageswas extremely deep-seated and pervasive.Almost everyone I spoke to was prepared to admitthat, though they themselves did not take thepicture view literally, the person in the next officedid fall prey to that error. The trouble was thatnobody seemed to be exempt from theconceptual trap. I have to confess that I hadtrouble with it myself. During my first attempts atwriting the paper I was plagued with doubts aboutmy own coherence, so powerful is the subjectiveimpression that imaging is a species of ‘looking’ atsome mental object. It was certainly true of thereviewers of the manuscript I finally submitted toone journal: they rejected it with arguments thatshowed that they were precisely the ‘straw men’that I was arguing against. The universal responseto my arguments was (and, indeed, still is) todeny that any right-thinking person took thepicture view seriously and to insist that thecorrect way to interpret the findings was X. Butwhen I looked closely at X I found that it was eitherthe picture view in very flimsy disguise or it wasnot explanatory at all (i.e., it was merely a namefor the general capacity to reason about visualphenomena—as in the case of the phrase’

This paper presents a critique ofcontemporary research which uses the notionof a mental image as a theoretical construct todescribe one form of memory representation.It is argued that an adequate characterizationof ‘what we know’ requires that we positabstract mental structures to which we do nothave conscious access and which areessentially conceptual and propositional,rather than sensory or pictorial, in nature.[The Social Sciences Citation Index® (SSCI®)indicates that this paper has been cited inover 210 publications since 1973.]

Zenon W. PylyshynCentre for Cognitive ScienceUniversity of Western Ontario

London, Ontario N6A 5C2Canada

August 20, 1982

second-order isomorphism’).“My claim, that the theoretical construct required

to explain the phenomena surrounding mentalimagery had almost none of the characteristicsassociated with pictures, met with enormous andwidespread resistance. I soon realized that whatwas at stake was a serious misunderstanding ofwhat one had access to through consciousintrospection. When you introspect and you seean image of a large heavy green tree-shaped thingthere is nothing in your head that has any of theproperties ‘large,’ ‘heavy,’ ‘green,’ or ‘tree-shaped’—those are properties of the thing you arethinking about. Failure to keep this point in mindhas been the central weakness of two decades oftheorizing about mental images and, as far as Ican see, is still true today (see my 1981 paper fora discussion of this as it applies to more modernresearch1).

“It is this conceptual slipperiness of the idea ofimagery that contributed both to the popularity ofthe paper and to the fact that it is so widelymisunderstood. People cite it mainly because itrepresents a critique of a view about which theyhave some lingering uneasiness —and it came outat a time when the interest in imagery was sostrong that nobody was bothering to be the leastbit critical except the doctrinaire behaviorists. Thebest way to put it is that psychologists found thepaper tantalizing without being convinced. Somewere impressed enough to have serious doubtsabout the interpretation of at least some of theclaims of imagery research (such as that long-term memory consists of an indexed store ofpictures). But most citations look something like,‘Images have been shown to have some propertyP—but see Pylyshyn, 1973, for a contrary view.’

“I have to confess that I find it wearisome to beassociated with the argument about imagery,even though it has played a central role in shapingmy career and my current interest in relatedphilosophical questions.2 In one sense theintuition that people have when they claim thatthere is something special about thinkingimaginally must be true. The problem is thatnobody has been able to give any coherent storyabout what the special thing is. In the process oftrying to do so, however, psychologists haveuttered more silly things per page than in almostany other area of psychology. So while I keepdoggedly pursuing the more fashionablealternative proposals (e.g., see my papersof19783 and 19811), I believe there is something inresearch that will eventually provide clues to thenature of the mind. In the meantime it seems to betelling us more about the problems of interpretingexperimental regularities.”

CC/NUMBER 42OCTOBER 18, 1982This Week’s Citation Classic

Pylyshyn Z W. What the mind’s eye tells the mind’s brain: a critique of mental imagery.Psychol. Bull. 80:1-24, 1973.[Univ. Western Ontario, London, Ontario, Canada]

1. Pylyshyn Z W. The imagery debate: analogue media versus tacit knowledge. Psychol. Rev. 88:16-45, 1981.2. ...................... . Computation and cognition: issues in the foundations of cognitive science. Behav. Brain Sci.

3:111-32. 1980.3. ..................... . Imagery and artificial intelligence. (Savage W, ed.) Perception and cognition: issues in the

foundations of psychology. Minneapolis: University of Minnesota Press. 1978. p. 19-55.

hemchand
hemchand