Even though ever more complex areas of research have found their way into psychology, “Pavlov’s dog” continues to fascinate many researchers. What causes the enduring fascination with conditioning research? Does such research even have psychological significance? Would it not be better if it remained a study field for physiologists, as it originally was? The answers to these questions are partly determined by one’s conceptualization of classical conditioning. Most people are by now sufficiently familiar with its schematic representation: a conditioned stimulus (CS) elicits a conditioned response (CR), provided this stimulus has repeatedly been presented together with an unconditioned stimulus (US) that “inherently” elicits an unconditioned response (UR). Several limiting conditions qualify this schematic depiction. The CS must be “neutral” vis-à-vis the US. In other words, it cannot spontaneously elicit a response that is identical to the UR. The US must “inherently” elicit a well-defined response, which is why stimuli that are biologically significant for the studied organism are typically used (for some theorists, this became a necessary condition for conditioning to take place). The resulting CR must be an autonomous response that is part of the reaction pattern that the US evokes. This schematic depiction also provides insight into the necessary (and sufficient) conditions for conditioning to occur: both stimuli have to occur simultaneously. Finally, this schematic representation already implies what is learned: learning is equated with the modified reaction pattern vis-à-vis the CS. To put it simply, the dog learns to salivate at the sound of the bell.
This schematic depiction and the limiting conditions it implies constitute a strong simplification of the original phenomenon. After all, Pavlov’s interest in conditioning originated from his observation that the dog started to salivate when it heard and saw the man who brought the food. This rather complex event – someone who brings food – was ultimately reduced to a little lamp or an auditory signal predicting food. The “food” event of seeing a meat chunk in a bowl was reduced to the injection of meat powder directly into the animal’s mouth. The dog’s overall reaction pattern upon hearing the man who brings the food – and anyone who has a dog will be familiar with this pattern – was ultimately reduced to droplets of saliva (the reductive nature of this response was already highlighted by Zener, 1937). Moreover, the autonomous reaction that held Pavlov’s primary interest as a physiologist was initially not viewed as a core index of learning the relation between two events, but was subsequently seen as an almost integral part of the definition of classical conditioning (Gormezano & Kehoe, 1975). We can probably all concur with Rescorla and Holland’s related observation that “if conditioning were confined to what some have called “spit and twitches”, it would lose much of its psychological interest” (Rescorla & Holland, 1976, p. 184).
This strong reduction of the original events is probably characteristic of every type of operationalisation. This is justified in and of itself: operationalisations that reduce a phenomenon to its essence are vital for obtaining fundamental knowledge about the necessary and sufficient conditions that determine the occurrence of that phenomenon. But the danger exists that the question behind a concrete operationalisation is simply forgotten after a while. Moreover, there is a real danger that general statements and laws are formulated that are strongly connected to the concrete operationalisation. Something along those lines certainly happened in the study and appreciation of classical conditioning. The aim of this contribution, then, is to shed light on a number of recent trends in classical conditioning studies that might justify the title of this contribution. First, I summarize the most important findings that call for a broader framing of classical conditioning research. This is followed by a comprehensive discussion of one particular form of learning, that is, taste aversion that results from relations between the taste of food or drink on the one hand, and artificially induced nausea on the other hand. This phenomenon is sometimes referred to as the Garcia effect. The topic of taste aversion is discussed not because it is an almost prototypical example of classical conditioning, but because it contributed substantially to the questioning of important assumptions about conditioning. A number of authors have even called this the beginning of a “paradigmatic revolution” (Rozin, 1977; Bolles, 1975). The final part is somewhat speculative in nature: using the preceding observations as a starting point, it argues that a nontrivial similarity exists between recent theories in classical conditioning studies and those in a literature that at first glance appears to bear little relation to it, that is, attribution theories in social psychology.
Classical conditioning: learning associations between two events
Every existing organism must in some way or another be sensitive to both meaningful as well as more coincidental relations between events in the environment, especially when such relations concern biologically significant events. At the same time, it would be maladaptive for an organism if the mere coincident occurrence of two events would be a sufficient condition for the organism to establish a connection between the two. Nevertheless, a coincident occurrence has often been considered a sufficient condition for learning a relation. When doubts were expressed about this idea, they concerned the nature of either one of both events (does one of the events need to have reinforcement value) rather than the nature of the relation itself (i.e., co-occurrence). What follows will demonstrate that every organism can process a wider range of informational relations than the mere joint occurrence of events. In describing this broad range, we aim to list general facts rather than to go deep into possible explanations.
The role of contingency
Instead of using terms indicating co-occurrence, relations can also be expressed in terms of correlation or contingency. What is emphasised in this case is not the temporal relation between two events but their logical relation. Applied to the situation of Pavlov’s dog, this means that a perfect positive correlation is introduced between the CS and the US in the experimental context. In other words, the conditional probability that the US is presented, given that the CS has been presented, equals 1; the probability that the US is presented in the absence of the CS equals 0. This is symbolically expressed as ρ(US/CS) = 1.0 and ρ(US/°CS) = 0. This can probably be further illustrated using what Seligman, Maier and Solomon called “The Pavlovian contingency space” (cf. Figure 1).
Given that the essence of the classical conditioning procedure lies in the experimenter’s full control over the two stimuli that are presented, this paradigm lends itself superbly to a study of the effects of variations in the correlational strength of CS-US relations.
Rescorla (1968) was one of the first to study this issue systematically. Over several experiments (Rescorla, 1975), he demonstrated that animals are sensitive to variations in contingency, ranging from a perfect positive correlation to a perfect negative correlation (respectively below and above the diagonal in Figure 1.) In this way, he succeeded in translating Pavlov’s two most important findings – excitatory and inhibitory conditioning – into contingency terms. Excitatory conditioning occurs whenever the animal learns that the CS and US tend to go together, in other words when ρ(US/CS) > ρ(US/°CS). A large number of behavioural indices then allow one to determine that the animal is behaving as if it “expects” the US when the CS is presented. Inhibitory conditioning occurs when the animal learns that the US and the CS tend not to go together, ρ(US/CS) < ρ(US/°CS). In this case, when the CS is presented, the animal will behave in a manner that is opposite to how it would behave in excitatory conditioning. When the CS and US are “randomly” presented, with no relation between both stimuli in other words, or ρ(US/CS) = ρ(US/°CS), it is observed that the CS does not acquire a new significance for the animal; in other words, the CS does not elicit a differential reaction. This nonetheless represents a form of active learning: learning that there is no relation is not synonymous to not learning (Mackintosh, 1973; Seligman, 1969). Let us illustrate this rather abstract formulation for what is known as “fear conditioning”, which is usually operationalised through the administration of an electrical shock as a US and an external stimulus (e.g., a visual signal) as a CS.
When, within the context of the experiment, the probability of a shock increases after the presentation of a given visual signal, this stimulus acquires a signalling function for the shock: the animal will behave “anxiously” when the CS is presented. If, however, the chance that a shock is administered is lower after the visual stimulus than in the absence of that stimulus, the animal will behave in a fairly “relaxed” fashion when the CS is administered. When the visual signal and the shock are “randomly” presented, the visual signal does not acquire a special meaning. Instead, the context as a whole becomes “fear-inducing” to the animal (Seligman, 1968).
The need for a contingent relation already indicates that mere co-occurrence is not a sufficient condition for an organism to learn the relation between two events. Some way or another, the organism is sensitive to the predictive value of stimuli and the covariance of events in its environment. What follows will illustrate that even a perfect contingency does not constitute a sufficient condition.
Lubov (1973) coined the term “latent inhibition” to describe the following observation: when a stimulus is repeatedly presented by itself (i.e., without a US) in a particular context and when it is subsequently always followed by a US, it is difficult to obtain conditioning. It is not all too clear whether one should invoke non-associative or associative principles to explain this phenomenon. On the one hand, it could be argued that the organism no longer is attentive to the stimulus and, as it were, no longer even notices the stimulus because it has repeatedly been presented in the past. On the other hand, it could be argued that the organism has learned that the stimulus is irrelevant because the stimulus has repeatedly been presented on its own and that it afterwards struggles to realise that it is precisely this stimulus that should be considered the signal for an important event (Mackintosh, 1975). There is a certain similarity between latent inhibition and what occurs when a US is administered repeatedly before it is preceded by a CS. Here too, the results show that it is difficult to make this CS acquire the function of a signal for the US. For instance, when a series of shocks are administered in a non-contingent fashion and every shock is afterwards preceded by a tone in that same context, it takes a long time for the organism to learn this tone-shock relation. Again, the explanation for such data can be sought in non-associative or associative principles (Randich & Lolordo, 1979).
When two stimuli are presented together and consistently followed by a US, often only one of those stimuli will acquire the function of a signal for the US. Pavlov (1927) already discussed this phenomenon extensively and related it to the difference in “saliency” of the stimuli (as determined by the modality and intensity of the stimuli). Formal classical conditioning models have built in this “saliency” as a parameter – either as a fixed value (Rescorla & Wagner, 1972) or as a fluctuating value in accordance with the relation to the US (Mackintosh, 1975).
Relative information value
Suppose that stimulus A and B are presented together and followed by an electric shock. In group I, stimulus B is also presented separately but not followed by a shock in between the A+B presentations. In group II, B is also presented separately every now and then, but here it is followed by a shock. In group III, only A+B trials are presented. The question is what happens to the signal value of stimulus A. A and a shock are after all paired an equal number of times in all groups. The relative information value of A, however, varies between groups because B is presented separately in groups I and II. In group I, A becomes the best predictor for a shock. B is a better predictor in group II, while the information value of both stimuli is equal in group III. When A is now separately tested in the three groups, conditioned responding varies in accordance with the manipulated information value (Wagner, 1969).
No phenomenon has probably made a larger contribution to clarifying the complexity of conditioning than blocking. It would be impossible to comprehensively list the relevant literature. We will therefore limit ourselves to a description of the basic phenomenon. Kamin (1969) was the first to bring this phenomenon to light in his “overshadowing” studies. Stimulus A (e.g., a visual signal) is frequently followed by an electric shock. When the conditioning is complete, stimulus B (e.g., an auditory signal) is presented together with stimulus A, and both are followed by a shock. B does not acquire a signal value even though there is a perfect correlation between B and a shock from this moment onward. This is evident from the fact that when B is presented on its own, it does not elicit a response. It is as if the previous conditioning of A is blocking the conditioning of B, hence the term. There are indications that the animal does notice stimulus B, but that it learns as it were that B presents irrelevant or at least redundant information about the US (Mackintosh, 1978).
The above information clearly indicates that mere stimulus co-occurrence is not a sufficient condition for an organism to relate two events. The discussion below will demonstrate that it is also not a necessary condition, which again offers a different perspective on classical conditioning. Instead of an automatic process that plays out in a passive organism, the organism emerges as an active information-processing system.
It is probably possible to relate all the phenomena that were discussed above to the role of contingency. But the question remains what mechanism can be invoked for explaining the role of contingency. Some do not hesitate to postulate that the animal has a cognitive representation of the contingency space (Alloy & Seligman, 1979). Others have drawn more cautious conclusions. As Rescorla notes:
“Most of us are not comfortable with the notion that organisms take in large blocks of time, count up numbers of US events, and somehow arrive at probability estimates … It is tempting to think of simple “tricks” that the organism could use to perform in this apparently rational fashion” (Rescorla, 1969, p. 84–85).
In other words, being influenced by a correlational relation does not ipso facto imply that the organism concerned has any understanding of this correlation. It is therefore remarkable that Rescorla, who perhaps highlighted the role of contingency more than anyone else, succeeded in developing a theory in which the learning of relations can be traced back to the co-occurrence of two events after all (Rescorla & Wagner, 1972). At the level of formalisation, this theory remains purely descriptive. We would like to note, however, the psychological intuition on which it was built (Rescorla, 1969). The notion of “expectation discrepancy” is central here. As soon as something (important) happens unexpectedly – in other words, it was not predicted – it is as if the animal starts searching for a predictor for this event. Expectation discrepancy appears to be a necessary condition for a stimulus to be interpreted as the signal for this unexpected event. No new learning occurs when either the context (see latent inhibition) or other signals (see blocking) had already predicted the event. This “expectation discrepancy” also explains inhibitory conditioning: when an event that an organism expects to occur in a particular context does not occur, a stimulus that is correlated with this expectation discrepancy may acquire an inhibitory function. Note that this theory emphasizes the role of the environment and the organism’s prior history. We deliberately use metaphors like “to start searching for a predictor”, “to interpret an event” etc. It is as if the facts can only be described in such terms. Such language becomes even more imperative when describing taste aversion.
Taste aversion: The Garcia effect
A short article by Garcia and Koelling published in Psychonomic Science in 1966 was the starting point of the literature on what is now known as the Garcia effect. At the time, there was nothing to suggest that the article would become a classic. Quite the contrary, the article had been rejected by a more renowned journal, which the then editor would later express his regrets about. As is often the case with “classics”, the article was indeed rather weak at the methodological level, but it contained fairly far-reaching theoretical implications. Today, these are referred to as the “Garcia effect”, “the message of Garcia” and “the paradigmatic revolution”. At least 600 articles that were more or less inspired by the Garcia effect have been published since then. This exceptional level of attention does not guarantee scientific relevance in itself. Garcia’s findings may have originally been called into question due to their methodological shortcomings, but the extensive attention has at the least ensured sufficient subsequent independent replications of Garcia’s experiments. The phenomenon is real. The debate about its reach and interpretation, however, remains active today. We first discuss the meaning of “the message of Garcia” by describing a couple of typical experiments. We subsequently reflect on the varying attempts that have been made to interpret this phenomenon.
“The message of Garcia”
What the message of Garcia essentially revolves around is probably best illustrated with an anecdote recounted by Seligman (Seligman & Hager, 1972). After he was served “filet mignon with béarnaise sauce” during a dinner, he became unwell at night. This nausea later proved to be a harbinger of a flu attack. But Seligman had already ascribed it to the béarnaise sauce, and since then he cannot suffer the look, let alone the taste of this sauce. This anecdote raises several questions. Why did he “ascribe” his becoming sick to the béarnaise sauce? Why not to the filet, the dessert or the drinks? Why not to the restaurant or the other guests? Why did his aversion to béarnaise sauce not disappear when it later turned out that the flu was a far more likely cause? Why did béarnaise sauce taste so bad since then? It turns out that answering these questions becomes difficult when this event is translated into a conditioning paradigm, with flavour as the CS and becoming sick as the US (or UR). The most noticeable departures from the normal rules are the extended time period between the CS and US and the difficulty of the extinction, even after the adjusted interpretation. The nature of the US moreover appeared to determine the selection of the CS and, finally, a process that is qualitatively different appears to be at stake here: the béarnaise sauce is avoided not because it is seen as a predictor of nausea, but because it acquires an intrinsically bad flavour.
Garcia’s studies evoke similar questions. The discussion of a typical study will illustrate this further. Garcia and Koelling (1966) deprived caged rats of water for the duration of the experiment. Every day, the rats were placed in individual test cages that contained a drink tube. After an adjustment period in which clean water was offered, the learning phase began. The water was replaced by a saline solution. Every time the drink tube opening was touched, a visual and auditory stimulus were presented so that every drinking attempt was paired with a “bright-noisy-tasty” constellation of stimuli. In the first group, drinking coincided with a period of radiation (X-rays).1 In the second group, lithium chloride was used as the saline solution; this has a poisonous effect but the rats cannot distinguish it from a non-poisonous saline solution (Nachman, 1963). In a third group, an electric shock was administered two seconds after drinking. This learning phase was spread over several days in all the groups. On non-conditioning days, the test cage contained only normal water, the drinking of which was not paired with the abovementioned constellation of stimuli. This was followed by a test phase in which either the audiovisual stimulus or the flavour (saline solution) without the audiovisual stimulus was presented during the drinking of clean water. In the X-ray and lithium groups, there was a clear suppression of drinking with the flavour test but not with the audiovisual test, while precisely the opposite occurred in the shock group. In other words, there appears to be an interaction between the nature of the discriminative stimulus and the drinking consequences.
At first glance, several findings regarding conditioned taste aversion indeed contradicted the basic rules of conditioning. First and foremost, there was a clear parametric difference with more typical conditioning preparations: the time interval between the taste CS and administration of the aversive US (induced sickness) was typically much larger. “Records” of 24 hours of difference were set (Etscorn & Stephens, 1973)! In the experiment described above, an interaction moreover exists between the nature of the CS and the nature of the US, which is probably the finding that has prompted the most discussion: It is not possible to learn an association between whichever two things. Finally, we already noted that we are seemingly dealing with a qualitatively different phenomenon.
The different attempts to explain flavour aversion can be separated into two main orientations. A first orientation refers to the biological nature of every organism. Through the course of natural selection, every organism has come to be equipped with specific learning mechanisms that, depending on the organism’s adaptation, show specific characteristics as a function of the different challenges the animal faces in its environment. For instance, it is indeed vitally important for an animal to learn the association between certain food attributes and certain metabolic effects. A second orientation attempts to reconcile the properties of flavour aversion with the more general fundamental rules that govern the learning of relations between two events. It does not deny that parametric and perhaps qualitative differences clearly exist between learned taste aversion and the more conventional conditioning findings. But these differences supposedly originate from the particular characteristics of the used stimuli. Insofar that these characteristics can be described, their influence can be assessed through experiments – independent of the flavour aversion phenomenon.
It is indeed remarkable that all the factors that influence learning of an association between two events (cf. below) also have an influence on learned flavour aversion. First, there is the impact of contingency. Inducing a flavour aversion requires a positive correlation: “random” administration of a flavour and US does not have an effect, and a negative correlation between a flavour and the US results in a preference for this flavour (Best, 1975). Latent inhibition is also possible: flavour aversion is slow to develop when the animal is made to taste a certain flavour repeatedly before it is paired with an aversive substance (Domjan, 1972; Elkins, 1973). It is equally clear that when the animal is first repeatedly made ill in a way that is non-contingent to ingestion of a particular food, the animal subsequently no longer ascribes this becoming sick to the flavour of the food (Braveman, 1977). “Blocking” finally has also been demonstrated; a learned aversion to a particular flavour can “block” learning of aversion to a different flavour (Revusky, 1971). It is important to note a study by Rudy, Iwens and Best (1977) in this regard. They first induced a contingency between an external stimulus (black cage) and nausea. When the flavour of saccharine was subsequently involved in this contingency, the animal no longer ascribed the nausea to the flavour. This study is important in two regards. First, it demonstrates that associations between external stimuli and “nausea” can indeed be learned as long as an external stimulus is used that is fairly salient and that can compete with a flavour stimulus in terms of “novelty”. In addition, the results of this experiment certainly do not appear to correspond to what one would expect from a “preparedness” view. If learning of a flavour-nausea contingency is truly “prepared”, it does not seem very plausible that learning of this contingency can be fairly easily “blocked” by a pre-induced artificial or at least unprepared contingency.
These findings indicate that the Garcia effect is not as extraordinary as it appears to be at first glance and that it can in fact be integrated into the more general findings about association learning (Logue, 1979). The particular characteristics of the Garcia effect, however, have urged reflection on the more conventional procedures from a different perspective.
Consider, for instance, the parametric difference between flavour aversion and set-ups that are more conventional in terms of the time lapse between the CS and US (or between the discriminative stimulus and reinforcement). The hypothesis of an after-flavour during nausea was of course the most simple one, but it was emphatically rejected empirically (Revusky & Garcia, 1970). It is therefore almost certain that the Garcia effect is due to a memory phenomenon. When the rat becomes sick, he “remembers” the type of food that may have caused this. Revusky (1971, 1977) integrates these findings into what he describes as a more general associative interference theory. This theory inspired Lett (1973, 1974, 1975, 1977) to demonstrate that a rat is capable of bridging a fairly large time interval between a discriminative stimulus and reinforcement – and this with more conventional procedures. For this to happen, however, the situation must be designed so that the animal is urged to again “call to mind” the discriminative stimulus during reinforcement. It is remarkable that the Garcia effect, which is so deeply rooted in the biological singularity of the organism, is an illustration of the animal’s cognitive capabilities and that it has helped integrate recent findings in the psychology of memory into conditioning studies (Best & Gemberling, 1977; Wagner, 1978).
In addition, there is the interaction between the nature of the CS and the US. This interaction is in fact only exceptional when one merely considers the external characteristics of a relationship between two events (contiguity, contingency). It becomes more comprehensible when one assumes that other factors also exist that influence the learning of relations, such as similarity, spatial factors, etc. This insight was probably best articulated by Testa (1975, 1976); he related learned flavour aversion to the more general question of how the animal perceives causal relations in its natural environment. He argues for factors to be integrated into the study of conditioning that had already been previously underlined by gestalt psychologists in relation to perception. We find a similar plea in Revusky (1977) and Rescorla and Cunningham (1979).
Conditioning and attribution
After demonstrating how an organism can process complex relations, we have presented a discussion of learned flavour aversion because the latter highlights a central problem: what pushes the animal to selectively attribute certain effects to the ingestion of food or drink when both events are so far removed in time? Rather than viewing this simply as an innate mechanism, it was argued that this phenomenon should be integrated as much as possible into what we know about the learning of associations between events. What then is the meaning of all of this? It seems that there is a fundamental similarity between these findings in the conditioning literature and attribution theory as it was developed in the social psychological literature. This observation suggests that common principles exist that cause both humans and animals to discover causal relations in their environment. Such a speculative observation probably requires a number of prior explanations. Attribution theory is the study of the manner in which certain events are explained in terms of their potential causes. Born from the field of social psychology – a historic coincidence in Kelley’s view (1967) – the main topic of attribution research was people’s causal analysis of the behaviour of others and one’s self. In other words, on the basis of which rules do I infer the “why” behind my own or other people’s actions? But in essence, a much broader question is at stake in attribution theory: how does one make causal inferences between all sorts of events? The question even arises whether a clear distinction ought to be made between a causal interpretation of events and a causal interpretation of actions. The distinction between “cause” and “reason” is key here, and it was also the focus of a recent discussion (Buss, 1978; Harvey & Tucker, 1979; Kruglanski, 1979; Buss, 1979). Only causal interpretations of events will be discussed below. To put it in more trivial terms, when a rat is administered a shock by the experimenter, it might ask itself: What is this shock due to (asks after the cause)? It does not ask: why did this experimenter give me a shock (asks after the reason)? Both are “why” questions, but they are logically different from each other. Second, a distinction should be made between the attribution process and the content of the attributions (Nisbett & Wilson, 1977; Kruglanski, 1979). At the level of contents, it is obvious that any animal-human comparison would be a tenuous one. But this is also true for a comparison between mutual humans, if only because of cultural differences (Kruglanski, 1979). As regards attribution as process, it is probably possible to arrive at more general statements about the heuristics that apply to both humans and animals. A third introductory remark concerns the status of the concept of attribution. Attribution is intended as a “mediating” concept that can either be assigned a reality value or an “as if” nature. This is true for most “cognitive” concepts that were designed to mediate between input and output (consider, for instance, the concept of “expectancy”). In our view, there is a trend towards increasing emphasis on the “as if” nature of attributions in social psychology. The most common descriptions – somewhat schematically – present this sequence as follows: 1. something occurs (S) – 2. the organism asks itself “why” – 3. following deliberation, it arrives or does not arrive at a judgement – 4. it acts in a manner that is consistent with this (R). If assigned a reality value, it is possible to render the typically non-observable links 2. and 3. observable in humans by simply inquiring after them. It would not be exaggerated to state that this is the focus of most attribution research. And any study of attributions in animals is of course impossible in this respect. But it has been asked more and more whether the “links” do not acquire a different status precisely because they are made observable. Let us again briefly go back to Seligman’s anecdote about the béarnaise sauce. If Seligman is asked: “Why did you become sick?”, he will answer: “Because I had a flu attack.” In other words, does a “conscious” reflection on the occurrence of an event not respond to different rules than the total original experience of this occurrence? Is this not where the truth lies of Pascal’s statement that “le cœur a ces raisons que la raison ne connaît pas”? In a rather extensive article, Nisbett and Wilson (1977) defended the proposition that these cognitive mediating processes circumvent every form of introspection. To support their argument, they cited a number of statements by cognitive psychologists, including Neisser and Mandler, that we would like to cite here. For instance, Neisser writes that “the constructive processes (of encoding perceptual sensations) themselves never appear in consciousness, their products do” (Neisser, 1967, p. 301). And Mandler considers that “there are many systems that cannot be brought into consciousness, and probably most systems that analyze the environment in the first place have that characteristic. In most of these cases, only the products of cognitive and mental activities are available to consciousness” (Mandler, 1975, p. 245). Although Nisbett and Wilson’s proposition is debatable (Smith & Miller, 1978), Langer (1978) does not hesitate to go one step further: she simply denies the mediating role of conscious cognitions in most of our day-to-day actions: “Much psychological research relies on a theoretical model that depicts the individual as one who is cognitively aware most of the time, and who consciously, constantly, and systematically applies “rules” to incoming information about the environment in order to formulate interpretations and courses of actions. Attribution theorists rely on this model in attempting to uncover the sources of regularities in human behaviour. But if in fact it can be demonstrated that much complex human behaviour can and does occur without these assumed cognitive assessments, then we must question both pervasiveness of attribution making as a cognitive process and the assumptions made by most social psychologists” (Langer, 1978, p. 35). We find a similar plea to look for very simple heuristics to explain the notion of attribution in Kahneman and Tversky (1973), Pryor and Kriss (1977) and especially Taylor and Fiske (1978) who concluded that most attribution processes “seem to occur automatically and substantially without awareness, and as such, they differ qualitatively from the intentional, conscious, controlled kind of search which we like to think characterises all our behaviour” (Taylor & Fiske, 1978, p. 283).
These introductory explanations probably create more room for the proposition that there is something common about the way that humans and animals infer causal relations. We rely on a recent overview article by Kelley and Michela (1980) on attribution theory to buttress this claim in a more direct way. They first offer the general scheme that is implicitly contained in the study field of attribution (Figure 2).
As indicated above, any type of direct research into (2) is evidently impossible with animals. One has to limit oneself to manipulations of (1) and inferring what happens in (2) from a change in (3). But as also noted above, this limitation probably also applies to studies of attribution in humans. On the antecedent side (1) then, there is a clear similarity between the factors influencing the nature of attributions and conditioning. Let us illustrate this using the principles that Kelley and Michela distilled from attribution literature. These principles hold that certain aspects of the information that the organism is confronted with lead to attributions. Almost every one of these is a principle that we already mentioned in our discussion of the factors that influence conditioning.
- Covariance: The ANOVA model. This principle was primarily emphasised by Kelley himself (Kelley, 1967, 1973). “The effect is attributed to that condition which is present when the effect is present and which is absent when the effect is absent” (Kelley, 1967, p. 194). This covariance principle is of course heavily analogous, if not identical, to the role of contingency in classical conditioning. This raises a twofold observation: first, there is no reason to suppose, as Kelley does, that the influence of this covariance principle must revert to a model of the human as a “naïve” scientist who thinks according to an ANOVA model (he probably only does so in the context of attribution experiments!). As noted above, the influence of contingency does not necessarily imply that the organism has any notion of contingency. In addition, there is the dilemma of moving from a correlation judgement to a causal judgement. A causal relation after all implies a correlation, but the reverse does not hold. Is a causal judgement only possible when one implicitly also has knowledge of the mechanisms that connect cause and effect? Or are other conditions necessary in addition to perfect correlation for two events to be perceived in a cause-effect relation? This dilemma brings us to the question posed by Michotte (1954): Is causality a phenomenal experience or a “post hoc” reflection? It is interesting to note in this regard that Testa (1974) relied on Michotte’s findings to explain flavour aversion.
- Saliency: “The notion here is that an effect is attributed to the cause that is most salient in the perceptual field at the time the effect is observed.” (Kelley & Michela, 1980, p. 466). This “saliency” is again a factor that plays a role in conditioning (see below).
- Similarity and Contiguity: The principle of contiguity does not require much explanation to be related to conditioning. Rescorla and Furrow (1977) convincingly demonstrated the role of “similarity”, which has always been seen as an associative principle, within a conditioning paradigm.
- Primacy: “The general notion here is that a person scans and interprets a sequence of information until he attains an attribution from it and then disregard later information or assimilates it to his earlier impression” (Kelley & Michela, 1980, p. 467). Conditioning literature analogies also exist for this. It for instance takes a long time for the animal to recognise a “random” relationship such as when a tone and a shock are “randomly” presented but this random series begun with a contingency between both events. The reverse is also true: a contingency is also learned with difficulty in the case of a random start and subsequent contingency (Alloy & Seligman, 1979).
Conditioning literature parallels also exist for the interaction between the nature of the information on the one hand and the existing “beliefs” or causal models and the motivational component (Figure 2) on the other. The “blocking” phenomenon can be considered a causal model that interferes with the learning of other causal relations: both humans and animals do not look for every possible cause but instead suffice themselves with one sufficient cause. In addition, the motivational component has always been central to conditioning. To explain this using anthropomorphic terms, the animal only asks itself a why-question when something important occurs.
We here touch on a point that made us relate the notion of attribution to findings on conditioning. We prefer the term attribution over the term association to denote what happens during conditioning. Not only because “association” is a historically heavily charged concept, but because the term does not permit a distinction between the propositions “event X reminds me of event Y” and “I ascribe event X to event Y”. To again illustrate this using Seligman’s example: when he becomes nauseous, he can perfectly remember the full dinner event, but only one relation, one “attribution” is made with the béarnaise sauce. Remembrance is a necessary but not a sufficient condition to establish a causal relation between two events. Winograd (1971) described this as follows: “Let us imagine that I emerge from my house in the morning and find a flat tire on my car. It occurs to me immediately that around nine o’clock the previous evening, while driving home, I heard a disturbingly loud noise as I drove over something in the road. Now, 12 hours later, I “associate” the flat tire with the impact. This is not an association in the usual S-R contiguity sense; rather, I have related two events which were separated by a long period of time. I can do this only if I have a record of the earlier event, or memory. In fact, I have many memories of previous events, and that is the problem. The question to be dealt with is one of trace selection or contact, of how I have related these two particular events” (Winograd, 1971, p. 272–273). This, precisely, is the dilemma posed by the Garcia effect, looked at from a different perspective. It is why use of the term “attribution” rather than the term “association” becomes even more imperative when we keep the phenomenon of flavour aversion in mind. As Revusky and Garcia write, “Probably, the rat can really associate these events, but will not attribute the production of shock to the flavored water. In other words, a rat can learn that consumption of flavored water precedes shock, but will not readily learn that consumption of flavoured water produces shock” (Revusky & Garcia, 1970, p. 41). A bit further, both authors write: “This paper would probably be more precise if, whenever the term “association” is used, “attribution” were to be substituted” (p. 43). Does, after this discussion, it still seems absurd that an animal responds “as if” it were making an attribution?
Of course, a change in terminology is only a pseudo-solution to a dilemma that has been key since Pavlov’s dog: How is a relation between two events learned? This contribution offers only limited insight into this question, and it is quite fortunate that the effective learning of such relations does not depend on its explanation. But the search for such an explanation becomes imperative the moment it is established that the learning of relations fails. Because this probably constitutes a true breeding ground for human and animal suffering: the inability to explain an important event.
Finally, this contribution might foster the impression that contemporary conditioning psychology tries to anthropomorphise the rat too much, when in the past humans were seen too much as rats. But a rat is a rat and a human a human. Nevertheless, it does not seem very fruitful to me to hermetically seal off both study domains. Whereas Estes notes that “the thought arises that the processes and mechanisms of human cognition represent specializations and elaborations of processes and mechanisms which can advantageously be studied in animals that learn as well as in machines that think” (Estes, 1975, p. 6), this contribution was written from the conviction that Estes’ first alternative continues to be valuable. For as long as a computer does not salivate upon seeing a chunk of meat, “Pavlov’s dog” continues to be a fascinating phenomenon.