Embodied cognition is the theory that many features of cognition, whether human or otherwise, are shaped by aspects of the entire body of the organism. The features of cognition include high level mental constructs (such as concepts and categories) and performance on various cognitive tasks (such as reasoning or judgment). The aspects of the body include the motor system, the perceptual system, bodily interactions with the environment (situatedness) and the assumptions about the world that are built into the structure of the organism.
The embodied mind thesis challenges other theories, such as cognitivism, computationalism, and Cartesian dualism. It is closely related to the extended mind thesis, situated cognition and enactivism. The modern version depends on insights drawn from recent research in psychology, linguistics, cognitive science, dynamical systems, artificial intelligence, robotics, animal cognition, plant cognition and neurobiology.
The embodied mind thesis challenges other theories, such as cognitivism, computationalism, and Cartesian dualism. It is closely related to the extended mind thesis, situated cognition and enactivism. The modern version depends on insights drawn from recent research in psychology, linguistics, cognitive science, dynamical systems, artificial intelligence, robotics, animal cognition, plant cognition and neurobiology.
Embodiment thesisIn philosophy, embodied cognition holds that an agent's cognition is strongly influenced by aspects of an agent's body beyond the brain itself. In their proposal for an enactive approach to cognition Varela et al. defined "embodied":
- "By using the term embodied we mean to highlight two points:
first that cognition depends upon the kinds of experience that come from
having a body with various sensorimotor capacities, and second, that
these individual sensorimotor capacities are themselves embedded in a
more encompassing biological, psychological and cultural context."
- — Eleanor Rosch, Evan Thompson, Francisco J. Varela: The Embodied Mind: Cognitive Science and Human Experience pages 172–173
- "In viewing cognition as embedded or situated, embodied cognitive
science emphasizes feedback between an agent and the world. We have seen
that this feedback is structured by the nature of an agent's
body...This in turn suggests that agents with different kinds of bodies
can be differentiated in terms of degrees of embodiment...Embodiment can
be defined as the extent to which an agent can alter its environment."
[Citations have been omitted]
- — Michael Dawson: Degrees of embodiment; The Routledge Handbook of Embodied Cognition, page 62
- "It would be a mistake, however, to suppose that cognition consists
simply of building maximally accurate representations of input
information...the gaining of knowledge is a stepping stone to achieving
the more immediate goal of guiding behavior in response to the system's
- — Marcin Miłkowski: Explaining the Computational Mind, p. 4
- Embodiment thesis: Many features of cognition are
embodied in that they are deeply dependent upon characteristics of the
physical body of an agent, such that the agent's beyond-the-brain body
plays a significant causal role, or a physically constitutive role, in
that agent's cognitive processing.
- —RA Wilson and L Foglia, Embodied Cognition in the Stanford Encyclopedia of Philosophy
Philosophical backgroundIn his Universal Natural History and Theory of Heaven (1755). philosopher Immanuel Kant advocated a view of the mind–body problem and the subject–object problem with parallels to the embodied view. Some difficulties with this interpretation of Kant include (i) the view that Kant holds the empirical, and specifically knowledge of the body, cannot support a priori transcendental claims, and (ii) the view that Kant holds that transcendental philosophy, although charged with the responsibility of explaining how we can have empirical knowledge, is not itself empirical.
José Ortega y Gasset, George Santayana, Miguel de Unamuno, Maurice Merleau-Ponty, Martin Heidegger and others in the broadly existential tradition have proposed philosophies of mind influencing the development of the modern 'embodiment' thesis.
The embodiment movement in artificial intelligence has fueled the embodiment argument in philosophy and a revised view of ethology:
- "Species-typical activity patterns must be thought of as emergent
phenomena in three different senses of the word. They have
emerged...through natural selection, ....by a process of maturation
and/or learning, ...and from interactions between the creature's
low-level activities and its species-typical environment."
- —Horst Hendriks-Jansen Catching Ourselves in the Act, p. 10
Connections with the sciencesEmbodied cognition is a topic of research in social and cognitive psychology, covering issues such as social interaction and decision-making. Embodied cognition reflects the argument that the motor system influences our cognition, just as the mind influences bodily actions. For example, when participants hold a pencil in their teeth engaging the muscles of a smile, they comprehend pleasant sentences faster than unpleasant ones, while holding a pencil between their nose and upper lip to engage the muscles of a frown has the reverse effect.
George Lakoff (a cognitive scientist and linguist) and his collaborators (including Mark Johnson, Mark Turner, and Rafael E. Núñez) have written a series of books promoting and expanding the thesis based on discoveries in cognitive science, such as conceptual metaphor and image schema.
Robotics researchers such as Rodney Brooks, Hans Moravec and Rolf Pfeifer have argued that true artificial intelligence can only be achieved by machines that have sensory and motor skills and are connected to the world through a body. The insights of these robotics researchers have in turn inspired philosophers like Andy Clark and Horst Hendriks-Jansen.
Neuroscientists Gerald Edelman, António Damásio and others have outlined the connection between the body, individual structures in the brain and aspects of the mind such as consciousness, emotion, self-awareness and will. Biology has also inspired Gregory Bateson, Humberto Maturana, Francisco Varela, Eleanor Rosch and Evan Thompson to develop a closely related version of the idea, which they call enactivism. The motor theory of speech perception proposed by Alvin Liberman and colleagues at the Haskins Laboratories argues that the identification of words is embodied in perception of the bodily movements by which spoken words are made. In related work at Haskins, Paul Mermelstein, Philip Rubin, Louis Goldstein, and colleagues developed articulatory synthesis tools for computationally modeling the physiology and aeroacoustics of the vocal tract, demonstrating how cognition and perception of speech can be shaped by biological constraints. This was extended into the audio-visual domain by the "talking heads" approach of Eric Vatikiotis-Bateson, Rubin, and other colleagues.
More detail is provided in the sections that follow.
One embodied cognition study shows that action intention can affect processing in visual search, with more orientation errors for pointing than for grasping. Participants either pointed to or grasped target objects of 2 colors and 2 orientations (45° and 135°). There were randomized numbers of distractors as well (0, 3, 6, or 9), which differed from the target in color, orientation, or both. A tone sounded to inform participants which target orientation to find. Participants kept their eyes on a fixation point until it turned from red to the target color. The screen then lit up and the participants searched for the target, either pointing to it or grasping it (depending on the block). There were 2 blocks for pointing and 2 for grasping, with the order counterbalanced. Each block had 64 trials.
Results from the experiment show that accuracy decreases with an increase in the number of distractors. Overall, participants made more orientation errors than color errors. There was no main effect of accuracy between the pointing and grasping conditions, but participants made significantly fewer orientation errors in the grasping condition than in the pointing condition. Color errors were the same in both conditions. Because orientation is important in grasping an object, these results fit with the researchers' hypothesis that the plan to grasp an object will aid in orientation accuracy. This supports embodied cognition because action intention (planning to grasp an object) can affect visual processing of task-relevant information (orientation).
Distance perceptionInternal states can affect distance perception, which relates to embodied cognition. Researchers randomly assigned college student participants to high-choice, low-choice, and control conditions. The high-choice condition signed a “freedom of choice” consent form indicating their decision to wear a Carmen Miranda costume and walk across a busy area of campus. Low-choice participants signed an “experimenter choice” consent form, indicating the experimenter assigned the participant to wear the costume. A control group walked across campus but did not wear a costume. At the conclusion of the experiment, each participant completed a survey which asked them to estimate the distance they walked.
The high-choice participants perceived the distance walked as significantly shorter than participants in the low-choice and control groups, even though they walked the same distance. The manipulation caused high-choice participants to feel responsible for the choice to walk in the embarrassing costume. This created cognitive dissonance, which refers to a discrepancy between attitudes and behaviors. High-choice participants reconciled their thoughts and actions by perceiving the distance as shorter. These results show the ability of internal states to affect perception of physical distance moved, which illustrates the reciprocal relationship of the body and mind in embodied cognition.
Researchers have found that when making judgements about objects in photographs, people will take the perspective of a person in the picture instead of their own. They showed college undergraduate participants 1 of 3 photographs and asked where 1 object in the picture was compared to the other object. For example, if the 2 objects were an apple and a banana, the participants would have to respond to a question about the location of the apple compared to the banana. The photographs either had no person, a person looking at the object, in this case the banana, or a person reaching for the banana. The photograph and question appeared in a larger set of questionnaires not related to the study.
Results show that participants who viewed photographs that included a person were significantly more likely to respond from another's perspective than those who saw photographs with no person. There were no differences in perspective of responses for the person looking versus reaching. Participants who saw the scene without a person were significantly more likely to respond from their own perspective. This means that the presence of a person in the photograph affected the perspective used even though the question focused solely on the two objects. The researchers state that these results suggest disembodied cognition, in which the participants put themselves into the body of the person in the photograph.
Language comprehensionSome researchers extend embodied cognition to include language. They describe language as a tool that aids in broadening our sense of body. For instance, when asked to identify “this” object, participants most often choose an object near to them. Conversely, when asked to identify “that” object, participants choose an object further away from them. Language allows us to distinguish between distances in more complex ways than the simple perceptual difference between near and far objects.
The motor system is involved in language comprehension, in this case when sentences were performable by a human, there was a change in participants' overall movement of a pendulum. Researchers performed an experiment in which college undergraduate participants swung a pendulum while completing a "sentence judgement task." Participants would swing the pendulum with both hands for 10 seconds before a prompt and then a sentence would appear on the screen until the participant responded. In the control condition, participants swung the pendulum without performing the "sentence judgement task." Each trial had half "plausible" and half "implausible" sentences. The "plausible" sentences made sense semantically, while the "implausible" ones did not. The "performable" sentences could be performed by a human, while the "inanimate" sentences could not. Participants responded by saying "yes" to the "plausible" sentences.
Results show a significant "relative phase shift," or overall change in movement of the swinging pendulum, for the "performable" sentences. This change did not occur for "inanimate" sentences or the control condition. The researchers did not expect an overall phase shift, instead they expected a change in the variability of movement, or the "standard deviation of relative phase shift." Although not entirely expected, these results support embodied cognition and show that the motor system is involved in the understanding of language. The researchers suggest that the nature of this relationship needs to be further studied to determine the exact correlation this task has to bi-manual motor movements.
Embodiment effects emerge in the way in which people of different sex and temperament perceive verbal material, such as common adjectives and abstract and neutral nouns. Trofimova, who first described this phenomenon in her experiments, called it "projection through capacities". This phenomenon emerges when people's lexical perception depends upon their capacities to handle the events; when their information processing registers mostly those aspects of objects or of a situation that they can properly react to and deal with according to their inherent capacities. For example, in these studies males with stronger motor-physical endurance estimated abstractions describing people-, work/reality- and time-related concepts in more positive terms than males with a weaker endurance. Females with stronger social or physical endurance estimated social attractors in more positive terms than weaker females. Both male and female temperament groups with higher sociability showed a universal positive bias in their estimations of neutral words, especially for social and work/reality-related concepts, in comparison to participants with lower sociability. Capacities related to the tempo of activities also appeared to impact the perception of lexical material: men with faster motor-physical tempo estimated neutral, abstract time-related concepts significantly in more positive terms than men with slower tempo.
MemoryA study examining memory and embodied cognition illustrates that people remember more of the gist of a story when they physically act it out. Researchers divided female participants randomly into 5 groups, which were "Read Only," "Writing," "Collaborative Discussion," "Independent Discussion," and "Improvisation." All participants received a monologue about teen addiction and were told to pay attention to details about the character and action in the monologue. Participants were given 5 minutes to read the monologue twice, unaware of a future recall test. In the "Read Only" condition participants filled out unrelated questionnaires after reading the monologue. In the "Writing" condition participants responded to 5 questions about the story from the perspective of the character in the monologue. They had 6 minutes to answer each question. In the "Collaborative Discussion" condition participants responded from the character's perspective to the same questions as the "Writing" group, but in groups of 4 or 5 women. They were also given 6 minutes per question and everyone participated in answering each question. The "Independent Discussion" condition was the same as the "Collaborative Discussion," except 1 person answered each question. In the "Improvisation" condition participants acted out 5 scenes from the monologue in groups of 5 women. The researchers suggest that this condition involves embodied cognition and will produce better memory for the monologue. Every participant played the main character and a supporting character once. Participants were given short prompts from lines in the monologue, which were excluded from the memory test. Participants had 2 minutes to choose characters and 4 minutes for improvisations. The recall test was the monologue with 96 words or phrases missing. Participants had to fill in the blanks as accurately as possible.
Researchers gave the recall test to a group who did not read the monologue. They scored significantly lower than the other groups, which indicated that guessing was not easy. In coding the answers to the recall test, exact words were labeled "Verbatim", and correct content but varied wording was labeled "Gist". The combination of "Verbatim" and "Gist" was called "Total Memory." The "Improvisational" group had more "Gist" memories than any other group and had more "Total Memory" than both of the discussion groups. The results fit the researchers' hypothesis that the "Improvisational" group would remember more because they actively rehearsed the information from the monologue. Although other groups had also elaborately encoded the information, the "Improvisation" group remembered significantly more than the discussion groups and marginally more than the "Reading Only" and "Writing" groups. Simply experiencing the monologue in an active way aids in remembering the "Gist." There were no differences across groups for "Verbatim" memory, which they suggest could take longer than the limited time during the experiment to develop.
ReasoningA series of experiments demonstrated the interrelation between motor experience and high-level reasoning. For example, although most individuals recruit visual processes when presented with spatial problems such as mental rotation tasks motor experts favor motor processes to perform the same tasks, with higher overall performance. A related study showed that motor experts use similar processes for the mental rotation of body parts and polygons, whereas non-experts treated these stimuli differently. These results were not due to underlying confounds, as demonstrated by a training study which showed mental rotation improvements after a one-year motor training, compared with controls. Similar patterns were also found in working memory tasks, with the ability to remember movements being greatly disrupted by a secondary verbal task in controls and by a motor task in motor experts, suggesting the involvement of different processes to store movements depending on motor experience, namely verbal for controls and motor for experts.
Approach and avoidance
In research focused on the approach and avoidance effect, people showed an approach effect for positive words. In the "positive toward condition," participants moved positive words toward the center of the screen and negative words away. In the "negative toward condition," participants moved negative words toward the center and positive words away. Participants were given feedback about their accuracy at the end of each of the 4 experimental blocks. In the first experiment the word at the center of the screen had a positive valence, while in the second experiment the central word had a negative valence. In the third experiment, the center of the screen had an empty box.
As predicted, in the first experiment participants in the "positive toward condition" responded significantly faster than those in the "negative toward condition." This fits the approach/avoidance effect in embodied cognition, which states that people are faster to approach positive things and avoid negative ones. In the second experiment, researchers expected participants in the "negative toward condition" to be faster, yet those in the "positive toward condition" responded significantly faster. Although effects were smaller in the third experiment, participants in the "positive toward condition" were still faster. Overall, people were faster in the "positive toward condition," regardless of the valence of the central word. Despite mixed results regarding the researchers' expectations, they maintain that the motor system is important in processing higher level representations such as the action goal. In this study, participants showed strong approach effects in the "positive toward condition," which supports embodied cognition.
As part of a larger study, researchers separated participants into 5 groups with different instructions. In the "approach" condition, participants were instructed to imagine physically moving the product toward them, but in the "avoid" condition, participants had to imagine moving the product away from them. In the "control" condition, participants were instructed to simply observe the product. The "correction" condition involved the same instructions as the approach condition, except participants were told that the body can affect judgment. In the "approach information" condition, participants had to list 5 reasons why they would obtain the product. After viewing a picture of an aversive product, participants rated on a scale of 1 to 7 how desirable the product was and how much they approached of or avoided the product. They also provided how much they would pay for the product.
An approach/avoidance effect was found in relation to product evaluation. Participants in the "approach" condition liked the aversive product significantly more and would pay more for it. There were no differences between the "avoidance," "control," "correction," and "approach information" conditions. Simulation of approach can affect liking and willingness to pay for a product, but the effect can be reversed if the person knows about this influence. This supports embodied cognition.
Self-regulationAs part of a larger study, one experiment randomly assigned college undergraduates to 2 groups. In the "muscle-firming" condition participants grasped a pen in their hand, while in the "control" condition participants held the pen in their fingers. The participants were then asked to fill out donations to Haiti for the Red Cross in sealed envelopes. They were told to return the envelope regardless of whether they donated. They also filled out questionnaires about their feelings about the Red Cross, their tendency to donate, their feelings about Haiti, what they thought the purpose of the study was, etc.
Significantly more participants in the "muscle-firming" condition than in the "control" condition donated money. Condition did not affect the actual amount donated when participants chose to donate. As the researchers predicted, the "muscle-firming" condition helped participants get over their physical aversion to viewing the devastation in Haiti and spend money. Muscle-firming in this experiment may also be related to an increase in self-control, suggesting embodied cognition can play a role in self-regulation.
Another set of studies was conducted by Shalev (2014), indicating that exposure to physical or conceptual thirst or dryness-related cues influence perceived energy and reduce self-regulation. In Study 1, participants primed with dryness-related concepts reported greater physical thirst and tiredness and lower subjective vitality. In Study 2, participants who were physically thirstywere less persistent in investing effort in an unsolvable anagram task. In Study 3, images of arid land influenced time preference regarding when to begin preparation to make a monetary investment. Finally, in Studies 4a and 4b, exposure to the names of dryness-related products influenced impressions of the vitality of a target person.
Some suggest that the embodied mind serves self-regulatory processes by combining movement and cognition to reach a goal. Thus, the embodied mind has a facilitative effect. Some judgments, such as the emotion of a face, are detected more quickly when a participant mimics the facial expression that is being evaluated. Individuals holding a pen in their mouths to freeze their facial muscles and make them unable to mimic the expression were less able to judge emotions. Goal-relevant actions may be encouraged by embodied cognition, as evidenced by the automated approach and avoidance of certain environmental cues. Embodied cognition is also influenced by the situation. If one moves in a way previously associated with danger, the body may require a greater level of information processing than if the body moves in a way associated with a benign situation.
Some social psychologists examined embodied cognition and hypothesized that embodied cognition would be supported by embodied rapport. Embodied rapport would be demonstrated by pairs of same-sex strangers using Aron’s paradigm, which instructs participants to alternate asking certain questions and to progressively self-disclose. The researchers predicted that participants would mimic each other’s movements, reflecting embodied cognition. Half the participants completed a control task of reading and editing a scientific article, while half the participants completed a shortened version of Aron’s self-disclosure paradigm.
There is a significant correlation between self-disclosure and positive emotions towards the other participant. Participants randomly assigned to the self-disclosure task displayed more behavioral synchrony (rated by independent judges watching the tapes of each condition on mute) and reported more positive emotions than the control group. Since bodily movements influence the psychological experience of the task, the relationship between self-disclosure and positive feelings towards one's partner may be an example of embodied cognition.
Evolutionary viewEmbodied cognition may also be defined from the perspective of evolutionary psychologists. Evolutionary psychologists view emotion as an important self-regulatory aspect of embodied cognition, and emotion as a motivator towards goal-relevant action. Emotion helps drive adaptive behavior. The evolutionary perspective cites language, both spoken and written, as types of embodied cognition. Pacing and non-verbal communication reflect embodied cognition in spoken language. Technical aspects of written language, such as italics, all caps, and emoticons promote an inner voice and thereby a sense of feeling rather than thinking about a written message.
Cognitive science and linguisticsGeorge Lakoff and his collaborators have developed several lines of evidence that suggest that people use their understanding of familiar physical objects, actions and situations (such as containers, spaces, trajectories) to understand other more complex domains (such as mathematics, relationships or death). Lakoff argues that all cognition is based on knowledge that comes from the body and that other domains are mapped onto our embodied knowledge using a combination of conceptual metaphor, image schema and prototypes.
Conceptual metaphorLakoff and Mark Johnson showed that humans use metaphor ubiquitously and that metaphors operate at a conceptual level (i.e., they map one conceptual domain onto another), they involve an unlimited number of individual expressions and that the same metaphor is used conventionally throughout a culture. Lakoff and his collaborators have collected thousands of examples of conceptual metaphors in many domains.
For example, people will typically use language about journeys to discuss the history and status of a love affair, a metaphor Lakoff and Johnson call "LOVE IS A JOURNEY". It is used in such expression as: "we arrived at a crossroads," "we parted ways", "we hit the rocks" (as in a sea journey), "she's in the driver's seat", or, simply, "we're together". In cases like these, something complex (a love affair) is described in terms of something that can be done with a body (travel through space).
PrototypesPrototypes are "typical" members of a category, e.g. a robin is a prototypical bird, but a penguin is not. The role of prototypes in human cognition was first identified and studied by Eleanor Rosch in the 1970s. She was able to show that prototypical objects are more easily categorized than non-prototypical objects, and that people answered questions about a category as a whole by reasoning about a prototype. She also identified basic level categories: categories that have prototypes that are easily visualized (such as a chair) and are associated with basic physical motions (such as "sitting"). Prototypes of basic level categories are used to reason about more general categories.
Prototype theory has been used to explain human performance on many different cognitive tasks and in a large variety of domains. George Lakoff argues that prototype theory shows that the categories that people use are based on our experience of having a body and have no resemblance to logical classes or types. For Lakoff, this shows that traditional objectivist accounts of truth cannot be correct.
A classic argument against embodiment in its strict form is based on abstract meaning. Whereas the meanings of the words ‘eye’ and ‘grasp’ can be explained, to a degree, by pointing to objects and actions, those of ‘beauty’ and ‘freedom’ cannot. It may be that some common sensorimotor knowledge is immanent in freeing actions or instantiations of beauty, but it seems likely that additional semantic binding principles are behind such concepts. So might it be necessary, after all, to place abstract semantics in an amodal meaning system? A remarkable observation has recently been offered that may be of the essence in this context: abstract terms show an over-proportionally strong tendency to be semantically linked to knowledge about emotions. This additional embodied–semantic link accounts for advantages in processing speed for abstract emotional terms over otherwise matched control words. In addition, abstract words strongly activate anterior cingulate cortex, a site known to be relevant for emotion processing Thus, it appears that at least some abstract words are semantically grounded in emotion knowledge.
If abstract emotion words indeed receive their meaning through grounding in emotion it is of crucial relevance Therefore, the link between an abstract emotion word and its abstract concept is via manifestation of the latter in prototypical actions. The child learns an abstract emotion word such as 'joy' because it shows JOY-expressing action schemas, which language-teaching adults use as criteria for correct application of the abstract emotion word Thus, the manifestation of emotions in actions becomes the crucial link between word use and internal state, and hence between sign and meaning. Only after a stock of abstract emotion words has been grounded in emotion-expressing action can further emotion terms be learnt from context.
Artificial intelligence and robotics
History of artificial intelligenceThe experience of AI research provides another line of evidence supporting the embodied mind thesis. In the early history of AI successes in programming high-level reasoning tasks such as chess-playing led to an unfounded optimism that all AI problems would be relatively quickly solved. These programs simulated intelligence using logic and high-level abstract symbols (an approach called Good old-fashioned AI). This "disembodied" approach ran into serious difficulties in the 1970s and 80s, as researchers discovered that abstract, disembodied reasoning was highly inefficient and could not achieve human-levels of competence on many simple tasks. Funding agencies (such as DARPA) withdrew funding because the field of AI had failed to achieve its stated objectives, leading to difficult period now known as the "AI winter". Many AI researchers began to doubt that high level symbolic reasoning could ever perform well enough to solve simple problems.
Rodney Brooks argued in the mid-80s that these symbolic approaches were failing because researchers did not appreciate the importance of sensorimotor skills to intelligence in general, and applied these principals to robotics (an approach he called "Nouvelle AI"). Another successful new direction was neural networks—programs based on the actual structures within human bodies that gave rise to intelligence and learning. In the 90s, statistical AI achieved high levels of success in industry without using any symbolic reasoning, but instead using probabilistic techniques to make "guesses" and improve them incrementally. This process is similar to the way human beings are able to make fast, intuitive choices without stopping to reason symbolically.
Moravec's paradoxMoravec's paradox is the discovery by artificial intelligence and robotics researchers that, contrary to traditional assumptions, high-level reasoning requires very little computation, but low-level sensorimotor skills require enormous computational resources. The principle was articulated by Hans Moravec (whence the name) and others in the 1980s.
As Moravec writes:
Encoded in the large, highly evolved sensory and motor portions of the human brain is a billion years of experience about the nature of the world and how to survive in it. The deliberate process we call reasoning is, I believe, the thinnest veneer of human thought, effective only because it is supported by this much older and much powerful, though usually unconscious, sensorimotor knowledge. We are all prodigious olympians in perceptual and motor areas, so good that we make the difficult look easy. Abstract thought, though, is a new trick, perhaps less than 100 thousand years old. We have not yet mastered it. It is not all that intrinsically difficult; it just seems so when we do it.
Approach to artificial intelligence
Solving problems of perception and locomotion directlyMany artificial intelligence researchers have argued that a machine may need a human-like body to think and speak as well as a human being. As early as 1950, Alan Turing wrote:
It can also be maintained that it is best to provide the machine with the best sense organs that money can buy, and then teach it to understand and speak English. That process could follow the normal teaching of a child. Things would be pointed out and named, etc. (Turing, 1950).Embodiment theory was brought into artificial intelligence most notably by Rodney Brooks who showed in the 1980s that robots could be more effective if they 'thought' (planned or processed) and perceived as little as possible. The robot's intelligence is geared towards only handling the minimal amount of information necessary to make its behavior be appropriate and/or as desired by its creator.
Others have argued that without taking into account both the architecture of the human brain, and embodiment, it is unrealistic to replicate accurately the processes which take place during language acquisition, comprehension, production, or during non-linguistic actions. There have thus been suggestions that while robots are far from isomorphic with humans, they could benefit from strengthened associative connections in the optimization of their processes and their reactivity and sensitivity to environmental stimuli, and in situated human-machine interaction, and that the concept of multisensory integration be extended to cover linguistic input and the complementary information combined from temporally coincident sensory impressions.
The embodied approach to AI has been given several names by different schools of researchers, including: Nouvelle AI (Brooks' term), Situated AI, Behavior based AI and Embodied cognitive science.
NeuroscienceOne source of inspiration for embodiment theory has been research in cognitive neuroscience, such as the proposals of Gerald Edelman concerning how mathematical and computational models such as neuronal group selection and neural degeneracy result in emergent categorization.
Rohrer (2005) discusses how both our neural and developmental embodiment shape both our mental and linguistic categorizations. The degree of thought abstraction has been found to be associated with physical distance which then affects associated ideas and perception of risk.
The embodied mind thesis is compatible with some views of cognition promoted in neuropsychology, such as the theories of consciousness of Vilayanur S. Ramachandran, Gerald Edelman, and Antonio Damasio.
The modeling work of cognitive neuroscientists such as Francisco Varela and Walter Freeman seeks to explain embodied and situated cognition in terms of dynamical systems theory and neurophenomenology, but rejects the idea that the brain uses representations to do so (a position also espoused by Gerhard Werner).
CriticismsResearch on embodied cognition is extremely broad, covering a wide range of concepts. Methods to study embodied cognition vary from experiment to experiment based on the operational definition used by researchers. There is much evidence for embodied cognition, although interpretation of results and their significance may be disputed. Researchers continue to search for the best way to study and interpret embodied cognition.
Infants as examplesSome criticize the notion that pre-verbal children provide an ideal channel for studying embodied cognition, especially embodied social cognition. It may be impossible to know when a pre-verbal infant is a "pure model" of embodied cognition, since infants experience dramatic changes in social behavior throughout development. A 9-month old has reached a different developmental stage than a 2-month old. Looking-time and reaching measures of embodied cognition may not represent embodied cognition since infants develop object permanence of objects they can see before they develop object permanence with objects they can touch. True embodied cognition suggests that children would have to first physically engage with an object to understand object permanence.
The response to this critique is that infants are "ideal models" of embodied cognition. Infants are the best models because they utilize symbols less than adults do. Looking-time could likely be a better measure of embodied cognition than reaching because infants have not developed certain fine motor skills yet. Infants may first develop a passive mode of embodied cognition before they develop the active mode involving fine motor movements.
Overinterpretation?Some criticize the conclusions made by researchers about embodied cognition. The pencil-in-teeth study is frequently cited as an example of these invalidly drawn conclusions. The researchers believed that the quicker responses to positive sentences by participants engaging their smiling muscles represented embodied cognition. However, opponents argue that the effects of this exercise were primed or facilitated by the engagement of certain facial muscles. Many cases of facilitative movements of the body may be incorrectly labeled as evidence of embodied cognition.
Six views of embodied cognitionThe following "Six Views of Embodied Cognition" are taken from Margaret Wilson:
- "Cognition is situated. Cognitive activity takes place in the context of a real-world environment, and inherently involves perception and action." One example of this is moving around a room while, at the same time, trying to decide where the furniture should go.
- "Cognition is time-pressured. We are 'mind on the hoof' (Clark, 1997), and cognition must be understood in terms of how it functions under the pressure of real-time interaction with the environment." When you're under pressure to make a decision, the choice that is made emerges from the confluence of pressures that you're under. In the absence of pressure, a decision may be made differently.
- "We off-load cognitive work onto the environment. Because of limits on our information-processing abilities (e.g., limits on attention and working memory), we exploit the environment to reduce the cognitive workload. We make the environment hold or even manipulate information for us, and we harvest that information only on a need-to-know basis." This is seen when people have calendars, agendas, PDAs, or anything to help them with everyday functions. We write things down so we can use the information when we need it, instead of taking the time to memorize or encode it into our minds.
- "The environment is part of the cognitive system. The information flow between mind and world is so dense and continuous that, for scientists studying the nature of cognitive activity, the mind alone is not a meaningful unit of analysis." This statement means that the production of cognitive activity does not come from the mind alone, but rather is a mixture of the mind and the environmental situation that we are in. These interactions become part of our cognitive systems. Our thinking, decision-making, and future are all impacted by our environmental situations.
- "Cognition is for action. The function of the mind is to guide action and things such as perception and memory must be understood in terms of their contribution to situation-appropriate behavior." This claim has to do with the purpose of perception and cognition. For example, visual information is processed to extract identity, location, and affordances (ways that we might interact with objects). A prominent anatomical distinction is drawn between the "what" (ventral) and "where" (dorsal) pathways in visual processing. However, the commonly labeled "where" pathway is also the "how" pathway, at least partially dedicated to action.
- "Off-line cognition is body-based. Even when decoupled from
the environment, the activity of the mind is grounded in mechanisms that
evolved for interaction with the environment – that is, mechanisms of
sensory processing and motor control." This is shown with infants or
toddlers best. Children utilize skills and abilities they were born
with, such as sucking, grasping, and listening, to learn more about the
environment. The skills are broken down into five main categories that
combine sensory with motor skills, sensorimotor functions. The five main
- Mental Imagery: Is visualizing something that is not currently present in your environment. For example, imagining a future activity, or recalling how many windows are on the first floor of a house you once lived in (even though you did not count them explicitly while living there).
- Working Memory: Short term memory
- Episodic Memory: Long term memory of specific events.
- Implicit Memory: means by which we learn certain skills until they become automatic for us. An example of this would be an adult brushing his/her teeth, or an expert race car driver putting the car in drive.
- Reasoning and Problem-Solving: Having a mental model of something will increase problem-solving approaches.