A Medley of Potpourri: Pattern recognition (psychology)

Sunday, November 10, 2024

Pattern recognition (psychology)

From Wikipedia, the free encyclopedia

In psychology and cognitive neuroscience, pattern recognition is a cognitive process that matches information from a stimulus with information retrieved from memory.

Pattern recognition occurs when information from the environment is received and entered into short-term memory, causing automatic activation of a specific content of long-term memory. An example of this is learning the alphabet in order. When a carer repeats "A, B, C" multiple times to a child, the child, using pattern recognition, says "C" after hearing "A, B" in order. Recognizing patterns allows anticipation of what is to come. Making the connection between memories and information perceived is a step in pattern recognition called identification. Pattern recognition requires repetition of experience. Semantic memory, which is used implicitly and subconsciously, is the main type of memory involved in recognition.

Pattern recognition is crucial not only to humans, but also to other animals. Even koalas, which possess less-developed thinking abilities, use pattern recognition to find and consume eucalyptus leaves. The human brain has developed more, but holds similarities to the brains of birds and lower mammals. The development of neural networks in the outer layer of the brain in humans has allowed for better processing of visual and auditory patterns. Spatial positioning in the environment, remembering findings, and detecting hazards and resources to increase chances of survival are examples of the application of pattern recognition for humans and animals.

There are six main theories of pattern recognition: template matching, prototype-matching, feature analysis, recognition-by-components theory, bottom-up and top-down processing, and Fourier analysis. The application of these theories in everyday life is not mutually exclusive. Pattern recognition allows us to read words, understand language, recognize friends, and even appreciate music. Each of the theories applies to various activities and domains where pattern recognition is observed. Facial, music and language recognition, and seriation are a few of such domains. Facial recognition and seriation occur through encoding visual patterns, while music and language recognition use the encoding of auditory patterns.

Theories

Template matching

Template matching theory describes the most basic approach to human pattern recognition. It is a theory that assumes every perceived object is stored as a "template" in long-term memory. Incoming information is compared to these templates to find an exact match. In other words, all sensory input is compared to multiple representations of an object to form one single conceptual understanding. The theory defines perception as a fundamentally recognition-based process. It assumes that everything we see, we understand only through past exposure, which then informs our future perception of the external world. For example, the letter A written in several different typefaces is recognized each time as the letter A, but not as B. This viewpoint is limited, however, in explaining how new experiences can be understood without being compared to an internal memory template.
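The exact-match idea can be sketched in a few lines of Python. This is only a toy illustration, not a model from the literature: the 3x3 glyph grids, the `TEMPLATES` dictionary, and the `match_template` function are all made up for the example.

```python
# Toy "long-term memory": each label maps to one stored 3x3 binary grid.
# The glyph shapes are invented for illustration only.
TEMPLATES = {
    "A": ((0, 1, 0),
          (1, 1, 1),
          (1, 0, 1)),
    "B": ((1, 1, 0),
          (1, 1, 1),
          (1, 1, 0)),
}


def match_template(stimulus):
    """Return the label whose stored template equals the stimulus exactly, else None."""
    for label, template in TEMPLATES.items():
        if stimulus == template:
            return label
    return None


seen_a = ((0, 1, 0), (1, 1, 1), (1, 0, 1))   # identical to the stored "A"
novel_a = ((0, 1, 0), (1, 1, 1), (1, 1, 1))  # a slightly different "A"

print(match_template(seen_a))   # A
print(match_template(novel_a))  # None
```

The second call returns `None` even though the stimulus differs from the stored "A" in a single cell, which mirrors the limitation noted above: strict template matching has no way to handle input it has not stored.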

Prototype matching

Unlike the exact, one-to-one, template matching theory, prototype matching instead compares incoming sensory input to one average prototype. This theory proposes that exposure to a series of related stimuli leads to the creation of a "typical" prototype based on their shared features. It reduces the number of stored templates by standardizing them into a single representation. The prototype supports perceptual flexibility, because unlike in template matching, it allows for variability in the recognition of novel stimuli. For instance, if a child had never seen a lawn chair before, they would still be able to recognize it as a chair because of their understanding of its essential characteristics as having four legs and a seat. This idea, however, limits the conceptualization of objects that cannot necessarily be "averaged" into one, like types of canines, for instance. Even though dogs, wolves, and foxes are all typically furry, four-legged, moderately sized animals with ears and a tail, they are not all the same, and thus cannot be strictly perceived with respect to the prototype matching theory.
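One way to picture an "average" prototype is as the mean of feature vectors, with a new stimulus assigned to the nearest prototype. The sketch below is a simplified illustration under that assumption; the four features, the example values, and the names `build_prototype` and `classify` are hypothetical and not part of the theory as stated.

```python
from math import dist

# Hypothetical features: (number of legs, has seat, has backrest, is foldable)
EXAMPLES = {
    "chair": [(4, 1, 1, 0), (4, 1, 1, 0), (3, 1, 0, 0)],
    "table": [(4, 0, 0, 0), (4, 0, 0, 1), (1, 0, 0, 0)],
}


def build_prototype(examples):
    """Average each feature across the stored examples of one category."""
    n = len(examples)
    return tuple(sum(values) / n for values in zip(*examples))


PROTOTYPES = {label: build_prototype(items) for label, items in EXAMPLES.items()}


def classify(stimulus):
    """Return the category whose prototype is nearest in Euclidean distance."""
    return min(PROTOTYPES, key=lambda label: dist(stimulus, PROTOTYPES[label]))


lawn_chair = (4, 1, 1, 1)  # never seen before: a foldable chair
print(classify(lawn_chair))  # chair
```

The lawn chair matches none of the stored examples exactly, yet it lands closest to the chair prototype. The same averaging is also where the approach struggles: categories such as dogs, wolves, and foxes would produce nearly identical prototypes under coarse features like these.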

Multiple discrimination scaling

Template and feature analysis approaches to the recognition of objects (and situations) have been reconciled in, and to some extent overtaken by, multiple discrimination theory. This theory states that, in any perceptual judgment, the amount of each salient feature in a test stimulus is recognized as lying at some distance from the amount of that feature in the template, with the distance measured in the universal unit of 50% discrimination (the objective performance "JND", or just-noticeable difference).

Recognition–by–components theory

Similar to feature–detection theory, recognition by components (RBC) focuses on the bottom-up features of the stimuli being processed. First proposed by Irving Biederman (1987), this theory states that humans recognize objects by breaking them down into basic 3D geometric shapes called geons (e.g., cylinders, cubes, and cones). An example is how we break down a common item like a coffee cup: we recognize the hollow cylinder that holds the liquid and a curved handle off the side that allows us to hold it. Even though not every coffee cup is exactly the same, these basic components help us recognize the consistency (or pattern) across examples. RBC suggests that there are fewer than 36 unique geons that, when combined, can form a virtually unlimited number of objects. To parse and dissect an object, RBC proposes we attend to two specific features: edges and concavities. Edges enable the observer to maintain a consistent representation of the object regardless of the viewing angle and lighting conditions. Concavities are where two edges meet and enable the observer to perceive where one geon ends and another begins.

The RBC principles of visual object recognition can be applied to auditory language recognition as well. In place of geons, language researchers propose that spoken language can be broken down into basic components called phonemes. For example, there are 44 phonemes in the English language.

Top-down and bottom-up processing

Top-down processing

Top-down processing refers to the use of background information in pattern recognition. It always begins with a person's previous knowledge and makes predictions based on that knowledge. Psychologist Richard Gregory estimated that about 90% of the information is lost on the way from the eye to the brain, which is why the brain must guess what the person sees based on past experiences. In other words, we construct our perception of reality, and these perceptions are hypotheses or propositions based on past experiences and stored information. The formation of incorrect propositions will lead to errors of perception such as visual illusions. Given a paragraph written in difficult handwriting, it is easier to understand what the writer wants to convey by reading the whole paragraph than by reading each word in isolation. The brain may be able to perceive and understand the gist of the paragraph due to the context supplied by the surrounding words.

Bottom-up processing

Bottom-up processing is also known as data-driven processing, because it originates with the stimulation of the sensory receptors. Psychologist James Gibson opposed the top-down model and argued that perception is direct, and not subject to hypothesis testing as Gregory proposed. He stated that sensation is perception and there is no need for extra interpretation, as there is enough information in our environment to make sense of the world in a direct way. His theory is sometimes known as the "ecological theory" because of the claim that perception can be explained solely in terms of the environment. An example of bottom-up processing involves presenting a flower at the center of a person's visual field. The sight of the flower and all the information about the stimulus are carried from the retina to the visual cortex in the brain. The signal travels in one direction.

Seriation

In psychologist Jean Piaget's theory of cognitive development, the third stage is called the Concrete Operational Stage. It is during this stage that the abstract principle of thinking called "seriation" is naturally developed in a child. Seriation is the ability to arrange items in a logical order along a quantitative dimension such as length, weight, or age. It is a general cognitive skill which is not fully mastered until after the nursery years. To seriate means to understand that objects can be ordered along a dimension, and to effectively do so, the child needs to be able to answer the question "What comes next?" Seriation skills also help to develop problem-solving skills, which are useful in recognizing and completing patterning tasks.

Piaget's work on seriation

Piaget studied the development of seriation along with Szeminska in an experiment where they used rods of varying lengths to test children's skills. They found that there were three distinct stages of development of the skill. In the first stage, children around the age of 4 could not arrange the first ten rods in order. They could make smaller groups of 2–4, but could not put all the elements together. In the second stage, where the children were 5–6 years of age, they could succeed in the seriation task with the first ten rods through trial and error, and could insert the other set of rods into the order in the same way. In the third stage, the 7- to 8-year-old children could arrange all the rods in order without much trial and error. These children used a systematic method: first looking for the smallest rod, then for the smallest among those remaining.
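The systematic method used by the oldest children, picking the smallest rod and then the smallest of what is left, is essentially what programmers call selection sort. The following Python sketch illustrates that strategy; the rod lengths and the function name `seriate` are invented for the example and do not come from the original experiment.

```python
def seriate(rods):
    """Order rod lengths by repeatedly picking the shortest remaining rod."""
    remaining = list(rods)  # copy, so the caller's list is left untouched
    ordered = []
    while remaining:
        shortest = min(remaining)   # "look for the smallest rod"
        remaining.remove(shortest)  # set it aside...
        ordered.append(shortest)    # ...and place it next in the row
    return ordered


rods = [7, 3, 9, 1, 5, 10, 2, 8, 6, 4]  # hypothetical lengths of ten rods
print(seriate(rods))  # [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
```

Each pass answers the question "What comes next?" with a single comparison across the remaining rods, which is why this approach needs no trial and error.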

Development of problem-solving skills

To develop the skill of seriation, which then helps advance problem-solving skills, children should be provided with opportunities to arrange things in order using the appropriate language, such as "big" and "bigger" when working with size relationships. They should also be given the chance to arrange objects in order based on the texture, sound, flavor and color. Along with specific tasks of seriation, children should be given the chance to compare the different materials and toys they use during play. Through activities like these, the true understanding of characteristics of objects will develop. To aid them at a young age, the differences between the objects should be obvious. Lastly, a more complicated task of arranging two different sets of objects and seeing the relationship between the two different sets should also be provided. A common example of this is having children attempt to fit saucepan lids to saucepans of different sizes, or fitting together different sizes of nuts and bolts.

Application of seriation in schools

To help build up math skills in children, teachers and parents can help them learn seriation and patterning. Young children who understand seriation can put numbers in order from lowest to highest. Eventually, they will come to understand that 6 is higher than 5, and 20 is higher than 10. Similarly, having children copy patterns or create patterns of their own, like ABAB patterns, is a great way to help them recognize order and prepare for later math skills, such as multiplication. Child care providers can begin exposing children to patterns at a very young age by having them make groups and count the total number of objects.
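An ABAB pattern can be thought of as a short unit that repeats, and recognizing the pattern amounts to finding that unit and using it to predict the next item. The sketch below shows one possible way to do this in Python; the function names `repeating_unit` and `next_item` are hypothetical, and the approach is an illustration rather than a description of how children actually solve the task.

```python
def repeating_unit(sequence):
    """Return the shortest prefix that, when repeated, reproduces the sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    n = len(sequence)
    for size in range(1, n + 1):
        unit = sequence[:size]
        # The prefix is a valid unit if every item equals the unit item
        # at the same position within the repeat.
        if all(sequence[i] == unit[i % size] for i in range(n)):
            return unit
    return sequence  # not reached: the full sequence always reproduces itself


def next_item(sequence):
    """Predict the item that continues the repeating pattern."""
    unit = repeating_unit(sequence)
    return unit[len(sequence) % len(unit)]


print(repeating_unit("ABABAB"))  # AB
print(next_item("ABABA"))        # B
print(next_item("AABAAB"))       # A
```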

Facial pattern recognition

Recognizing faces is one of the most common forms of pattern recognition. Humans are extremely effective at remembering faces, but this ease and automaticity belies a very challenging problem. All faces are physically similar. Faces have two eyes, one mouth, and one nose all in predictable locations, yet humans can recognize a face from several different angles and in various lighting conditions.

Neuroscientists posit that recognizing faces takes place in three phases. The first phase starts with visually focusing on the physical features. The facial recognition system then needs to reconstruct the identity of the person from previous experiences. This provides us with the signal that this might be a person we know. The final phase of recognition completes when the face elicits the name of the person.

Although humans are great at recognizing faces under normal viewing angles, upside-down faces are tremendously difficult to recognize. This demonstrates not only the challenges of facial recognition but also how humans have specialized procedures and capacities for recognizing faces under normal upright viewing conditions.

Neural mechanisms

Scientists agree that there is a certain area in the brain specifically devoted to processing faces. This structure is called the fusiform gyrus, and brain imaging studies have shown that it becomes highly active when a subject is viewing a face.

Several case studies have reported that patients with lesions or tissue damage localized to this area have tremendous difficulty recognizing faces, even their own. Although most of this research is circumstantial, a study at Stanford University provided conclusive evidence for the fusiform gyrus' role in facial recognition. In a unique case study, researchers were able to send direct signals to a patient's fusiform gyrus. The patient reported that the faces of the doctors and nurses changed and morphed in front of him during this electrical stimulation. Researchers agree this demonstrates a convincing causal link between this neural structure and the human ability to recognize faces.

Facial recognition development

Although in adults, facial recognition is fast and automatic, children do not reach adult levels of performance (in laboratory tasks) until adolescence. Two general theories have been put forth to explain how facial recognition normally develops. The first, general cognitive development theory, proposes that the perceptual ability to encode faces is fully developed early in childhood, and that the continued improvement of facial recognition into adulthood is attributed to other general factors. These general factors include improved attentional focus, deliberate task strategies, and metacognition. Research supports the argument that these other general factors improve dramatically into adulthood. Face-specific perceptual development theory argues that the improved facial recognition between children and adults is due to a precise development of facial perception. The cause for this continuing development is proposed to be an ongoing experience with faces.

Developmental issues

Several developmental issues manifest as a decreased capacity for facial recognition. Using what is known about the role of the fusiform gyrus, research has shown that impaired social development along the autism spectrum is accompanied by a behavioral marker, where these individuals tend to look away from faces, and a neurological marker, characterized by decreased neural activity in the fusiform gyrus. Similarly, those with developmental prosopagnosia (DP) struggle with facial recognition to the extent that they are often unable to identify even their own faces. Many studies report that around 2% of the world's population have developmental prosopagnosia, and that individuals with DP have a family history of the trait. Individuals with DP are behaviorally indistinguishable from those with physical damage or lesions on the fusiform gyrus, again implicating its importance to facial recognition. Even setting aside those with DP or neurological damage, there remains a large variability in facial recognition ability in the total population. It is unknown what accounts for these differences, and whether they stem from a biological or an environmental disposition. Recent research analyzing identical and fraternal twins showed that facial recognition ability was significantly more highly correlated in identical twins, suggesting a strong genetic component to individual differences in facial recognition ability.

Language development

Pattern recognition in language acquisition

Research by Frost et al. (2013) reveals that infant language acquisition is linked to cognitive pattern recognition. In contrast to classical nativist and behavioral theories of language development, scientists now believe that language is a learned skill. Studies at the Hebrew University and the University of Sydney both show a strong correlation between the ability to identify visual patterns and the ability to learn a new language. Children with high shape recognition showed better grammar knowledge, even when controlling for the effects of intelligence and memory capacity. This is supported by the theory that language learning is based on statistical learning, the process by which infants perceive common combinations of sounds and words in language and use them to inform future speech production.

Phonological development

The first step in infant language acquisition is to distinguish between the most basic sound units of their native language. This includes every consonant, every short and long vowel sound, and any additional letter combinations like "th" and "ph" in English. These units, called phonemes, are detected through exposure and pattern recognition. Infants use their "innate feature detector" capabilities to distinguish between the sounds of words. They split them into phonemes through a mechanism of categorical perception. Then they extract statistical information by recognizing which combinations of sounds are most likely to occur together, like "qu" or "h" plus a vowel. In this way, their ability to learn words is based directly on the accuracy of their earlier phonetic patterning.
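The statistical information described here is often illustrated with transitional probabilities: how likely one sound unit is to follow another. The Python sketch below estimates them from a stream of units. It is a simplified illustration only; the syllable stream, the made-up "words" bi-da-ku and pa-do-ti, and the function name `transition_probabilities` are assumptions for the example, not data from any study.

```python
from collections import Counter


def transition_probabilities(units):
    """Estimate P(next | current) from adjacent pairs in a stream of sound units."""
    pair_counts = Counter(zip(units, units[1:]))
    # Count each unit only where it has a successor (all but the last position).
    first_counts = Counter(units[:-1])
    return {
        pair: count / first_counts[pair[0]]
        for pair, count in pair_counts.items()
    }


# Hypothetical continuous stream built from two invented words.
stream = "bi da ku pa do ti pa do ti bi da ku bi da ku pa do ti".split()
probs = transition_probabilities(stream)

print(probs[("bi", "da")])  # 1.0: "da" always follows "bi" inside a word
print(probs[("ku", "pa")])  # 2/3: lower across a word boundary
```

In this toy stream, transitions inside a word are perfectly predictable while transitions across word boundaries are not, which is the kind of contrast that could let a learner guess where one word ends and the next begins.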

Grammar development

The transition from phonemic differentiation into higher-order word production is only the first step in the hierarchical acquisition of language. Pattern recognition is furthermore utilized in the detection of prosody cues, the stress and intonation patterns among words. Then it is applied to sentence structure and the understanding of typical clause boundaries. This entire process is reflected in reading as well. First, a child recognizes patterns of individual letters, then words, then groups of words together, then paragraphs, and finally entire chapters in books. Learning to read and learning to speak a language are based on the "stepwise refinement of patterns" in perceptual pattern recognition.

Music pattern recognition

Music provides deep and emotional experiences for the listener. These experiences become contents in long-term memory, and every time we hear the same tunes, those contents are activated. Recognizing the content by the pattern of the music affects our emotion. The mechanism that forms the pattern recognition of music and the experience has been studied by multiple researchers. The sensation felt when listening to our favorite music is evidenced by the dilation of the pupils, the increase in pulse and blood pressure, the streaming of blood to the leg muscles, and the activation of the cerebellum, the brain region associated with physical movement. While retrieving the memory of a tune demonstrates general recognition of musical pattern, pattern recognition also occurs while listening to a tune for the first time. The recurring nature of the metre allows the listener to follow a tune, recognize the metre, expect its upcoming occurrence, and figure out the rhythm. The excitement of following a familiar music pattern happens when the pattern breaks and becomes unpredictable. This following and breaking of a pattern creates a problem-solving opportunity for the mind that forms the experience. Psychologist Daniel Levitin argues that the repetitions, melodic nature and organization of this music create meaning for the brain. The brain stores information in an arrangement of neurons which retrieve the same information when activated by the environment. By constantly referencing information and additional stimulation from the environment, the brain constructs musical features into a perceptual whole.

The medial prefrontal cortex – one of the last areas affected by Alzheimer's disease – is the region activated by music.

Cognitive mechanisms

To understand music pattern recognition, we need to understand the underlying cognitive systems that each handle a part of this process. Various activities are at work in the recognition of a piece of music and its patterns. Researchers have begun to unveil the reasons behind the stimulated reactions to music. Montreal-based researchers asked ten volunteers who got "chills" listening to music to listen to their favorite songs while their brain activity was being monitored. The results show the significant role of the nucleus accumbens (NAcc) region, which is involved with cognitive processes such as motivation, reward, and addiction, in creating the neural arrangements that make up the experience. A sense of reward prediction is created by anticipation before the climax of the tune, which comes to a sense of resolution when the climax is reached. The longer the listener is denied the expected pattern, the greater the emotional arousal when the pattern returns. Musicologist Leonard Meyer used fifty measures of the fifth movement of Beethoven's String Quartet in C-sharp minor, Op. 131, to examine this notion. The stronger this experience is, the more vivid the memory it will create and store. This strength affects the speed and accuracy of retrieval and recognition of the musical pattern. The brain not only recognizes specific tunes, it also distinguishes among standard acoustic features, speech and music.

MIT researchers conducted a study to examine this notion. The results showed six neural clusters in the auditory cortex responding to the sounds. Four were triggered when hearing standard acoustic features, one specifically responded to speech, and the last exclusively responded to music. Researchers who studied the correlation between the temporal evolution of timbral, tonal and rhythmic features of music came to the conclusion that music engages the brain regions connected to motor actions, emotions and creativity. The research indicates that the whole brain "lights up" when listening to music. This amount of activity boosts memory preservation, and hence pattern recognition.

Recognizing patterns of music is different for a musician and a listener. Although a musician may play the same notes every time, the details of the frequency will always be different. The listener will recognize the musical patterns and their types despite the variations. These musical types are conceptual and learned, meaning they might vary culturally. While listeners are involved with recognizing (implicit) musical material, musicians are involved with recalling it (explicit).

A UCLA study found that when a person watches or hears music being played, neurons associated with the muscles needed for playing the instrument fire. Mirror neurons light up when musicians and non-musicians listen to a piece.

Developmental issues

Pattern recognition of music can build and strengthen other skills, such as musical synchrony, attentional performance, musical notation, and brain engagement. Even a few years of musical training enhances memory and attention levels. Scientists at the University of Newcastle conducted a study on patients with severe acquired brain injuries (ABIs) and healthy participants, using popular music to examine music-evoked autobiographical memories (MEAMs). The participants were asked to record their familiarity with the songs, whether they liked them and what memories they evoked. The results showed that the ABI patients had the highest MEAMs, and all the participants had MEAMs of a person, people or life period that were generally positive. The participants completed the task by utilizing pattern recognition skills. Memory evocation caused the songs to sound more familiar and well-liked. This research can be beneficial in rehabilitating patients with autobiographical amnesia who do not have a fundamental deficiency in autobiographical recall memory and who have intact pitch perception.

A study at the University of California, Davis mapped the brains of participants while they listened to music. The results showed links between brain regions and the autobiographical memories and emotions activated by familiar music. This study can explain the strong response of patients with Alzheimer's disease to music. This research can help such patients with pattern recognition-enhancing tasks.

False pattern recognition

The human tendency to see patterns that do not actually exist is called apophenia. Examples include the Man in the Moon; faces or figures in shadows, in clouds, and in patterns with no deliberate design, such as the swirls on a baked confection; and the perception of causal relationships between events which are, in fact, unrelated. Apophenia figures prominently in conspiracy theories, gambling, misinterpretation of statistics and scientific data, and some kinds of religious and paranormal experiences. Misperception of patterns in random data is called pareidolia. Recent research in neuroscience and cognitive science suggests that false pattern recognition can be understood within the paradigm of predictive coding.
