Home
This Title All WIREs
WIREs RSS Feed
How to cite this WIREs title:
WIREs Cogn Sci

Models of spoken‐word recognition

Full article on Wiley Online Library:   HTML PDF

Can't access this content? Tell your librarian.

All words of the languages we know are stored in the mental lexicon. Psycholinguistic models describe in which format lexical knowledge is stored and how it is accessed when needed for language use. The present article summarizes key findings in spoken‐word recognition by humans and describes how models of spoken‐word recognition account for them. Although current models of spoken‐word recognition differ considerably in the details of implementation, there is general consensus among them on at least three aspects: multiple word candidates are activated in parallel as a word is being heard, activation of word candidates varies with the degree of match between the speech signal and stored lexical representations, and activated candidate words compete for recognition. No consensus has been reached on other aspects such as the flow of information between different processing levels, and the format of stored prelexical and lexical representations. WIREs Cogn Sci 2012, 3:387–401. doi: 10.1002/wcs.1178

Figure 1.

Waveform and spectrogram of the utterance “The sun began to rise”. The horizontal axis represents time and the vertical axis amplitude for the waveform and frequency for the spectrogram, with greater energy being represented by darker shading. White spaces in the spectrogram correspond to breaks in the speech signal. The vertical dotted lines are aligned as closely as possible with word boundaries.

[ Normal View 59K | Magnified View 88K ]
Figure 2.

Recognition process of the word sun by TRACE. For every time slice the entire network is copied, for better visualisation this duplication is not depicted in the figure. Activation in the lower layers flows upwards to the higher levels to all nodes that incorporate the lower layer node. Activation from the word layer also flows back to the phoneme layer.

[ Normal View 28K | Magnified View 66K ]
Figure 3.

Recognition process of the word sun by Shortlist. For every time slice, a new shortlist (indicated with the gray box) is created, which is subsequently wired into the competition stage. For better visualisation this repetition is not depicted in the figure. Candidate words that overlap with each other at any position compete with one another. In this example, all candidate words would inhibit one another; however, for better visualisation not all inhibitory connections are shown.

[ Normal View 25K | Magnified View 59K ]
Figure 4.

Recognition process of the word sun by Fine‐Tracker. The acoustic signal is transformed into a sequence of feature vectors over time by a set of artificial neural networks. At the word layer, words are represented as feature vectors, for better visualisation they are depicted as phonemes in the figure. Fine‐Tracker's lexicon is implemented as a lexical tree, with ‘B’ as the beginning of the tree. Not all possible paths in the lexical tree are shown. Each node can be followed by multiple other nodes, indicated with the dotted arrows as examples. The input feature vectors and the lexical feature vectors are mapped onto one another using a probabilistic word search.

[ Normal View 41K | Magnified View 115K ]

Browse by Topic

Linguistics
blog comments powered by Disqus

Access to this WIREs title is by subscription only.

Recommend to Your
Librarian Now!

The latest WIREs articles in your inbox

Sign Up for Article Alerts

In the Spotlight

Konrad Körding

Konrad Körding

Konrad Körding is Assistant Professor of Physiology and Physical Medicine and Rehabilitation at the Rehabilitation Institute of Chicago, part of Northwestern University. Before joining Northwestern in 2006, Professor Körding worked in three different research groups, most recently in 2004-2005 at MIT, studying machine learning and hierarchical Bayesian models.


Professor Körding is a member of the Swiss Society for Neuroscience, the German Society for Neuroscience, the Society for Neuroscience (USA) and the Electronic Frontier Foundation.

Professor Körding’s current research with the Bayesian Behavior group aims to improve rehabilitation procedures through a greater understanding of motor learning. In order to do this the team studies how people move, and how these movements are affected by uncertainty.

Learn More

Twitter: WIREsCogScience Follow us on Twitter