Your Brain Hears the Lips, Not the Voice
Watch a video of someone mouthing 'ga' over an audio of 'ba' and you will hear 'da' — even after you know the trick.
In 1976 the developmental psychologist Harry McGurk and his postdoc John MacDonald were studying how infants learn speech. By accident, they ran a tape with the audio of one syllable dubbed over a video of someone speaking a different one. Adults watching the result reported hearing a sound that was on neither track.
Their paper, "Hearing lips and seeing voices," appeared in Nature the same year. The classic demonstration: audio of someone saying "ba-ba" laid over silent video of the same person mouthing "ga-ga". Most English-speaking listeners hear "da-da". Close their eyes and the illusion vanishes; reopen them and it returns.
What makes the effect striking is its resistance to insight. Knowing the trick — even watching the audio and video files separately — does not stop the brain from reassembling them. Speech perception is not the auditory system passing sound up to higher cognition. It is a multimodal inference in which lip shape, jaw movement, and acoustic data are integrated before they reach awareness.
The effect is not universal. Reviews of cross-cultural studies show that English speakers fall for it more often than Japanese or Mandarin speakers. The gap is not fully explained, but face-watching norms, phoneme inventories, and the visual saliency of bilabial sounds are all candidates. Within any given language, individuals vary widely; some adults experience the fusion almost every time, others rarely.
The McGurk effect is now a routine demonstration in undergraduate classes and a target of replication audits. The 1976 result has held up across half a century of testing — within the populations where it was first found.
Make Recess yours.
Sign in to save the ones you loved, never see the same thing twice, and tell us what you want more of.