Figure 1: McGurk effect. Visual stimuli (a) are combined
with audio stimuli (b). Subjects will expect congruent
stimuli (b
1
), where visual and auditory clues are
concordant (e.g. happy face and happy voice). Non-
congruent stimuli (b
2
), where visual and auditory clues are
discordant (e.g. happy face and angry voice), will induce
distortions in either the visual or auditory perception (this
distortion is termed as a “McGurk effect”.
the brain by multi-modal stimuli, and to control if
such stimuli could induce reproducible changes in
EEG signal. Through the investigation of this
‘abnormal’ perceptual condition, we intend to reveal
the mechanisms of normal emotional judgment (how
one can distinguish the valence of emotions in a
given stimulus). The use of different valence stimuli
(neutral, aggressive, appeasing, etc.) will be
compared.
3 METHOD
We exposed two right handed, male subjects to
auditory, visual, or combined audio-visual stimuli.
Stimuli were presented for 2 sec, the subjects was
asked to answer afterwards within a 3 sec window,
and then had 5 sec of rest (one trial = 10 sec). Audio
stimuli consisted of voice recordings of the word
‘arigato’ (thank you) pronounced with three
different intonations (Angry - A, Happy - H or
Neutral - N). Visual stimuli consisted of faces of
women expressing the same emotional valences (A,
H or N), taken from the JACfee and JACNeuf
Japanese-Caucasian photo databases (Biehl et al.
1997). Audio-visual stimuli were composed using
either congruent combinations of faces and voices
(e.g. HxH) or non-congruent (e.g. AxH). The
experiment consisted in three different sessions:
• In the first session, the subjects were
exposed to visual stimuli only. Their task
was to judge if the face was neutral, angry,
or happy. 60 stimuli were presented in a
pre-decided random order, and so that two
consecutive emotions were always
different.
• In the second session, the subjects were
exposed to visual stimuli only. Their task
was to judge if the voice was neutral,
angry, or happy. 60 stimuli were presented
in a pre-decided random order, and so that
two consecutive emotions were always
different.
• In the third session, the subjects were
exposed to the combined audio-visual
stimuli. Their task was to judge if the
percept was neutral, angry, or happy. 60
stimuli were presented in a pre-decided
random order, and so that two consecutive
emotions were always different, and so that
the same number of trials occurred for all
possible pairs of stimuli.
In each of the three sessions, the task of the subjects
was to judge if the face was neutral, angry, or happy,
and to provide this response with a keyboard.
The data was collected with a 64-channel
Biosemi EEG system with active electrodes in a
shielded room. Sampling rate was fixed at 1024 Hz,
notch filter at 50 Hz and analog bandpass filter
between 0.5 and 100 Hz. The topographic
distributions of EEG signals (relative power) in the
theta (4-8 Hz), apha (8-12 Hz) and beta (12-25 Hz)
ranges was afterwards computed using the Welch
periodogram method (Welch, 1967) on the trials of
the third session.
4 RESULT
We compared the conditions stimuli (A or H) vs.
control (N), and congruent vs. non-congruent.
Topographic maps of EEG power differed between
those conditions on both subjects. Generally, the
difference is maximized for HxH vs. HxA (Figure
2), in other words, the non-congruent stimuli are
“more different” than the neutral stimuli. Significant
changes are observed in the alpha range in the
frontal and right temporal areas; and in the left
parietal area in the theta range (Figure 2, Figure 3).
These changes are specific to the McGurk effect
NCTA 2011 - International Conference on Neural Computation Theory and Applications
462