Dear WAVE participants:
Thank you very much for your participation. The workshop was a great success. We will update this site shortly for the record of your participation.
( group photo (1 MB)) (Snap shots)
We appreciate, if you would send us some of photos of the WAVE workshop you shot. We will add then to the collection of snap shots. Thank you in advance for your help.
A specialized workshop on auditory aspects of speech perception and production will be held on the 10th and 11th April 2004. We hope to provide a comfortable setting for discussion between interested researchers in order to create and merge ideas towards a better understanding of this exciting and emerging research field. This can only be achieved by your time and participation in this workshop.
Introduction of a very high-quality speech analysis/modification/synthesis method STRAIGHT and a new auditory computational theory based on stabilized wavelet-Mellin transform has opened up interesting research questions on the basis of our auditory and speech perception. This workshop is intended to disseminate our ideas and to induce discussions on how to approach these questions. Important topics to be discussed are on vowel normalization and temporal dynamics of speech. This workshop is also an extension to our previous workshops under CREST's Auditory Brain Project and our special session in ICA 2004 .
The workshop is currently scheduled to start at 13:30 on 10th April and completes discussions at 12:00 on 11th April. This will allow plenty of time for discussions and extra personal activities after the workshop.
|April 10 (Sat.)|
|12:30||Hotel shuttle bus departing from JR Wakayama station|
|Session 1: Engineering perspective (Chair: Roy Patterson)|
|13:30 - 15:00||ISSUES to be discussed:
a) Computational basis of STRAIGHT and its basic problems.
b) The Uninvited Guest - Information's Role in Spoken Language
c) Perception of 'Singing-ness' and its application to
singing-voice synthesis from read speech
| Hideki Kawahara
|Free discussion (15:00 - 15:30)|
|Session 2: Biological constraints (Chair: Steven Greenberg)|
|15:30 - 17:00||ISSUES to be discussed:
a) Exploring human speech production mechanisms by MRI
c) Searching for phonological processing in the human brain using functional MRI
b) Dynamics of intergestural timing: prosodic and syllabic influences
|Free discussion (17:00 - 17:30)|
|Session 3: Modulation and percedption (Chair: Elliot Saltzman)|
|17:30 - 19:00||ISSUES to be discussed:
a) AGC, Envelope Fluctuations and Speech Intelligibility
b) The role of modulation filterbank in reverberantion-robust ASR
c) The case of the missing delay lines: large delays produced by crosschannel interaction
Alain de Chevengne
|Dinner & free time|
|April 11th (Sun.)|
|Breakfast (7:00 - 8:00)|
|Session 4: Scale as a new dimension (Chair: Hideki Kawahara)|
|9:00 - 10:30||ISSUES to be discussed:
a) The perception of speaker size in speech and instrument size in music indicates
that size is a natural and simple judgement for human listeners.
b) Identification of Scale-Modulated Vowel Sequences
c) Scale perception in class room environment
|10:30 - 11:50||General Discussions||Closing remarks (11:50 - 12:00)||Hideki Kawahara|
|13:00||Bus leaving to JR Wakayama station|
|13:30 - 17:30||Steering committee meeting|
There are PDFs as advance reading materials. (Password protected.)
Roy Patterson (CNBH, Cambridge University, UK), Brian C. J. Moore(Cambridge University, UK), Steven Greenberg (The Speech Institute, USA), Alain de Cheveigne (IRCAM, France), Elliot Saltzman (Boston University, USA), Guy Brown (Sheffield University, UK), Stefan Uppenkamp (University Oldenburg, Germany), Kiyoshi Honda (ATR, Japan) Minoru Tsuzaki (ATR, Japan) Masato Akagi (JAIST, Japan), Masashi Unoki (JAIST, Japan), Parham Zolfaghari (NTT, Japan), Keiichi Yasu (Sophia University, Japan)
Hideki Kawahara, Toshio Irino, Hideki Banno, Ryuichi Nishimura, Toru Takahashi, Akiko Fukuyasu , Kyoko Fujita
A resort accommodation called "Futago shima soh" (Twin Islands Villa), which is located next to a famous viewing park "Banndoko Teien" (Lookout Garden) , is reserved for isolation and concentration with some excursions.
Going to the venue:
A limited express "Super Kuroshio" leaves from JR Kyoto station at 9:34 AM and bound for Wakayama JR station. (Expected arrival time is 11:01.) A hotel shuttle bus will depart from EAST gate (The primary gate is west. EAST gate is the smaller one.) of JR Wakayama station at 12:30. (Please take a lunch at Wakayama station.)
Please note that the departing time from JR Kyoto station was changed from 10:32 to 9:34. Don't miss it.
Most of invited speakers are expected to get on the same train.
Leaving from the venue:
Wakayama University established its Faculty of Systems Engineering on
October 1995. A group of researchers working in auditory, speech and
vision research fields were recruited from ATR, NTT, Kyoto University,
Nagoya University and NAIST to form a COE project to lead its research
and development activities. This workshop is one of the projects'
activities expected to be held on a regular basis. For this first
occasion, the acronym "WAVE" stands for "Wakayama workshop on Auditory
and Vocal rEsearch". Depending on the topic, the name of the workshop
can change. We are quite flexible in accepting your suggestions for
topics of coming workshops. (This workshop is sponsored by Wakayama
For further information, please send an e-mail to: Secritariat