SP-P4: Speech Synthesis II |
| Session Type: Poster |
| Time: Wednesday, March 17, 10:00 - 12:00 |
| Location: Poster Area A |
| Session Chair: Tomoki Toda, NAIST, Japan |
| SP-P4.1: UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS |
| Keiichiro Oura; Nagoya Institute of Technology |
| Keiichi Tokuda; Nagoya Institute of Technology |
| Junichi Yamagishi; University of Edinburgh |
| Simon King; University of Edinburgh |
| Mirjam Wester; University of Edinburgh |
| SP-P4.2: A COMPARISON OF SUPERVISED AND UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION APPROACHES FOR HMM-BASED SPEECH SYNTHESIS |
| Hui Liang; Idiap Research Institute and Ecole Polytechnique Fédérale de Lausanne (EPFL) |
| John Dines; Idiap Research Institute |
| Lakshmi Saheer; Idiap Research Institute and Ecole Polytechnique Fédérale de Lausanne (EPFL) |
| SP-P4.3: CROSS-VALIDATION BASED DECISION TREE CLUSTERING FOR HMM-BASED TTS |
| Yu Zhang; Shanghai Jiao Tong University |
| Zhi-Jie Yan; Microsoft Research Asia |
| Frank K.Soong; Microsoft Research Asia |
| SP-P4.4: IMPROVED MODELING FOR F0 GENERATION AND V/U DECISION IN HMM-BASED TTS |
| Qingqing Zhang; Institute of Acoustics, Chinese Academy of Sciences |
| Frank Soong; Microsoft Research Asia |
| Yao Qian; Microsoft Research Asia |
| Zhijie Yan; Microsoft Research Asia |
| Jielin Pan; Institute of Acoustics, Chinese Academy of Sciences |
| Yonghong Yan; Institute of Acoustics, Chinese Academy of Sciences |
| SP-P4.5: SIMPLE METHODS FOR IMPROVING SPEAKER-SIMILARITY OF HMM-BASED SPEECH SYNTHESIS |
| Junichi Yamagishi; University of Edinburgh |
| Simon King; University of Edinburgh |
| SP-P4.6: AN AUTOENCODER NEURAL-NETWORK BASED LOW-DIMENSIONALITY APPROACH TO EXCITATION MODELING FOR HMM-BASED TEXT-TO-SPEECH |
| Srikanth Vishnubhotla; University of Maryland |
| Raul Fernandez; IBM Research |
| Bhuvana Ramabhadran; IBM Research |
| SP-P4.7: KALMAN FILTER BASED SPEECH SYNTHESIS |
| Carl Quillen; MIT Lincoln Laboratory |
| SP-P4.8: HMM-BASED SPEECH SYNTHESIS WITH UNSUPERVISED LABELING OF ACCENTUAL CONTEXT BASED ON F0 QUANTIZATION AND AVERAGE VOICE MODEL |
| Takashi Nose; Tokyo Institute of Technology |
| Koujirou Ooki; Tokyo Institute of Technology |
| Takao Kobayashi; Tokyo Institute of Technology |
| SP-P4.9: A COMBINED TIME-VARYING AND TIME-INVARIANT PREDICTION ALGORITHM BASED ON LATTICE FILTERS FOR SPEECH ANALYSIS AND SYNTHESIS |
| Karl Schnell; Goethe-University Frankfurt am Main |
| SP-P4.10: A HMM-BASED SPEECH SYNTHESIS SYSTEM USING A NEW GLOTTAL SOURCE AND VOCAL-TRACT SEPARATION METHOD |
| Pierre Lanchantin; IRCAM |
| Gilles Degottex; IRCAM |
| Xavier Rodet; IRCAM |
| SP-P4.11: APPLYING LOG LINEAR MODEL BASED CONTEXT DEPENDENT MACHINE TRANSLATION TECHNIQUES TO GRAPHEME-TO-PHONEME CONVERSION |
| Rong Zhang; IBM |
| Bowen Zhou; IBM |
| SP-P4.12: SYNTHESIZING SPEECH FROM DOPPLER SIGNALS |
| Arthur Toth; Carnegie Mellon University |
| Bhiksha Raj; Carnegie Mellon University |
| Kaustubh Kalgaonkar; Georgia Institute of Technology |
| Tony Ezzat; Mitsubishi Electric Research Laboratories |
| SP-P4.13: UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS USING TWO-PASS DECISION TREE CONSTRUCTION |
| Matthew Gibson; Cambridge University |
| Teemu Hirsimaki; Helsinki University of Technology |
| Reima Karhila; Helsinki University of Technology |
| Mikko Kurimo; Helsinki University of Technology |
| William Byrne; Cambridge University |