SP-P4: Speech Synthesis II |
Session Type: Poster |
Time: Wednesday, March 17, 10:00 - 12:00 |
Location: Poster Area A |
Session Chair: Tomoki Toda, NAIST, Japan |
SP-P4.1: UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS |
Keiichiro Oura; Nagoya Institute of Technology |
Keiichi Tokuda; Nagoya Institute of Technology |
Junichi Yamagishi; University of Edinburgh |
Simon King; University of Edinburgh |
Mirjam Wester; University of Edinburgh |
SP-P4.2: A COMPARISON OF SUPERVISED AND UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION APPROACHES FOR HMM-BASED SPEECH SYNTHESIS |
Hui Liang; Idiap Research Institute and Ecole Polytechnique Fédérale de Lausanne (EPFL) |
John Dines; Idiap Research Institute |
Lakshmi Saheer; Idiap Research Institute and Ecole Polytechnique Fédérale de Lausanne (EPFL) |
SP-P4.3: CROSS-VALIDATION BASED DECISION TREE CLUSTERING FOR HMM-BASED TTS |
Yu Zhang; Shanghai Jiao Tong University |
Zhi-Jie Yan; Microsoft Research Asia |
Frank K. Soong; Microsoft Research Asia |
SP-P4.4: IMPROVED MODELING FOR F0 GENERATION AND V/U DECISION IN HMM-BASED TTS |
Qingqing Zhang; Institute of Acoustics, Chinese Academy of Sciences |
Frank Soong; Microsoft Research Asia |
Yao Qian; Microsoft Research Asia |
Zhijie Yan; Microsoft Research Asia |
Jielin Pan; Institute of Acoustics, Chinese Academy of Sciences |
Yonghong Yan; Institute of Acoustics, Chinese Academy of Sciences |
SP-P4.5: SIMPLE METHODS FOR IMPROVING SPEAKER-SIMILARITY OF HMM-BASED SPEECH SYNTHESIS |
Junichi Yamagishi; University of Edinburgh |
Simon King; University of Edinburgh |
SP-P4.6: AN AUTOENCODER NEURAL-NETWORK BASED LOW-DIMENSIONALITY APPROACH TO EXCITATION MODELING FOR HMM-BASED TEXT-TO-SPEECH |
Srikanth Vishnubhotla; University of Maryland |
Raul Fernandez; IBM Research |
Bhuvana Ramabhadran; IBM Research |
SP-P4.7: KALMAN FILTER BASED SPEECH SYNTHESIS |
Carl Quillen; MIT Lincoln Laboratory |
SP-P4.8: HMM-BASED SPEECH SYNTHESIS WITH UNSUPERVISED LABELING OF ACCENTUAL CONTEXT BASED ON F0 QUANTIZATION AND AVERAGE VOICE MODEL |
Takashi Nose; Tokyo Institute of Technology |
Koujirou Ooki; Tokyo Institute of Technology |
Takao Kobayashi; Tokyo Institute of Technology |
SP-P4.9: A COMBINED TIME-VARYING AND TIME-INVARIANT PREDICTION ALGORITHM BASED ON LATTICE FILTERS FOR SPEECH ANALYSIS AND SYNTHESIS |
Karl Schnell; Goethe-University Frankfurt am Main |
SP-P4.10: A HMM-BASED SPEECH SYNTHESIS SYSTEM USING A NEW GLOTTAL SOURCE AND VOCAL-TRACT SEPARATION METHOD |
Pierre Lanchantin; IRCAM |
Gilles Degottex; IRCAM |
Xavier Rodet; IRCAM |
SP-P4.11: APPLYING LOG LINEAR MODEL BASED CONTEXT DEPENDENT MACHINE TRANSLATION TECHNIQUES TO GRAPHEME-TO-PHONEME CONVERSION |
Rong Zhang; IBM |
Bowen Zhou; IBM |
SP-P4.12: SYNTHESIZING SPEECH FROM DOPPLER SIGNALS |
Arthur Toth; Carnegie Mellon University |
Bhiksha Raj; Carnegie Mellon University |
Kaustubh Kalgaonkar; Georgia Institute of Technology |
Tony Ezzat; Mitsubishi Electric Research Laboratories |
SP-P4.13: UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS USING TWO-PASS DECISION TREE CONSTRUCTION |
Matthew Gibson; Cambridge University |
Teemu Hirsimäki; Helsinki University of Technology |
Reima Karhila; Helsinki University of Technology |
Mikko Kurimo; Helsinki University of Technology |
William Byrne; Cambridge University |