Volume 27 Issue 1
Jan.  2002

YANG Yuhong, HU Ruimin, XU Zhengquan, AI Haojun. Diphone-based Unit Selection in Text-to-Speech Conversion for Mandarin[J]. Geomatics and Information Science of Wuhan University, 2002, 27(1): 94-97.

Diphone-based Unit Selection in Text-to-Speech Conversion for Mandarin

  • Received Date: 2001-12-10
  • Publish Date: 2002-01-05
  • Most Mandarin text-to-speech (TTS) systems are syllable-based: they capture syllable-internal coarticulation but ignore cross-syllable coarticulation. One solution to this problem is to abandon syllable-based models in favor of units that can model both syllable-internal and cross-syllable coarticulation. One such unit is the diphone, which has been widely used in English TTS systems. The diphone concept can be refined for Chinese speech synthesis because Mandarin has only 410 syllables. It is shown that diphone-based models can model cross-syllable coarticulation and produce more natural-sounding speech than syllable-based systems, which simply insert silence between syllables.
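
To illustrate why diphone units span syllable boundaries, the following minimal sketch splits a short utterance into adjacent-phone pairs. It is not the authors' unit inventory or selection method. It assumes, for illustration only, that each syllable is a pinyin (initial, final) pair treated as two phones, and that `#` is a hypothetical silence symbol; the function names are likewise hypothetical.

```python
from typing import List, Tuple

SILENCE = "#"  # hypothetical utterance-boundary symbol, not taken from the paper


def syllables_to_phones(syllables: List[Tuple[str, str]]) -> List[str]:
    """Flatten (initial, final) syllables into one phone sequence.

    Simplifying assumption: the initial and the final each count as one phone.
    A zero-initial syllable is written with an empty initial, e.g. ("", "ai").
    """
    phones = [SILENCE]
    for initial, final in syllables:
        if initial:
            phones.append(initial)
        phones.append(final)
    phones.append(SILENCE)
    return phones


def phones_to_diphones(phones: List[str]) -> List[str]:
    """Pair each phone with its successor, so every unit covers one transition."""
    return [f"{a}-{b}" for a, b in zip(phones, phones[1:])]


# "ni hao" as two syllables: (initial, final)
utterance = [("n", "i"), ("h", "ao")]
diphones = phones_to_diphones(syllables_to_phones(utterance))
print(diphones)
```

In this sketch the unit `i-h` covers the transition from the end of the first syllable into the start of the second, which is exactly the transition that whole-syllable units leave unmodeled.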

