A corpus-based concatenative speech synthesis system for Turkish

Speech synthesis is the process of converting written text into machine-generated synthetic speech. Concatenative speech synthesis systems form utterances by concatenating pre-recorded speech units. Corpus-based methods use a large inventory to select the units to be concatenated. In this paper, we design and develop an intelligible and natural sounding corpus-based concatenative speech synthesis system for the Turkish language. The implemented system contains a front-end comprised of text analysis, ,phonetic analysis, and optional use of transplanted prosody. The unit selection algorithm is based on commonly used Viterbi decoding algorithm of the best-path in the network of the speech units using spectral discontinuity and prosodic mismatch objective cost measures. The back-end is the speech waveform generation based on the harmonic coding of speech and overlap-and-add mechanism. Harmonic coding enabled us to compress the unit inventory size by a factor of three. In this study, a Turkish phoneme set has been designed and a pronunciation lexicon for root words has been constructed. The importance of prosody in unit selection has been investigated by using transplanted prosody. A Turkish Diagnostic Rhyme Test (DRT) word list that can be used to evaluate the intelligibility of Turkish Text-to-Speech (TTS) systems has been compiled. Several experiments have been performed to evaluate the quality of the synthesized speech and we obtained 4-2 Mean Opinion Score (MOS) in the listening tests for our system, which is the first unit selection based system published for Turkish.

Dergi Adı Turkish Journal of Electrical Engineering and Computer Sciences
Cild 14
Dergi Sayısı 2
Sayfalar 209 - 223
Yayın Tarihi 2006
Eser Adı
[dc.title]
A corpus-based concatenative speech synthesis system for Turkish
Yazar
[dc.contributor.author]
Sak, Haşim
Yazar
[dc.contributor.author]
Güngör, Tunga
Yazar
[dc.contributor.author]
Safkan, Yaşar
Yayın Türü
[dc.type]
article
Özet
[dc.description.abstract]
Speech synthesis is the process of converting written text into machine-generated synthetic speech. Concatenative speech synthesis systems form utterances by concatenating pre-recorded speech units. Corpus-based methods use a large inventory to select the units to be concatenated. In this paper, we design and develop an intelligible and natural sounding corpus-based concatenative speech synthesis system for the Turkish language. The implemented system contains a front-end comprised of text analysis, ,phonetic analysis, and optional use of transplanted prosody. The unit selection algorithm is based on commonly used Viterbi decoding algorithm of the best-path in the network of the speech units using spectral discontinuity and prosodic mismatch objective cost measures. The back-end is the speech waveform generation based on the harmonic coding of speech and overlap-and-add mechanism. Harmonic coding enabled us to compress the unit inventory size by a factor of three. In this study, a Turkish phoneme set has been designed and a pronunciation lexicon for root words has been constructed. The importance of prosody in unit selection has been investigated by using transplanted prosody. A Turkish Diagnostic Rhyme Test (DRT) word list that can be used to evaluate the intelligibility of Turkish Text-to-Speech (TTS) systems has been compiled. Several experiments have been performed to evaluate the quality of the synthesized speech and we obtained 4-2 Mean Opinion Score (MOS) in the listening tests for our system, which is the first unit selection based system published for Turkish.
Kayıt Giriş Tarihi
[dc.date.accessioned]
2020-03-18
Yayın Tarihi
[dc.date.issued]
2006
Açık Erişim Tarihi
[dc.date.available]
2020-03-18
Dil
[dc.language.iso]
eng
Konu Başlıkları
[dc.subject]
Mühendislik
Konu Başlıkları
[dc.subject]
Elektrik ve Elektronik
Haklar
[dc.rights]
info:eu-repo/semantics/openAccess
ISSN
[dc.identifier.issn]
1300-0632
ISSN
[dc.identifier.issn]
1300-0632
Yayının ilk sayfa sayısı
[dc.identifier.startpage]
209
Yayının son sayfa sayısı
[dc.identifier.endpage]
223
Dergi Adı
[dc.relation.journal]
Turkish Journal of Electrical Engineering and Computer Sciences
Dergi Sayısı
[dc.identifier.issue]
2
Cild
[dc.identifier.volume]
14
Tek Biçim Adres
[dc.identifier.uri]
http://www.trdizin.gov.tr/publication/paper/detail/TmpBMU1ESXk=
Tek Biçim Adres
[dc.identifier.uri]
https://hdl.handle.net/20.500.11831/5338
Görüntülenme Sayısı ( Şehir )
Görüntülenme Sayısı ( Ülke )
Görüntülenme Sayısı ( Zaman Dağılımı )
Görüntülenme
6
20.03.2023 tarihinden bu yana
İndirme
1
20.03.2023 tarihinden bu yana
Son Erişim Tarihi
29 Eylül 2023 03:16
Google Kontrol
Tıklayınız
speech Turkish system selection prosody synthesis algorithm evaluate transplanted inventory systems analysis coding compress factor phoneme designed pronunciation investigated lexicon importance constructed Diagnostic quality published listening Opinion obtained synthesized performed experiments Several compiled Text-to-Speech intelligibility
6698 sayılı Kişisel Verilerin Korunması Kanunu kapsamında yükümlülüklerimiz ve çerez politikamız hakkında bilgi sahibi olmak için alttaki bağlantıyı kullanabilirsiniz.

creativecommons
Bu site altında yer alan tüm kaynaklar Creative Commons Alıntı-GayriTicari-Türetilemez 4.0 Uluslararası Lisansı ile lisanslanmıştır.
Platforms