Speech synthesis is the process of converting written text into machine-generated synthetic speech. Concatenative speech synthesis systems form utterances by concatenating pre-recorded speech units. Corpus-based methods use a large inventory to select the units to be concatenated. In this paper, we design and develop an intelligible and natural sounding corpus-based concatenative speech synthesis system for the Turkish language. The implemented system contains a front-end comprised of text analysis, ,phonetic analysis, and optional use of transplanted prosody. The unit selection algorithm is based on commonly used Viterbi decoding algorithm of the best-path in the network of the speech units using spectral discontinuity and prosodic mismatch objective cost measures. The back-end is the speech waveform generation based on the harmonic coding of speech and overlap-and-add mechanism. Harmonic coding enabled us to compress the unit inventory size by a factor of three. In this study, a Turkish phoneme set has been designed and a pronunciation lexicon for root words has been constructed. The importance of prosody in unit selection has been investigated by using transplanted prosody. A Turkish Diagnostic Rhyme Test (DRT) word list that can be used to evaluate the intelligibility of Turkish Text-to-Speech (TTS) systems has been compiled. Several experiments have been performed to evaluate the quality of the synthesized speech and we obtained 4-2 Mean Opinion Score (MOS) in the listening tests for our system, which is the first unit selection based system published for Turkish.
Yazar |
Sak, Haşim Güngör, Tunga Safkan, Yaşar |
Yayın Türü | Article |
Tek Biçim Adres | https://hdl.handle.net/20.500.11831/5338 |
Konu Başlıkları |
Mühendislik
Elektrik ve Elektronik |
Koleksiyonlar |
Araştırma Çıktıları | Ön Baskı | WoS | Scopus | TR-Dizin | PubMed 03- Scopus İndeksli Yayınlar Koleksiyonu 04- TR-Dizin İndeksli Yayınlar Koleksiyonu |
Dergi Adı | Turkish Journal of Electrical Engineering and Computer Sciences |
Cild | 14 |
Dergi Sayısı | 2 |
Sayfalar | 209 - 223 |
Yayın Tarihi | 2006 |
Eser Adı [dc.title] | A corpus-based concatenative speech synthesis system for Turkish |
Yazar [dc.contributor.author] | Sak, Haşim |
Yazar [dc.contributor.author] | Güngör, Tunga |
Yazar [dc.contributor.author] | Safkan, Yaşar |
Yayın Türü [dc.type] | article |
Özet [dc.description.abstract] | Speech synthesis is the process of converting written text into machine-generated synthetic speech. Concatenative speech synthesis systems form utterances by concatenating pre-recorded speech units. Corpus-based methods use a large inventory to select the units to be concatenated. In this paper, we design and develop an intelligible and natural sounding corpus-based concatenative speech synthesis system for the Turkish language. The implemented system contains a front-end comprised of text analysis, ,phonetic analysis, and optional use of transplanted prosody. The unit selection algorithm is based on commonly used Viterbi decoding algorithm of the best-path in the network of the speech units using spectral discontinuity and prosodic mismatch objective cost measures. The back-end is the speech waveform generation based on the harmonic coding of speech and overlap-and-add mechanism. Harmonic coding enabled us to compress the unit inventory size by a factor of three. In this study, a Turkish phoneme set has been designed and a pronunciation lexicon for root words has been constructed. The importance of prosody in unit selection has been investigated by using transplanted prosody. A Turkish Diagnostic Rhyme Test (DRT) word list that can be used to evaluate the intelligibility of Turkish Text-to-Speech (TTS) systems has been compiled. Several experiments have been performed to evaluate the quality of the synthesized speech and we obtained 4-2 Mean Opinion Score (MOS) in the listening tests for our system, which is the first unit selection based system published for Turkish. |
Kayıt Giriş Tarihi [dc.date.accessioned] | 2020-03-18 |
Yayın Tarihi [dc.date.issued] | 2006 |
Açık Erişim Tarihi [dc.date.available] | 2020-03-18 |
Dil [dc.language.iso] | eng |
Konu Başlıkları [dc.subject] | Mühendislik |
Konu Başlıkları [dc.subject] | Elektrik ve Elektronik |
Haklar [dc.rights] | info:eu-repo/semantics/openAccess |
ISSN [dc.identifier.issn] | 1300-0632 |
ISSN [dc.identifier.issn] | 1300-0632 |
Yayının ilk sayfa sayısı [dc.identifier.startpage] | 209 |
Yayının son sayfa sayısı [dc.identifier.endpage] | 223 |
Dergi Adı [dc.relation.journal] | Turkish Journal of Electrical Engineering and Computer Sciences |
Dergi Sayısı [dc.identifier.issue] | 2 |
Cild [dc.identifier.volume] | 14 |
Tek Biçim Adres [dc.identifier.uri] | http://www.trdizin.gov.tr/publication/paper/detail/TmpBMU1ESXk= |
Tek Biçim Adres [dc.identifier.uri] | https://hdl.handle.net/20.500.11831/5338 |