Title | : | An evaluation of sentence selection methods on the different phone-sized units for constructing Indonesian speech corpus |
Author | : | Muljono (1), Prof. Drs. Agus Harjoko, M.Sc., Ph.D. (2), Nurul Anisa Sri Winarsih (3), Catur Supriyanto (4) |
Date | : | 2020 |
Keyword | : | Indonesian minimum sentence set, Least-to-most greedy algorithm, Phonetically balanced sentence set, Speech corpus |
Abstract | : | Collecting a phonetically balanced text corpus is an important step in developing automatic speech recognition and text-to-speech systems. Such a corpus should have a small number of sentences yet contain all phonetic units, such as monophone, triphone, and pentaphone units. There exist a least-to-most greedy algorithm (LTM + Greedy) and a variant of it for selecting the minimum sentence set. The variant differs in the sentence scoring method, which affects the number of selected sentences. In this paper, we evaluate the sentence scoring methods by Zhang and Suyanto on the LTM + Greedy algorithm. The sentence scoring methods are applied to triphone and pentaphone units on the collected sentence set. Triphone and pentaphone units offer higher-quality synthesized speech than monophone units. The dataset of this paper consists of Indonesian sentences collected from holy book translations, news, novels, dialogs, monologues, and question sentences. In total, 115,489 sentences are used for the experiments. Based on the experiments, LTM + Greedy by Suyanto produces a smaller number of sentences that contain a large number of phone units. |
Group of Knowledge | : | Computer Science |
Original Language | : | English |
Level | : | International |
Status | : | Published |
No | Title | Action |
---|---|---|
1 | Editors-International Journal of Speech Technology _ Editors.pdf (Document Type: [PAK] Editorial Page) | View |
2 | toc-International Journal of Speech Technology _ Volume 23, issue 1.pdf (Document Type: [PAK] Table of Contents) | View |
3 | home-International Journal of Speech Technology _ Home.pdf (Document Type: [PAK] Cover Page) | View |
4 | turnitin-An evaluation of sentence selection methods on the diferent phone?sized units for constructing Indonesian speech corpus.pdf (Document Type: [PAK] Similarity Check) | View |
5 | IJST_AnEvaluationOfSentenceSelection-lengkap-PAK.pdf (Document Type: [PAK] Full Document) | View |