Automatic Phoneme Identification for Malay Dialects

Yen-Min Jasmina Khaw, Tien-Ping Tan, Bali Ranaivo-Malançon

Abstract


In many languages, such as English, French, German, and Mandarin, the pronunciation of words is well documented. The pronunciation of a word is determined by its sequence of phonemes, or speech sounds. Each language or dialect may have a different phoneme set. However, phonological studies of dialects are often lacking, and for some dialects or languages without a written form even the number of phonemes is unknown. In this work, we propose an approach that identifies the phonemes of a dialect from its text transcripts and speech corpus, leveraging existing resources from the standard language and from multilingual sources. Our study was carried out on Malay dialects. The results show that the accuracy of the phoneme identification approach is high when its output is compared against previous work in the area.

Keywords


Phoneme Identification; Malay Dialect; Multilingual; Text Transcript





Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.

ISSN: 2180-1843

eISSN: 2289-8131