L. E. Baum, An inequality and associated maximization technique in statistical estimation for probabilistic functions of markov processes, Proceedings of 3rd Symposium on Inequalities, pp.1-8, 1972.

V. Berment, Méthodes pour informatiser des langues et des groups de langues peu dotées, 2004.

L. Besacier, E. Barnard, A. Karpov, and T. Schultz, Automatic speech recognition for under-resourced languages: A survey, Speech Communication, vol.56, pp.85-100, 2014.
DOI : 10.1016/j.specom.2013.07.008
URL : https://hal.archives-ouvertes.fr/hal-00953644

J. Billa, K. Ma, J. Mcdonough, G. Zavaliagkos, D. R. Miller et al., Multilingual speech recognition: the 1996 byblos callhome system, Proceedings of Eurospeech, pp.363-366, 1997.

S. Bird, L. Gawne, K. Gelbart, and I. Mcalister, Collecting bilingual audio in remote indigenous communities, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pp.1015-1024, 2014.

S. Bird, F. R. Hanke, O. Adams, and H. Lee, Aikuma: A Mobile App for Collaborative Language Documentation, Proceedings of the 2014 Workshop on the Use of Computational Methods in the Study of Endangered Languages, p.1, 2014.
DOI : 10.3115/v1/W14-2201

M. Bisani and H. Ney, Joint-sequence models for grapheme-to-phoneme conversion, Speech Communication, vol.50, issue.5, pp.434-451, 2008.
DOI : 10.1016/j.specom.2008.01.002
URL : https://hal.archives-ouvertes.fr/hal-00499203

P. H. Bo-june and J. Glass, Iterative language model estimation: Efficient data structure & algorithms, Proceedings of INTERSPEECH, pp.841-844, 2008.

G. Bouselmi, D. Fohr, and J. P. Haton, Fully automated non-native speech recognition using confusion-based acoustic model intergration, Proceedings of Eurospeech, pp.1369-1372, 2005.

L. Burget, P. Schwartz, M. Agarwal, P. Akyazi, K. Feng et al., Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing
DOI : 10.1109/ICASSP.2010.5495646

R. Bust, Subgrouping, circularity and extinction : some issues in austronesian comparative linguistics, Selected papers from the Eigth International Conference on Austronesian Linguistics, pp.31-94, 1999.

G. Caelen-haumont, Towards the mo piu tonal system: first results on an undocumented south-asian language, Proceedings of Speech Prosody, 2012.

G. Caelen-haumont and S. Sam, Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the south-asia: the case of mo piu, Proceedings of LREC, pp.956-962, 2008.

G. Caelen-haumont, S. Sam, and E. Castelli, Automatic Labeling and Phonetic Assessment for an Unknown Asian Language: The Case of the "Mo Piu" North Vietnamese Minority (early results), 2011 International Conference on Asian Language Processing, pp.260-263, 2011.
DOI : 10.1109/IALP.2011.81

O. C. ¸-etin, M. Plauché, and U. Nallasamy, Unsupervised adpative speech technology for limited resource languages: A case study for tamil, Proceedings of ICASSP, 2007.

S. F. Chen, Conditional and joint models for grapheme-to-phoneme conversion, Proceedings of EUROSPEECH, pp.933-936, 2003.

X. Chen and J. Cheng, Deep neural network acoustic modeling for native and non-native Mandarin speech recognition, The 9th International Symposium on Chinese Spoken Language Processing, 2014.
DOI : 10.1109/ISCSLP.2014.6936617

P. Cohen, S. Dharanipragada, J. Gros, M. Monkowski, C. Neti et al., Towards a universal speech recognizer for multiple languages, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, pp.591-598, 1997.
DOI : 10.1109/ASRU.1997.659140

G. E. Dahl, D. Yu, L. Deng, and A. Acero, Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, vol.20, issue.1, pp.30-42, 2012.

M. Davel and E. Barnard, Bootstrapping in language resource generation, Proceedings of 14th Annual Symposium of the Pattern Recognition Association of South Africa, 2003.

M. Davel and E. Barnard, The efficient generation of pronunciation dictionaries: Human factors during bootstrapping, Proceedings of INTERSPEECH, 2004.

S. Davis and P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.28, issue.4, pp.357-366, 1980.
DOI : 10.1109/TASSP.1980.1163420

N. J. De-vries, Effective automatic speech recognition data collection for under-resourced languages. Master's thesis, 2011.

A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood from incomplete data via the em algorithm, Journal of the Royal Statistical Society, vol.39, issue.1, pp.1-21, 1977.

C. Dugast, X. Aubert, and R. Kneser, The philips large-vocabulary recognition system for american english, french and german, Proceedings of Eurospeech, pp.197-200, 1995.

I. Dyen, J. B. Kruskal, and P. Black, An Indoeuropean classification:a lexicostatistical experiment, volume iii, Transactions of the American Philosophical Society, 1992.

J. L. Elman, Finding Structure in Time, Cognitive Science, vol.49, issue.2, pp.179-211, 1990.
DOI : 10.1207/s15516709cog1402_1

J. Ensiring, J. Umbat, and R. M. Salleh, The Tun Jugah Foundation, Bup Sereba Reti Jaku Iban, 2011.

V. Ferdiansyah and A. Purwarianti, Indonesian Automatic Speech Recognition System Using English-Based Acoustic Model, American Journal of Signal Processing, vol.2, issue.4, pp.60-63, 2012.
DOI : 10.5923/j.ajsp.20120204.01

R. A. Fisher, The Use of Multiple Measures in Taxonomic Problems, 1936.

S. Furui, Speaker-independent isolated word recognition using dynamic features of speech spectrum, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.34, issue.1, pp.52-59, 1986.
DOI : 10.1109/TASSP.1986.1164788

M. Gales, Maximum likelihood linear transformations for HMM-based speech recognition, Computer Science and Language, pp.75-98, 1998.
DOI : 10.1006/csla.1998.0043

J. F. Gales, K. M. Knill, A. Ragni, and S. P. Rath, Speech recognition and keyword spotting for low resource languages: Babel project research at cued, Proceedings of Workshop for Spoken Language Technology for Under-resourced (SLTU), Russia, 2014.

A. Gandhe, F. Metze, and I. Lane, Neural network language models for low resource languages, Proceedings of INTERSPEECH, 2014.

H. Gelas, S. T. Abate, L. Besacier, and F. Pellegrino, Quality assessment of crowdsourcing transcriptions for african languages, Proceedings of INTERSPEECH, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00959158

A. Ghoshal, P. Swietojanski, and S. Renals, Multilingual training of deep neural networks, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7319-7323, 2013.
DOI : 10.1109/ICASSP.2013.6639084

V. Goel, S. Kumar, and W. Byrne, Segmental minimum bayes-risk decoding for automatic speech recognition, Proceedings of IEEE Transactions on Speech and Audio Processing, 2003.

K. E. Goh and A. M. Ahmad, Malay speech recognition using self-organizing map and multilayer perceptron, Proceedings of Postgraduate Annual Research Seminar, 2005.

C. Gollan and M. Bacchiani, Confidence scores for acoustic model adaptation, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4289-4292, 2008.
DOI : 10.1109/ICASSP.2008.4518603

I. J. Good, The Population Frequencies of Species and the Estimation of Population Parameters, Biometrika, vol.40, issue.3/4, pp.237-264, 1953.
DOI : 10.2307/2333344

R. A. Gopinath, Maximum likelihood modeling with Gaussian distributions for classification, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181), pp.661-664, 1998.
DOI : 10.1109/ICASSP.1998.675351

S. Goronzy, Robust adaptation to non-native accents in automatic speech recognition, 2002.
DOI : 10.1007/3-540-36290-8

P. Hahn, M. Vozila, and . Bisani, Comparison of graphemeto-phoneme methods on large pronunciation dictionaries and lvcsr tasks, Proceedings of INTERSPEECH, 2012.

W. Heeringa, F. De, and . Wet, The origin of afrikaans pronunciation: a comparison to west germanic languages and dutch dialects, Proceedings of Conference of the Pattern Recognition Association of South Africa, pp.159-164, 2008.

G. Hinton, L. Deng, D. Yu, A. Mohamed, N. Jaitly et al., Deep neural networks for acoustic modeling in speech recognition, IEEE Signal Processing Magazine, issue.6, pp.2982-97, 2012.

G. E. Hinton, A practical guide to training restricted boltzmann machines. Utml tr 2010-003, 2010.

G. E. Hinton, S. Osindero, and Y. Teh, A Fast Learning Algorithm for Deep Belief Nets, Neural Computation, vol.18, issue.7, pp.1527-1554, 2006.
DOI : 10.1162/jmlr.2003.4.7-8.1235

G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, Improving neural networks by preventing co-adaptation of feature detectors, Proceedings of CoRR, 2012.

C. Huang, E. Chang, J. Zhou, and K. Lee, Accent modeling based on pronunciation dictionary adaptation for large vocabulary mandarin speech recognition, Proceedings of ICLSP, pp.818-821, 2000.

J. Huang, J. Li, D. Yu, L. Deng, and Y. Gong, Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013.
DOI : 10.1109/ICASSP.2013.6639081

Y. Huang, D. Yu, C. Liu, and Y. Gong, Multi-accent deep neural network acoustic model with accent-specific top layer using the kld-regularized model adaptation, Proceedings of INTERSPEECH, 2014.

K. Hughes, L. Nakajima, A. Ha, P. Vasu, M. Moreno et al., Building transcribed speech corpora quickly and cheaply for many languages, Proceedings of INTERSPEECH, pp.1914-1917, 2010.

D. Imseng, P. Motlicek, H. Bourlard, and P. N. Garner, Using out-of-language data to improve an under-resourced speech recognizer, Speech Communication, vol.56, issue.0, pp.56142-151, 2014.
DOI : 10.1016/j.specom.2013.01.007

F. Jelinek, Statistical Methods for Speech Recognition, 2001.

F. Jelinek and R. L. Mercer, Interpolated estimation of markov source parameters from sparse data, Proceedings of Workshop on Patter Recognition in Practice, pp.381-397, 1980.

S. Jiampojamarn, G. Kondrak, and T. Sherif, Applying many-to-many alignments and hidden markov models to letter-to-phoneme conversion In Human Language Technologies 2007: The Conference of the North American Chapter, Proceedings of the Main Conference, pp.372-379, 2007.

S. S. Juan and L. Besacier, Fast bootstrapping of grapheme to phoneme system for under-resourced languages -application to the iban language, Proceedings of 4th Workshop on South and Southeast Asian Natural Language Processing 2013, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00953784

S. S. Juan, L. Besacier, and T. Tan, Analysis of Malay Speech Recognition for Different Speaker Origins, 2012 International Conference on Asian Language Processing, pp.229-232, 2012.
DOI : 10.1109/IALP.2012.23

S. S. Juan, L. Besacier, B. Lecouteux, and T. Tan, Using closely-related language to build an ASR for a very under-resourced language: Iban, 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), pp.71-76, 2014.
DOI : 10.1109/ICSDA.2014.7051423
URL : https://hal.archives-ouvertes.fr/hal-01055576

S. S. Juan, L. Besacier, and S. Rossato, Semi-supervised G2P bootstrapping and its application to asr for a very under-resourced language: Iban, Proceedings of Workshop for Spoken Language Technology for Under-resourced (SLTU), 2014.

S. Juan, L. Besacier, and S. Rossato, Construction faiblement supervisée d'un phonétiseur pour la langue ibanàibanà partir de ressources en malais, Proceedings of Journée d'Etude sur la Parole (JEP), 2014.

S. M. Katz, Estimation of probabilities from sparse data for the language model component of a speech recognizer, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.35, issue.3, pp.400-401, 1987.
DOI : 10.1109/TASSP.1987.1165125

B. Kingsbury, Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.3761-3764, 2009.
DOI : 10.1109/ICASSP.2009.4960445

R. Kneser, Grapheme-to-phoneme study, 2000.

R. Kneser and H. Ney, Improved backing-off for M-gram language modeling, 1995 International Conference on Acoustics, Speech, and Signal Processing, pp.181-184, 1995.
DOI : 10.1109/ICASSP.1995.479394

L. Lamel, M. Adda-decker, and J. L. Gauvain, Issues in large vocabulary multilingual speech recognition, Proceedings of Eurospeech, pp.185-189, 1995.

V. B. Le and L. Besacier, Automatic speech recognition for under-resourced languages: application to vietnamese language, IEEE Transactions on Audio, Speech and Language Processing, vol.17, issue.8, pp.1471-1482, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00953718

K. F. Lee, S. Hayamizu, H. W. Hon, C. Huang, J. Swartz et al., Allophone clustering for continuous speech recognition, International Conference on Acoustics, Speech, and Signal Processing, pp.749-752, 1990.
DOI : 10.1109/ICASSP.1990.115900

C. J. Leggetter and P. C. Woodland, Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, Computer Speech & Language, vol.9, issue.2, pp.171-185, 1995.
DOI : 10.1006/csla.1995.0010

W. P. Lehmann, Historical Linguistics, 1993.

M. P. Lewis and G. F. Simons, Assessing endangerment: Expanding fishman's gids

M. P. Lewis, G. F. Simons, and C. D. Fennig, Ethnologue : Languages of the world, Seventh Edition

H. Lin, L. Deng, D. Yu, Y. Fan-gong, A. Acero et al., A study on multilingual acoustic modeling for large vocabulary ASR, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4333-4336, 2009.
DOI : 10.1109/ICASSP.2009.4960588

J. Lööf, C. Gollan, S. Hahn, G. Heigold, B. Hoffmeister et al., The rwth 2007 tc-star evaluation system for european english and spanish, Proceedings of INTERSPEECH, pp.2145-2148, 2007.

J. Lööf, C. Gollan, and H. Ney, Cross-language bootstrapping for unsupervised acoustic model training: Rapid development of a polish speech recognition system, Proceedings of INTERSPEECH, 2009.

L. Lu, A. Ghoshal, and S. Renals, Regularized subspace Gaussian mixture models for cross-lingual speech recognition, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011.
DOI : 10.1109/ASRU.2011.6163959

L. Lu, A. Ghoshal, and S. Renals, Maximum a posteriori adaptation of subspace Gaussian mixture models for cross-lingual speech recognition, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012.
DOI : 10.1109/ICASSP.2012.6289012

L. Lu, A. Ghoshal, and S. Renals, Cross-Lingual Subspace Gaussian Mixture Models for Low-Resource Speech Recognition, Speech and Language Processing, pp.17-27, 2014.
DOI : 10.1109/TASL.2013.2281575

K. R. Mabokela, M. J. Manamela, and M. Manaileng, Modeling code-switching speech on under-resourced languages for language identification, Proceedings of Workshop for Spoken Language Technology for Under-resourced (SLTU), pp.225-230, 2014.

Y. M. Maris, The Malay Sound System. Siri Teks Fajar Bakti, 1979.

S. R. Maskey, A. W. Black, and L. M. Tomokiyo, Bootstrapping phonetic lexicons for language, Proceedings of INTERSPEECH, pp.69-72, 2004.

M. Maxwell and B. Hughes, Frontiers in linguistic annotation for low-density languages, Proceedings of Workshop on Frontiers in Linguistically annotated corpora, pp.29-37, 2006.

F. Miao and . Metze, Improving low-resource cd-dnn-hmm using dropout and multilingual dnn training, Proceedings of INTERSPEECH, pp.2237-2241, 2013.

T. Mikolov, M. Karafiát, L. Burget, J. H. Cernock´ycernock´y, and S. Khudanpur, Recurrent neural network based language model, Proceedings of Interspeech, pp.1045-1048, 2010.

T. Mikolov, S. Kombrink, L. Burget, J. H. Cernock´ycernock´y, and S. Khudanpur, Extensions of recurrent neural network language model, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5528-5531, 2011.
DOI : 10.1109/ICASSP.2011.5947611

A. Mohamed, G. E. Dahl, and G. Hinton, Acoustic Modeling Using Deep Belief Networks, IEEE Transactions on Audio, Speech and Language Processing, pp.14-22, 2012.
DOI : 10.1109/TASL.2011.2109382

A. Mohan, S. H. Ghalehjegh, and R. C. Rose, Dealing with acoustic mismatch for training multlingual subspace gaussian mixture models for speech recognition, Proceedings of ICASSP, pp.4893-4896, 2012.

R. Molapo, E. Barnard, F. De, and . Wet, Speech data collection in an under-resourced language within a multilingual context, Proceedings of Workshop for Spoken Language Technology for Under-resourced (SLTU), pp.238-242, 2014.

J. J. Morgan, Making a speech recognizer tolerate non-native speech through gaussian mixture merging, Proceedings of ICALL'04, 2004.

J. Mugabe, P. Kameri-mbote, and D. Mutta, Traditional knowledge, genetic resources and intellectual property protection: Towards a new international regime, International Environmental Law Research Center, 2001.

W. D. Mulder, S. Bethard, and M. Moens, A survey on the application of recurrent neural networks to statistical language modeling, Computer Speech & Language, vol.30, issue.1, pp.61-98, 2015.
DOI : 10.1016/j.csl.2014.09.005

E. L. Ng, A. W. Yeo, and B. Rainavo-malançon, Identification of closely-related indigenous languages:an orthographic approach Bibliography 143, Proceedings of International Conference on Asian Language Processing (IALP), pp.230-235, 2009.

J. R. Novak, N. Minematsu, and K. Hirose, Evaluations of an open source wfst-based phoneticezer. PDF, General Talk No, 2011.

S. Novotney and C. Callison-burch, Cheap, fast and good enough: Automatic speech recognition with non-expert transcription, Proceedings of Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL, pp.207-215, 2010.

A. H. Omar, Perkaitan bahasa melayu dengan bahasa iban dari segi sejarah

H. F. Ong and A. M. Ahmad, Malay language speech recognizer with hybrid hidden markov model and artificial neural network (hmm/ann), International Journal of Information and Education Technology, vol.1, issue.2, pp.114-119, 2011.

M. Pitz and H. Ney, Vocal tract normalization as linear transformation of mfcc, Proceedings of Eurospeech, 2003.

C. Plahl, B. Hoffmeister, M. Hwang, D. Lu, G. Heigold et al., Recent improimprove of the rwth gale mandarin lvcsr system, Proceedings of INTERSPEECH, pp.2426-2429, 2008.

D. Povey, H. J. Kuo, and H. Soltau, Fast speaker adaptive training for speech recognition, Proceedings of INTERSPEECH, pp.1245-1248, 2008.

D. Povey, L. Burget, M. Agarwal, P. Akyazi, K. Feng et al., Subspace Gaussian Mixture Models for speech recognition, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010.
DOI : 10.1109/ICASSP.2010.5495662

D. Povey, L. Burget, M. Agarwal, P. Akyazi, F. Kai et al., The subspace Gaussian mixture model???A structured model for speech recognition, Computer Speech & Language, vol.25, issue.2, pp.404-439, 2011.
DOI : 10.1016/j.csl.2010.06.003

A. Povey, G. Ghoshal, L. Boulianne, O. Burget, N. Glembek et al., Veseì y. The kaldi speech recognition toolkit, Proceedings of Workshop on Automatic Speech Recognition and Understanding, p.11, 2011.

L. R. Rabiner, A tutorial on hidden markov models and selected applications in speech recognition, Proceedings of IEEE, pp.257-286, 1989.

A. Rousseau, P. Deléglise, and Y. Estève, Ted-lium: An automatic speech recognition dedicated corpus, Proceedings of LREC European Language Resources Association (ELRA), pp.125-129, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01434928

D. Rybach, S. Hahn, C. Gollan, R. Sclüter, and H. Ney, Advances in Arabic broadcast news transcription at RWTH, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pp.449-454, 2007.
DOI : 10.1109/ASRU.2007.4430154

D. Rybach, C. Gollan, G. Heigold, B. Hoffmeister, J. Lööf et al., The rwth aachen university open source speech recognition system, Proceedings of INTERSPEECH, pp.2111-2114, 2009.

K. P. Scannell, The crúbadán project: Corpus building for under-resourced languages, Building and Exploring Web Corpora: Proceedings of the 3rd Web as Corpus Workshop, pp.5-15, 2007.

T. Schultz, Globalphone: a multilingual speech and text database developed at karlsruhe university, Proceedings of ICLSP, pp.345-348, 2002.

T. Schultz and A. Waibel, Fast bootstrapping of lvcsr systems with multilingual phoneme sets, Proceedings of Eurospeech, pp.371-374, 1997.

T. Schultz and A. Waibel, Multilingual and crosslingual speech recognition

T. Schultz and A. Waibel, Language-independent and language-adaptive acoustic modeling for speech recognition, Speech Communication, vol.35, issue.1-2, pp.31-52, 2001.
DOI : 10.1016/S0167-6393(00)00094-7

T. Schultz, N. T. Vu, and T. Schilippe, GlobalPhone: A multilingual text & speech database in 20 languages, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2013-145
DOI : 10.1109/ICASSP.2013.6639248

G. Seide, D. Li, and . Yu, Conversational speech transcription using context-dependent deep neural networks, Proceedings of INTERSPEECH, pp.437-440, 2011.

A. Sixtus and H. Ney, From within-word model search to across-word model search in large vocabulary continuous speech recognition, Computer Speech & Language, vol.16, issue.2, pp.245-271, 2002.
DOI : 10.1006/csla.2002.0192

A. Stolcke, Srilm -an extensible language modeling toolkit, Proceedings of the 7th International Conference on Spoken Language Processing, pp.901-904, 2002.

M. Swadesh, Lexico-statistic dating of prehistoric ethnic contacts, Proceedings of the American Philosophical Society, pp.452-463, 1952.

P. Swietojanski, A. Ghoshal, and S. Renals, Unsupervised cross-lingual knowledge transfer in dnn-based lvscr, Proceedings of ICASSP, 2013.

T. Tan and L. Besacier, Acoustic Model Interpolation for Non-Native Speech Recognition, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, 2007.
DOI : 10.1109/ICASSP.2007.367243

T. Tan and B. Rainavo-malançon, Malay grapheme to phoneme tool for automatic speech recognition, Proceedings of Workshop of Malaysia and Indonesia Language Engineering, 2009.

T. Tan, H. Li, E. K. Tang, X. Xiao, and E. S. Chng, MASS: A Malay language LVCSR corpus resource, 2009 Oriental COCOSDA International Conference on Speech Database and Assessments, pp.26-30, 2009.
DOI : 10.1109/ICSDA.2009.5278382

T. Tan, L. Besacier, and B. Lecouteux, Acoustic model merging using acoustic models from multilingual speakers for automatic speech recognition, 2014 International Conference on Asian Language Processing (IALP), 2014.
DOI : 10.1109/IALP.2014.6973492
URL : https://hal.archives-ouvertes.fr/hal-01020180

E. Titariy, N. Lotner, M. Gishri, and A. Moyal, A hybrid keyword spotting approach for combining lvcsr and phonetic search, Proceedings of Speech Processing Conference, 2014.

R. Tong, B. P. Lim, N. F. Chen, B. Ma, and H. Li, Subspace Gaussian mixture model for computer-assisted language learning, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5347-5351
DOI : 10.1109/ICASSP.2014.6854624

J. Vozila, Y. Adams, T. Lobacheva, and . Ryan, Grapheme to phoneme conversion and dictionary verification using graphonemes, Proceedings of Eurospeech, 2003.

N. T. Vu, D. Lyu, J. Weiner, D. Telaar, T. Schlippe et al., A first speech recognition system for Mandarin-English code-switch conversational speech, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4889-4892, 2012.
DOI : 10.1109/ICASSP.2012.6289015

N. T. Vu, D. Imseng, D. Povey, P. Motlí?-cek, T. Schultz et al., Multilingual deep neural network based acoustic modeling for rapid language adaptation, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014.
DOI : 10.1109/ICASSP.2014.6855086

M. Walsh, Will indigenous languages survive? Annual Review of Anthropology, pp.293-315, 2005.

Z. Wang, T. Schultz, and A. Waibel, Towards universal speech recognition, Proceedings of International Conference on Multimodal Interfaces, 2002.

J. Weiner, N. T. Vu, D. Telaar, F. Metze, T. Schultz et al., Integration of language identification into a recognition system for spoken conversations containing code-switches, Proceedings of Workshop for Spoken Language Technology for Under-resourced (SLTU), pp.76-79, 2012.

L. Welling, S. Kanthak, and H. Ney, Improved methods for vocal tract normalization, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), pp.761-764, 1999.
DOI : 10.1109/ICASSP.1999.759780

J. Wells, W. Barry, M. Grice, A. Fourcin, and D. Gibbon, Standard computer-compatible transcription. Doc. no. sam-ucl-037, 1992.

J. C. Wells, Computer-coding the ipa: a proposed extension of sampa, 1995.

J. C. Wells, Handbook of Standards and Resources for Spoken Language Systems, chapter SAMPA computer readable phonetic alphabet, 1997.

P. J. Werbos, Backpropagation through time: what it does and how to do it, Proceedings of IEEE adaptive text compression, pp.1550-1560, 1990.
DOI : 10.1109/5.58337

K. B. Wright, Researching Internet-Based Populations: Advantages and Disadvantages of Online Survey Research, Online Questionnaire Authoring Software Packages, and Web Survey Services, Journal of Computer-Mediated Communication, vol.6, issue.1, 2005.
DOI : 10.1111/j.1083-6101.2005.tb00259.x

S. Wurm, Language Diversity Endangered, chapter Threatened Languages in the Western Pacific Area from Taiwan to, and including Papua New Guinea, pp.374-390, 2008.

X. Xiao, E. S. Chng, T. Tan, and H. Li, Development of a malay lvscr system, Proceedings of Oriental COCOSDA, 2010.

. Woodland, Multilingual large vocabulary speech recognition: the european sqale project, Computer Speech and Language, vol.11, pp.73-89, 1997.

R. M. Yusof, Perkaitan bahasa melayu dan bahasa iban: Satu tinjauan ringkas, Jurnal Bahasa, vol.3, issue.3, 2003.

Y. Zhang and J. R. Glass, Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, pp.398-403, 2009.
DOI : 10.1109/ASRU.2009.5372931