Publications

You can also browse my Google Scholar profile.

Journal

Kak Soky, Sheng Li, Chenhui Chu, Tatsuya Kawahara. Finetuning Pretrained Model with Embedding of Domain and Language Information for ASR of Very Low-Resource Settings, International Journal of Asian Language Processing, 2023
Kak Soky, Masato Mimura, Chenhui Chu, Tatsuya Kawahara, Sheng Li, Chenchen Ding, Sethserey Sam. TriECCC: Trilingual Corpus of the Extraordinary Chambers in the Courts of Cambodia for Speech Recognition and Translation Studies, International Journal of Asian Language Processing, 2022

Conference papers

K. Soky, S. Li, C. Chu and T. Kawahara, “Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-Based Speech Recognition of Low-Resource Language,” ICASSP 2023, Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10095644.
K. Soky, Z. Gong, S. Li, NICT-Tib1: A Public Speech Corpus of Lhasa Dialect for Benchmarking Tibetan Language Speech Recognition Systems, O-COCOSDA2022, 2022
K. Soky, S. Li, M. Mimura, C. Chu, T. Kawahara, Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism, Interspeech, 2022
K. Soky, M. Mimura, T. Kawahara, S. Li, C. Ding, C. Chu, S. Sam, KHMER SPEECH TRANSLATION CORPUS OF THE EXTRAORDINARY CHAMBERS IN THE COURTS OF CAMBODIA (ECCC), O-COCOSDA2021, 2021
K. Soky, S. Li, M. Mimura, C. Chu, T. Kawahara, On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora, APSIPA2021, 2021
K. Soky, S. Li, M. Mimura, C. Chu, T. Kawahara, Comparison of End-to-End Models for Joint Speaker and Speech Recognition, IEICE-SP, 2021 (PDF)
K., Soky, Sheng, Li., Tatsuya, K., Sopheap, S.. Multi-lingual transformer training for Khmer automatic speech recognition. In Proc. APSIPA ASC, 2019 (PDF)
Saly, K., Soky, K., Jinfu, N., Yoshinori, S., Kato, H., & Kawai, H.. Hidden Markov Model based Speech Synthesis system for Khmer Language. In Proc. ONA2018 (URL)
K., Soky, Chea, V., Sam S.. Khmer Automatic Speech Recognition based on DNN models. In Proc. ONA2017 (PDF)
Hour, K., Soky, K., Vichet, C., Sethserey, S.. OCR post-processing for Khmer language: Error detection using Conditional Random Field. In Proc. ONA2017 (PDF)
Soky, K., Lu, X., Shen, P., Kato, H., Kawai, H., Vanna, C., & Chea, V.. Building WFST based Grapheme to Phoneme Conversion for Khmer. In Proc. 3rd Khmer Natural Language Processing. 2016 (PDF)