You can also browse my Google Scholar profile.
Journal papers
- Kak Soky, Sheng Li, Chenhui Chu, Tatsuya Kawahara. Finetuning Pretrained Model with Embedding of Domain and Language Information for ASR of Very Low-Resource Settings, International Journal of Asian Language Processing, 2023
- Kak Soky, Masato Mimura, Chenhui Chu, Tatsuya Kawahara, Sheng Li, Chenchen Ding, Sethserey Sam. TriECCC: Trilingual Corpus of the Extraordinary Chambers in the Courts of Cambodia for Speech Recognition and Translation Studies, International Journal of Asian Language Processing, 2022
Conference papers
- K. Soky, S. Li, C. Chu and T. Kawahara, “Domain and Language Adaptation Using Heterogeneous Datasets for Wav2vec2.0-Based Speech Recognition of Low-Resource Language,” ICASSP 2023, Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10095644.
- K. Soky, Z. Gong, S. Li, NICT-Tib1: A Public Speech Corpus of Lhasa Dialect for Benchmarking Tibetan Language Speech Recognition Systems, O-COCOSDA2022, 2022
- K. Soky, S. Li, M. Mimura, C. Chu, T. Kawahara, Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism, Interspeech, 2022
- K. Soky, M. Mimura, T. Kawahara, S. Li, C. Ding, C. Chu, S. Sam, Khmer Speech Translation Corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC), O-COCOSDA2021, 2021
- K. Soky, S. Li, M. Mimura, C. Chu, T. Kawahara, On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora, APSIPA2021, 2021
- K. Soky, S. Li, M. Mimura, C. Chu, T. Kawahara, Comparison of End-to-End Models for Joint Speaker and Speech Recognition, IEICE-SP, 2021 (PDF)
- Soky, K., Li, S., Kawahara, T., Sopheap, S. Multi-lingual transformer training for Khmer automatic speech recognition. In Proc. APSIPA ASC, 2019 (PDF)
- Saly, K., Soky, K., Jinfu, N., Yoshinori, S., Kato, H., & Kawai, H. Hidden Markov Model based Speech Synthesis system for Khmer Language. In Proc. ONA2018 (URL)
- Soky, K., Chea, V., Sam, S. Khmer Automatic Speech Recognition based on DNN models. In Proc. ONA2017 (PDF)
- Hour, K., Soky, K., Vichet, C., Sethserey, S. OCR post-processing for Khmer language: Error detection using Conditional Random Field. In Proc. ONA2017 (PDF)
- Soky, K., Lu, X., Shen, P., Kato, H., Kawai, H., Vanna, C., & Chea, V. Building WFST based Grapheme to Phoneme Conversion for Khmer. In Proc. 3rd Khmer Natural Language Processing, 2016 (PDF)