Publications

Kemal Kurniawan, Meladel Mistica, Timothy Baldwin, and Jey Han Lau (2025). Training and Evaluating with Human Label Variation: An Empirical Study (under review)
Toby Simonds, Kemal Kurniawan, and Jey Han Lau (2024). MoDEM: Mixture of Domain Expert Models. In ALTA.
Raphael Merx, Ekaterina Vylomova, Kemal Kurniawan (2024). Generating bilingual example sentences with large language models as lexicography assistants. In ALTA. [code] (Best Paper)
Kemal Kurniawan, Meladel Mistica, Timothy Baldwin, and Jey Han Lau (2024). To Aggregate or Not to Aggregate. That is the Question: A Case Study on Annotation Subjectivity in Span Prediction. In WASSA. [preprint] [code]
Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Rahmad Mahendra, Fajri Koto, Ade Romadhony, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Pascale Fung, Timothy Baldwin, Jey Han Lau, Rico Sennrich, Sebastian Ruder (2023). NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages. In EACL. [preprint] [data] (Outstanding Paper)
Kemal Kurniawan, Lea Frermann, Philip Schulz, and Trevor Cohn (2022). Unsupervised Cross-Lingual Transfer of Structured Predictors without Source Data. In NAACL. [preprint] [code]
Alham Fikri Aji, Genta Indra Winata, Fajri Koto, Samuel Cahyawijaya, Ade Romadhony, Rahmad Mahendra, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Timothy Baldwin, Jey Han Lau, and Sebastian Ruder (2022). One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia. In ACL. [preprint]
Kemal Kurniawan, Lea Frermann, Philip Schulz, and Trevor Cohn (2021). PTST-UoM at SemEval-2021 Task 10: Parsimonious Transfer for Sequence Tagging. In SemEval. [code]
Kemal Kurniawan, Lea Frermann, Philip Schulz, and Trevor Cohn (2021). PPT: Parsimonious Parser Transfer for Unsupervised Cross-Lingual Adaptation. In EACL. [preprint] [code] [video]
Kemal Kurniawan (2019). KaWAT: A Word Analogy Task Dataset for Indonesian. [code]
Kemal Kurniawan and Samuel Louvan (2018). IndoSum: A New Benchmark Dataset for Indonesian Text Summarization. In IALP. [preprint] [code]
Kemal Kurniawan and Alham Fikri Aji (2018). Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging. In IALP. [preprint] [code]
Kemal Kurniawan and Samuel Louvan (2018). Empirical Evaluation of Character-Based Model on Neural Named-Entity Recognition in Indonesian Conversational Texts. In W-NUT. [preprint]
Fariz Ikhwantri, Samuel Louvan, Kemal Kurniawan, Bagas Abisena, Valdi Rachman, Alfan Farizki Wicaksono, and Rahmad Mahendra (2018). Multi-Task Active Learning for Neural Semantic Role Labeling on Low Resource Conversational Corpus. In DeepLo. [preprint]
Kemal Kurniawan (2017). Exploring Recurrent Neural Network Grammars for Parsing Low-Resource Languages. MSc thesis. University of Edinburgh, Scotland, United Kingdom. [code]