Go to:
Logótipo
Você está em: Start > Publications > View > SSA-COMET: Do LLMs outperform learned metrics in evaluating MT for under-resourced African languages?
Publication

SSA-COMET: Do LLMs outperform learned metrics in evaluating MT for under-resourced African languages?

Title
SSA-COMET: Do LLMs outperform learned metrics in evaluating MT for under-resourced African languages?
Type
Article in International Conference Proceedings Book
Year
2025
Authors
Li, Senyu
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Wang, Jiayi
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Ali, Felermino D. M. A.
(Author)
Other
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications Without AUTHENTICUS Without ORCID
Cherry, Colin
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Deutsch, Daniel
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Briakou, Eleftheria
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Stenetorp, Pontus
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Adelani, David Ifeoluwa
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Conference proceedings International
Pages: 12991-13010
2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025)
Suzhou, China, 2025
Indexing
Crossref
Other information
Resumo (PT):
Abstract (EN): Evaluating machine translation (MT) quality for under-resourced African languages remains a significant challenge, as existing metrics often suffer from limited language coverage and poor performance in low-resource settings. While recent efforts, such as AfriCOMET, have addressed some of the issues, they are still constrained by small evaluation sets, a lack of publicly available training data tailored to African languages, and inconsistent performance in extremely low-resource scenarios. In this work, we introduce SSA-MTE, a large-scale human-annotated MT evaluation (MTE) dataset covering 14 African language pairs from the News domain, with over 73,000 sentence-level annotations from a diverse set of MT systems. Based on this data, we develop SSA-COMET and SSA-COMET-QE, improved reference-based and reference-free evaluation metrics. We also benchmark prompting-based approaches using state-of-the-art LLMs like GPT-4o, Claude-3.7 and Gemini 2.5 Pro. Our experimental results show that SSA-COMET models significantly outperform AfriCOMET and are competitive with the strongest LLM Gemini 2.5 Pro evaluated in our study, particularly on low-resource languages such as Twi, Luo, and Yoruba. All resources are released under open licenses to support future research.
Language: English
Type (Professor's evaluation): Scientific
Documents
File name Description Size
2025.emnlp-main.656 1471.54 KB
Recommend this page Top
Copyright 1996-2025 © Faculdade de Desporto da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-12-03 at 18:39:24 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book