Publication

Leveraging loanword constraints for improving machine translation in a low-resource multilingual context

Type
Article in International Conference Proceedings Book
Year
2025
Authors
Ali, Felermino D. M. A.
(Author)
Conference proceedings International
Pages: 27631-27645
2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025)
Suzhou, China, 2025
Indexing
Crossref
Other information
Abstract (EN): This research investigates how to improve machine translation systems for low-resource languages by integrating loanword constraints as external linguistic knowledge. Focusing on the Portuguese-Emakhuwa language pair, which exhibits significant lexical borrowing, we address the challenge of effectively adapting loanwords during the translation process. To tackle this, we propose a novel approach that augments source sentences with loanword constraints, explicitly linking source-language loanwords to their target-language equivalents. Then, we perform supervised fine-tuning on multilingual neural machine translation models and multiple Large Language Models of different sizes. Our results demonstrate that incorporating loanword constraints leads to significant improvements in translation quality as well as in handling loanword adaptation correctly in target languages, as measured by different machine translation metrics. This approach offers a promising direction for improving machine translation performance in low-resource settings characterized by frequent lexical borrowing.
Language: English
Type (Professor's evaluation): Scientific
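The abstract describes augmenting source sentences with loanword constraints that link source-language loanwords to their target-language equivalents. A minimal sketch of what such augmentation could look like follows. The constraint format (a `<sep>` marker followed by `source -> target` pairs), the function name `augment_with_constraints`, and the lexicon entries are all assumptions made for illustration; they are not taken from the paper, and the target-side forms are unverified placeholders.

```python
import re

# Hypothetical Portuguese -> Emakhuwa loanword lexicon.
# The target forms are illustrative placeholders, not verified entries.
LOANWORDS = {
    "escola": "exikola",
    "carro": "ekaaro",
}


def augment_with_constraints(source: str, lexicon: dict[str, str],
                             sep: str = " <sep> ") -> str:
    """Append 'source -> target' constraints for each loanword found in the source.

    The output format is an assumption for this sketch, not the paper's format.
    """
    # Lowercase and split into word tokens so punctuation does not block matches.
    tokens = re.findall(r"\w+", source.lower())

    # Collect matching loanwords in order of first appearance, without repeats.
    found = []
    for token in tokens:
        if token in lexicon and token not in found:
            found.append(token)

    # Leave sentences without known loanwords unchanged.
    if not found:
        return source

    constraints = "; ".join(f"{word} -> {lexicon[word]}" for word in found)
    return f"{source}{sep}{constraints}"


if __name__ == "__main__":
    print(augment_with_constraints("O carro está perto da escola.", LOANWORDS))
```

In a setup like this, the augmented strings would replace the plain source sentences in the fine-tuning data, so the model could learn to copy the supplied target forms into its output. Whether constraints are appended, prepended, or inlined next to each loanword is a design choice this sketch does not settle.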
Documents
File name Description Size
2025.emnlp-main.1406 644.73 KB
Related Publications

Of the same authors

SSA-COMET: Do LLMs outperform learned metrics in evaluating MT for under-resourced African languages? (2025)
Article in International Conference Proceedings Book
Li, Senyu; Wang, Jiayi; Ali, Felermino D. M. A.; Cherry, Colin; Deutsch, Daniel; Briakou, Eleftheria; Sousa-Silva, Rui; Cardoso, Henrique Lopes; Stenetorp, Pontus; Adelani, David Ifeoluwa
Expanding FLORES+ benchmark for more low-resource settings: Portuguese-Emakhuwa machine translation evaluation (2024)
Article in International Conference Proceedings Book
Ali, Felermino; Cardoso, Henrique Lopes; Sousa-Silva, Rui
Evaluating WMT 2025 Metrics shared task submissions on the SSA-MTE African challenge set (2025)
Article in International Conference Proceedings Book
Li, Senyu; Ali, Felermino D. M. A.; Wang, Jiayi; Sousa-Silva, Rui; Cardoso, Henrique Lopes; Stenetorp, Pontus; Cherry, Colin; Adelani, David Ifeoluwa
Detecting loanwords in Emakhuwa: an extremely low-resource Bantu language exhibiting significant borrowing from Portuguese (2024)
Article in International Conference Proceedings Book
Ali, Felermino; Cardoso, Henrique Lopes; Sousa-Silva, Rui
Building resources for Emakhuwa: machine translation and news classification benchmarks (2024)
Article in International Conference Proceedings Book
Ali, Felermino; Cardoso, Henrique Lopes; Sousa-Silva, Rui