Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese
Publication

Publications

PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese

Title
PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese
Type
Article in International Scientific Journal
Year
2024
Authors
Osório, TF
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Leite, B
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Gomes, L
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Rodrigues, J
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Santos, R
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Branco, A
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Indexing
Publicação em Scopus Scopus - 0 Citations
Other information
Authenticus ID: P-010-Z9J
Abstract (EN): Leveraging research on the neural modelling of Portuguese, we contribute a collection of datasets for an array of language processing tasks and a corresponding collection of fine-tuned neural language models on these downstream tasks. To align with mainstream benchmarks in the literature, originally developed in English, and to kick start their Portuguese counterparts, the datasets were machine-translated from English with a state-of-the-art translation engine. The resulting PORTULAN ExtraGLUE benchmark is a basis for research on Portuguese whose improvement can be pursued in future work. Similarly, the respective fine-tuned neural language models, developed with a low-rank adaptation approach, are made available as baselines that can stimulate future work on the neural processing of Portuguese. All datasets and models have been developed and are made available for two variants of Portuguese: European and Brazilian. © 2024 ELRA Language Resource Association: CC BY-NC 4.0.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 10
Documents
File name Description Size
2404.05333[1] 201.90 KB
Related Publications

Of the same authors

Fostering the Ecosystem of Open Neural Encoders for Portuguese with Albertina PT* Family (2024)
Article in International Scientific Journal
Santos, R; Rodrigues, J; Gomes, L; Silva, J; Branco, A; Henrique Lopes Cardoso; Osório, TF; Leite, B
Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-10 at 21:23:41 | Privacy Policy | Personal Data Protection Policy | Whistleblowing