Saltar para:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Início > Publicações > Visualização > Recognizing textual entailment: Challenges in the Portuguese language

Recognizing textual entailment: Challenges in the Portuguese language

Título
Recognizing textual entailment: Challenges in the Portuguese language
Tipo
Artigo em Revista Científica Internacional
Ano
2018
Autores
Gil Rocha
(Autor)
Outra
A pessoa não pertence à instituição. A pessoa não pertence à instituição. A pessoa não pertence à instituição. Sem AUTHENTICUS Sem ORCID
Revista
Vol. 9 4
Páginas: 1-19
ISSN: 2078-2489
Editora: MDPI
Indexação
Publicação em ISI Web of Science ISI Web of Science
INSPEC
Outras Informações
ID Authenticus: P-00N-S7J
Resumo (PT):
Abstract (EN): Recognizing textual entailment comprises the task of determining semantic entailment relations between text fragments. A text fragment entails another text fragment if, from the meaning of the former, one can infer the meaning of the latter. If such relation is bidirectional, then we are in the presence of a paraphrase. Automatically recognizing textual entailment relations captures major semantic inference needs in several natural language processing (NLP) applications. As in many NLP tasks, textual entailment corpora for English abound, while the same is not true for more resource-scarce languages such as Portuguese. Exploiting what seems to be the only Portuguese corpus for textual entailment and paraphrases (the ASSIN corpus), in this paper, we address the task of automatically recognizing textual entailment (RTE) and paraphrases from text written in the Portuguese language, by employing supervised machine learning techniques. We employ lexical, syntactic and semantic features, and analyze the impact of using semantic-based approaches in the performance of the system. We then try to take advantage of the bi-dialect nature of ASSIN to compensate its limited size. With the same aim, we explore modeling the task of recognizing textual entailment and paraphrases as a binary classification problem by considering the bidirectional nature of paraphrases as entailment relationships. Addressing the task as a multi-class classification problem, we achieve results in line with the winner of the ASSIN Challenge. In addition, we conclude that semantic-based approaches are promising in this task, and that combining data from European and Brazilian Portuguese is less straightforward than it may initially seem. The binary classification modeling of the problem does not seem to bring advantages to the original multi-class model, despite the outstanding results obtained by the binary classifier for recognizing textual entailments. © 2018 by the authors.
Idioma: Inglês
Tipo (Avaliação Docente): Científica
Nº de páginas: 19
Documentos
Nome do Ficheiro Descrição Tamanho
information-09-00076 493.72 KB
Publicações Relacionadas

Dos mesmos autores

On Sentence Representations for Propaganda Detection: From Handcrafted Features to Word Embeddings (2019)
Artigo em Livro de Atas de Conferência Internacional
André Ferreira Cruz; Gil Rocha; Henrique Lopes Cardoso
On Document Representations for Detection of Biased News Articles (2020)
Artigo em Livro de Atas de Conferência Internacional
André Ferreira Cruz; Gil Rocha; Henrique Lopes Cardoso
Improving Transfer Learning in Unsupervised Language Adaptation (2021)
Artigo em Livro de Atas de Conferência Internacional
Gil Rocha; Henrique Lopes Cardoso
Exploring Spanish Corpora for Portuguese Coreference Resolution (2018)
Artigo em Livro de Atas de Conferência Internacional
André Ferreira Cruz; Gil Rocha; Henrique Lopes Cardoso
Cross-Lingual Annotation Projection for Argument Mining in Portuguese (2021)
Artigo em Livro de Atas de Conferência Internacional
Afonso Sousa; Bernardo Leite; Gil Rocha; Henrique Lopes Cardoso

Ver todas (10)

Da mesma revista

Teaching Software Engineering Topics Through Pedagogical Game Design Patterns: An Empirical Study (2020)
Artigo em Revista Científica Internacional
Nuno Flores; Paiva, ACR; Cruz, N
Screening System for Cardiac Problems through Non-Invasive Identification of Blood Pressure Waveform (2020)
Artigo em Revista Científica Internacional
Paulo Abreu; Fernando Carneiro; Maria Teresa Restivo
Robust Complaint Processing in Portuguese (2021)
Artigo em Revista Científica Internacional
Henrique Lopes Cardoso; Osorio, TF; Barbosa, LV; Rocha, G; reis, lp; Machado, JP; Oliveira, AM
Prototype to Increase Crosswalk Safety by Integrating Computer Vision with ITS-G5 Technologies (2020)
Artigo em Revista Científica Internacional
Gaspar, F; Guerreiro, V; Loureiro, P; Costa, P; Mendes, S; Rabadao, C
On the Implementation of a Cloud-Based Computing Test Bench Environment for Prolog Systems (2017)
Artigo em Revista Científica Internacional
Goncalves, R; Miguel Areias; Ricardo Rocha

Ver todas (17)

Recomendar Página Voltar ao Topo
Copyright 1996-2025 © Centro de Desporto da Universidade do Porto I Termos e Condições I Acessibilidade I Índice A-Z
Página gerada em: 2025-10-10 às 04:39:33 | Política de Privacidade | Política de Proteção de Dados Pessoais | Denúncias | Livro Amarelo Eletrónico