Go to:
Logótipo
Você está em: Start > Publications > View > Recognizing textual entailment: Challenges in the Portuguese language
Map of Premises
Principal
Publication

Recognizing textual entailment: Challenges in the Portuguese language

Title
Recognizing textual entailment: Challenges in the Portuguese language
Type
Article in International Scientific Journal
Year
2018
Authors
Gil Rocha
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Journal
Vol. 9 No. 4
Pages: 1-19
ISSN: 2078-2489
Publisher: MDPI
Indexing
Publicação em ISI Web of Science ISI Web of Science
INSPEC
Other information
Authenticus ID: P-00N-S7J
Resumo (PT):
Abstract (EN): Recognizing textual entailment comprises the task of determining semantic entailment relations between text fragments. A text fragment entails another text fragment if, from the meaning of the former, one can infer the meaning of the latter. If such relation is bidirectional, then we are in the presence of a paraphrase. Automatically recognizing textual entailment relations captures major semantic inference needs in several natural language processing (NLP) applications. As in many NLP tasks, textual entailment corpora for English abound, while the same is not true for more resource-scarce languages such as Portuguese. Exploiting what seems to be the only Portuguese corpus for textual entailment and paraphrases (the ASSIN corpus), in this paper, we address the task of automatically recognizing textual entailment (RTE) and paraphrases from text written in the Portuguese language, by employing supervised machine learning techniques. We employ lexical, syntactic and semantic features, and analyze the impact of using semantic-based approaches in the performance of the system. We then try to take advantage of the bi-dialect nature of ASSIN to compensate its limited size. With the same aim, we explore modeling the task of recognizing textual entailment and paraphrases as a binary classification problem by considering the bidirectional nature of paraphrases as entailment relationships. Addressing the task as a multi-class classification problem, we achieve results in line with the winner of the ASSIN Challenge. In addition, we conclude that semantic-based approaches are promising in this task, and that combining data from European and Brazilian Portuguese is less straightforward than it may initially seem. The binary classification modeling of the problem does not seem to bring advantages to the original multi-class model, despite the outstanding results obtained by the binary classifier for recognizing textual entailments. © 2018 by the authors.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 19
Documents
File name Description Size
information-09-00076 493.72 KB
Related Publications

Of the same authors

On Sentence Representations for Propaganda Detection: From Handcrafted Features to Word Embeddings (2019)
Article in International Conference Proceedings Book
André Ferreira Cruz; Gil Rocha; Henrique Lopes Cardoso
On Document Representations for Detection of Biased News Articles (2020)
Article in International Conference Proceedings Book
André Ferreira Cruz; Gil Rocha; Henrique Lopes Cardoso
Improving Transfer Learning in Unsupervised Language Adaptation (2021)
Article in International Conference Proceedings Book
Gil Rocha; Henrique Lopes Cardoso
Exploring Spanish Corpora for Portuguese Coreference Resolution (2018)
Article in International Conference Proceedings Book
André Ferreira Cruz; Gil Rocha; Henrique Lopes Cardoso
Cross-Lingual Annotation Projection for Argument Mining in Portuguese (2021)
Article in International Conference Proceedings Book
Afonso Sousa; Bernardo Leite; Gil Rocha; Henrique Lopes Cardoso

See all (10)

Of the same journal

Teaching Software Engineering Topics Through Pedagogical Game Design Patterns: An Empirical Study (2020)
Article in International Scientific Journal
Nuno Flores; Paiva, ACR; Cruz, N
Screening System for Cardiac Problems through Non-Invasive Identification of Blood Pressure Waveform (2020)
Article in International Scientific Journal
Paulo Abreu; Fernando Carneiro; Maria Teresa Restivo
Robust Complaint Processing in Portuguese (2021)
Article in International Scientific Journal
Henrique Lopes Cardoso; Osorio, TF; Barbosa, LV; Rocha, G; reis, lp; Machado, JP; Oliveira, AM
Prototype to Increase Crosswalk Safety by Integrating Computer Vision with ITS-G5 Technologies (2020)
Article in International Scientific Journal
Gaspar, F; Guerreiro, V; Loureiro, P; Costa, P; Mendes, S; Rabadao, C
On the Implementation of a Cloud-Based Computing Test Bench Environment for Prolog Systems (2017)
Article in International Scientific Journal
Goncalves, R; Miguel Areias; Ricardo Rocha

See all (17)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Medicina Dentária da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-07-22 at 19:12:45 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book