Abstract (EN):
Coreference resolution aims to automatically identify all linguistic expressions in a text that refer to the same entity. Following the mention-pair model, we apply machine learning techniques to coreference resolution in text written in Portuguese. Using a modest annotated corpus, we highlight the impact that different training-set creation strategies have on the quality of the system's predictions. We conclude that enriching the system with semantic-based features significantly improves its overall performance.
Language:
English
Type (Professor's evaluation):
Scientific
No. of pages:
13