Resumo (PT):
Abstract (EN):
The development of a robust annotation scheme
and corresponding guidelines is crucial for pro-
ducing annotated datasets that advance both lin-
guistic and computational research. This paper
presents a case study that outlines a method-
ology for designing an annotation scheme and
its guidelines, specifically aimed at represent-
ing morphosyntactic and semantic information
regarding temporal features, as well as medi-
cal information in medical reports written in
Portuguese. We detail a multi-step process that
includes reviewing existing frameworks, con-
ducting an annotation experiment to determine
the optimal approach, and designing a model
based on these findings. We validated the ap-
proach through a pilot experiment where we
assessed the reliability and applicability of the
annotation scheme and guidelines. In this ex-
periment, two annotators independently anno-
tated a patient’s medical report consisting of six
documents using the proposed model, while a
curator established the ground truth. The analy-
sis of inter-annotator agreement and the annota-
tion results enabled the identification of sources
of human variation and provided insights for
further refinement of the annotation scheme
and guidelines.
Idioma:
Inglês
Tipo (Avaliação Docente):
Científica
Contacto:
Disponível em: https://aclanthology.org/2025.law-1.28/
Notas:
Nº de páginas:
11