Abstract (EN):
We propose an unsupervised method for propagating automatically extracted fine-grained topic labels among news items, in order to improve their topic description for a subsequent text classification procedure. The method compares vector representations of news items and assigns to each news item the label of its closest neighbour that has a different topic label. The results show that high precision can be achieved when propagating the top-ranked topic label, and that 2-gram and 3-gram feature representations yield the highest precision.
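The propagation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the tokenization, the similarity measure, or any threshold, so the sketch assumes word-level 2-grams and 3-grams, raw counts, and cosine similarity. The function names (`word_ngrams`, `cosine`, `propagate_labels`) and the sample news items are hypothetical.

```python
from collections import Counter
from math import sqrt


def word_ngrams(text, n_values=(2, 3)):
    """Count word n-grams (assumption: word-level, lowercased, whitespace split)."""
    tokens = text.lower().split()
    grams = Counter()
    for n in n_values:
        for i in range(len(tokens) - n + 1):
            grams[" ".join(tokens[i:i + n])] += 1
    return grams


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (assumed measure)."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def propagate_labels(items):
    """For each (text, label) item, return the label of the most similar item
    whose label differs from its own, or None if no such item has any overlap."""
    vectors = [word_ngrams(text) for text, _ in items]
    propagated = []
    for i, (_, label) in enumerate(items):
        best_label, best_sim = None, 0.0
        for j, (_, other_label) in enumerate(items):
            if i == j or other_label == label:
                continue  # skip the item itself and items with the same label
            sim = cosine(vectors[i], vectors[j])
            if sim > best_sim:
                best_label, best_sim = other_label, sim
        propagated.append(best_label)
    return propagated


# Hypothetical news items, each with one automatically extracted topic label.
items = [
    ("the central bank raised interest rates again", "interest rates"),
    ("the central bank raised interest rates to curb inflation", "inflation"),
    ("the home team won the cup final", "cup final"),
]
result = propagate_labels(items)
```

In this toy input the first two items share several n-grams and therefore exchange labels, while the third shares none with the others and receives no propagated label.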
Language:
English
Type (Professor's evaluation):
Scientific
Contact:
las@fe.up.pt; ssn@fe.up.pt; jft@fe.up.pt; eco@fe.up.pt
No. of pages:
4
License type: