Você está em: Start > Publications > View > Normalized Google Distance in the identification and characterization of health queries

Map of Premises

Publication

Publication Search

Publications

Normalized Google Distance in the identification and characterization of health queries

Title

Normalized Google Distance in the identification and characterization of health queriesExport publication in the APA format Export publication in the EXCEL format Export publication in the RIS format

Type

Article in International Conference Proceedings Book

Date

2019

Title

Normalized Google Distance in the identification and characterization of health queries

Type

Article in International Conference Proceedings Book

Year

2019

Authors

Carla Teixeira Lopes

(Author)

FEUP

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page

Diogo Moura

(Author)

Other

The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID

Conference proceedings International

Title: Iberian Conference on Information Systems and Technologies, CISTI Search for Conference Proceedings Publications

Pages: 1-4

14th Iberian Conference on Information Systems and Technologies, CISTI 2019

19 June 2019 through 22 June 2019

Indexing

ISI Web of Knowledge - 0 Citations

Scopus - 1 Citation

Other information

Authenticus ID: P-00Q-XPE

DOI: 10.23919/cisti.2019.8760964

Abstract (EN): Classifying web queries into a set of categories is a crucial task to better understand the user's intent behind a query, contextualize their search and provide more relevant results to the user. However, web queries are typically short and ambiguous making the query classification a non-trivial problem. In this article, we present a new automatic approach for identifying and characterizing queries in the health domain. This method makes use of the search engine counts through a semantic similarity measure called Normalized Google Distance (NGD) combined with Support Vector Machines to classify queries into three dimensions: health-related, severity and semantic type. To evaluate our methods, we used two datasets in different languages, Portuguese and English, and built another for evaluating the last dimension. Overall, the results achieved were satisfactory. The most generic classification obtains better results than more specific ones. The NGD proved to be a valuable assent in query classification.

Language: English

Type (Professor's evaluation): Scientific

No. of pages: 4

Documents

We could not find any documents associated to the publication.

Recommend this page Top

Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto I Terms and Conditions I Acessibility I Index A-Z
Page created on: 2025-07-28 at 14:38:00 | Privacy Policy | Personal Data Protection Policy | Whistleblowing