Você está em: Start > Publications > View > Exploring Transformers for Multi-Label Classification of Java Vulnerabilities

Map of Premises

Publication

Publication Search

Publications

Exploring Transformers for Multi-Label Classification of Java Vulnerabilities

Title

Exploring Transformers for Multi-Label Classification of Java VulnerabilitiesExport publication in the APA format Export publication in the EXCEL format Export publication in the RIS format

Type

Article in International Conference Proceedings Book

Date

2022

Title

Exploring Transformers for Multi-Label Classification of Java Vulnerabilities

Type

Article in International Conference Proceedings Book

Year

2022

Authors

Mamede, C

(Author)

Other

The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID

Pinconschi, E

(Author)

Other

The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID

Rui Abreu

(Author)

FEUP

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page

José Campos

(Author)

FEUP

View Personal Page Send message Search for Participant Publications View Authenticus page View ORCID page

Conference proceedings International

Title: IEEE International Conference on Software Quality, Reliability and Security, QRS Search for Conference Proceedings Publications

Pages: 43-52

22nd IEEE International Conference on Software Quality, Reliability and Security, QRS 2022

Virtual, Online, 5 December 2022 through 9 December 2022

Indexing

ISI Web of Knowledge - 0 Citations

Scopus - 0 Citations

Other information

Authenticus ID: P-00Y-8M0

DOI: 10.1109/qrs57517.2022.00015

Abstract (EN): Deep learning (DL) techniques have demonstrated potential in reasoning complex patterns of vulnerable code from high-level abstractions. Recent advancements in the area, such as the introduction of transformer-based models, like BERT, help overcome the problem of the available vulnerability detection datasets being too small to enable most DL models to capture all relevant patterns. They mitigate the challenge by leveraging knowledge from a general domain to solve problems in specific domains. In this paper, we explore different BERT-based models for multi-label classification of vulnerabilities in Java on a synthetic dataset. The models yield up to 99% in accuracy and 94% in f1-score. We remove biases in the training dataset and observe drops of up to 13% of the f1-score. We further assess the generalizability of the models on realistic samples and notice that one model, in particular, predicted unknown vulnerabilities with an f1-score of nearly 85%.

Language: English

Type (Professor's evaluation): Scientific

No. of pages: 10

Documents

We could not find any documents associated to the publication.

Recommend this page Top

Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto I Terms and Conditions I Acessibility I Index A-Z
Page created on: 2025-08-08 at 00:02:15 | Privacy Policy | Personal Data Protection Policy | Whistleblowing