Please use this identifier to cite or link to this item: https://ruomoplus.lib.uom.gr/handle/8000/2037
DC FieldValueLanguage
dc.contributor.authorKalouptsoglou, Iliasel
dc.contributor.authorSiavvas, Miltiadisel
dc.contributor.authorAmpatzoglou, Apostolosel
dc.contributor.authorKehagias, Dionysiosel
dc.contributor.authorChatzigeorgiou, Alexanderel
dc.date.accessioned2025-11-01T12:48:37Z-
dc.date.available2025-11-01T12:48:37Z-
dc.date.issued2024-10-29-
dc.identifier.isbn[9798350365658]-
dc.identifier.urihttps://ruomoplus.lib.uom.gr/handle/8000/2037-
dc.description.abstractNowadays, security testing is an integral part of the testing activities during the software development life-cycle. Over the years, various techniques have been proposed to identify security issues in the source code, especially vulnerabilities, which can be exploited and cause severe damages. Recently, Machine Learning (ML) techniques capable of predicting vulnerable software components and indicating high-risk areas have appeared, among others, accelerating the effort demanding and time consuming process of vulnerability localization. For effective subsequent vulnerability elimination, there is a need for automating the process of labeling detected vulnerabilities in vulnerability categories i.e., identifying the type of the vulnerability. Several techniques have been proposed over the years for automating the labeling process of vulnerabilities. However, the vast majority of the proposed methods attempt to identify the type of vulnerabilities based on their textual description that is provided by experts, such as the description provided by the vulnerability report in the National Vulnerability Database, and not on their actual source code, hindering their full automation and the vulnerability categorization from the software testing phase. This work examines the vulnerability classification directly from the source code during the vulnerability detection step. Moreover, this way, a vulnerability detection method will be able to provide complete information and interpretation of its findings. Leveraging the advances in the field of Artificial Intelligence and Natural Language Processing, we construct and compare several multi-class classification models for categorizing vulnerable code snippets. The results highlight the importance of the context-aware embeddings of the pre-trained Transformer-based models, as well as the significance of transfer learning from a programming language-related domain.el
dc.language.isoenel
dc.publisherIEEE-
dc.rightsCC0 1.0 Παγκόσμια*
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Διεθνές*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectFRASCATI__Natural sciences__Computer and information sciencesel
dc.subjectFRASCATI__Engineering and technology__Electrical engineering, Electronic engineering, Information engineeringel
dc.subject.othercontextual word embeddingel
dc.subject.otherlarge language modelsel
dc.subject.othernatural language processingel
dc.subject.othersecurity testingel
dc.subject.othertransfer learningel
dc.subject.othervulnerability classificationel
dc.titleVulnerability Classification on Source Code Using Text Mining and Deep Learning Techniquesel
dc.typeconference paperel
dc.relation.conference2024 IEEE 24th International Conference on Software Quality, Reliability, and Security Companion (QRS-C)el
dc.identifier.doi10.1109/QRS-C63300.2024.00017-
dc.identifier.scopus2-s2.0-85209784889-
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/85209784889-
dc.description.startpage47el
dc.description.endpage56el
dc.contributor.departmentDepartment of Applied Informaticsel
dc.contributor.departmentDepartment of Applied Informaticsel
dc.contributor.departmentDepartment of Applied Informaticsel
dc.description.volumetitleProceedings of the 2024 IEEE 24th International Conference on Software Quality, Reliability, and Security Companion (QRS-C)el
item.fulltextWith Fulltext-
item.openairecristypehttp://purl.org/coar/resource_type/c_5794-
item.cerifentitytypePublications-
item.openairetypeconference paper-
item.languageiso639-1en-
item.grantfulltextopen-
crisitem.author.deptUniversity of Macedonia-
crisitem.author.deptUniversity of Macedonia-
crisitem.author.deptUniversity of Macedonia-
crisitem.author.deptUniversity of Macedonia-
crisitem.author.deptUniversity of Macedonia-
crisitem.author.departmentDepartment of Applied Informatics-
crisitem.author.departmentDepartment of Applied Informatics-
crisitem.author.departmentDepartment of Applied Informatics-
crisitem.author.orcid0000-0002-5118-2508-
crisitem.author.orcid0000-0002-5764-7302-
crisitem.author.orcid0000-0002-6912-3493-
crisitem.author.orcid0000-0002-5381-8418-
crisitem.author.facultySchool of Information Sciences-
crisitem.author.facultySchool of Information Sciences-
crisitem.author.facultySchool of Information Sciences-
Appears in Collections:Conference proceedings
Files in This Item:
File Description SizeFormat
kalouptsoglou2024qrs.pdf313,86 kBAdobe PDF
View/Open
Show simple item record

SCOPUSTM   
Citations

4
checked on May 13, 2026

Page view(s)

89
checked on May 14, 2026

Download(s)

80
checked on May 14, 2026

Google ScholarTM

Check

Altmetric

Altmetric


This item is licensed under a Creative Commons License Creative Commons