Please use this identifier to cite or link to this item: https://ruomoplus.lib.uom.gr/handle/8000/2231
Title: Benchmarking Advanced Text Anonymisation Methods: A Comparative Study on Novel and Traditional Approaches
Authors: Asimopoulos, Dimitris 
Siniosoglou, Ilias 
Argyriou, Vasileios 
Karamitsou, Thomai 
Fountoukidis, Eleftherios 
Goudos, Sotirios K. 
Moscholios, Ioannis 
Psannis, Konstantinos 
Sarigiannidis, Panagiotis 
Author Department Affiliations: Department of Applied Informatics 
Author School Affiliations: School of Information Sciences 
Subjects: FRASCATI__Natural sciences__Computer and information sciences
Keywords: CRF
Data anonymisation
LLM
LSTM
Microsoft Presidio
NER
text anonymisation
Transformers
Issue Date: 1-Jan-2024
Publisher: ΙΕΕΕ
Volume Title: 2024 13th International Conference on Modern Circuits and Systems Technologies (MOCAST)
Start page: 1
End page: 6
Conference: 13th International Conference on Modern Circuits and Systems Technologies (MOCAST) 
Abstract: 
In the realm of data privacy, the ability to effectively anonymise text is paramount. With the proliferation of deep learning and, in particular, transformer architectures, there is a burgeoning interest in leveraging these advanced models for text anonymisation tasks. This paper presents a comprehensive benchmarking study comparing the performance of transformer-based models and Large Language Models(LLM) against traditional architectures for text anonymisation. Utilising the CoNLL-2003 dataset, known for its robustness and diversity, we evaluate several models. Our results showcase the strengths and weaknesses of each approach, offering a clear perspective on the efficacy of modern versus traditional methods. Notably, while modern models exhibit advanced capabilities in capturing contextual nuances, certain traditional architectures still keep high performance. This work aims to guide researchers in selecting the most suitable model for their anonymisation needs, while also shedding light on potential paths for future advancements in the field.
URI: https://ruomoplus.lib.uom.gr/handle/8000/2231
ISBN: [9798350385427]
DOI: 10.1109/MOCAST61810.2024.10615642
Rights: Attribution-NonCommercial-NoDerivatives 4.0 Διεθνές
Corresponding Item Departments: Department of Applied Informatics
Appears in Collections:Conference proceedings

Files in This Item:
File Description SizeFormat
Benchmarking Advanced Text Anonymisation Methods.pdf390,8 kBAdobe PDF
View/Open
Show full item record

SCOPUSTM   
Citations

2
checked on Dec 12, 2025

Page view(s)

38
checked on Dec 16, 2025

Download(s)

10
checked on Dec 16, 2025

Google ScholarTM

Check

Altmetric

Altmetric


This item is licensed under a Creative Commons License Creative Commons