Please use this identifier to cite or link to this item: https://ruomoplus.lib.uom.gr/handle/8000/1129
Title: PCA-based Similarity Search: Pre-processing & Distance Measures
Authors: Karamitopoulos, Leonidas 
Evangelidis, Georgios 
Author Department Affiliations: Department of Applied Informatics 
Author School Affiliations: School of Information Sciences 
Subjects: FRASCATI__Natural sciences__Computer and information sciences
Issue Date: 2007
Publisher: Springer
Volume Title: Proceedings of the 2nd International Scientific Conference, eRA: The Contribution of Information Technology to Science, Economy, Society and Education, Athens, Greece
Start page: 318
End page: 327
Abstract: 
Time series appear frequently in several domains such as in multimedia, business,industry or medicine. A multivariate time series dataset is a set of co-evolving timeseries that relates to a specific object (e.g. the motion of a person). The increasing needfor analyzing efficiently the huge amount of this information leads to the application ofdata mining techniques. At the core of these techniques lies the concept of similaritysince most of them require searching for similar patterns, such as in query by content,clustering or classification. Nevertheless, when dealing with multivariate time seriesdatasets, similarity should be sought between the whole datasets and not only betweenthe individual time series, since there are usually important correlations among themthat shouldn’t be lost. In this paper, we discuss the application of Principal ComponentAnalysis (PCA) on multivariate time series datasets for the purpose of similarity search.PCA is applied in order to reduce the high dimensionality of such data while retaining asmuch as possible of the variation present in the data. We provide a thorough descriptionof the pre-processing phase with respect to PCA assumptions and limitations, as wellas, to the most frequently appeared distortions in data. Furthermore, we experimentallyexplore the potential usefulness of incorporating Piecewise Aggregate Approximationinto this phase. Finally, we discuss the various aspects of the proposed PCA-basedsimilarity (dissimilarity) measures.
URI: http://ikaros.teipir.gr/era/era2/fullpap/B26.doc
https://ruomoplus.lib.uom.gr/handle/8000/1129
Rights: Attribution-NonCommercial-ShareAlike 4.0 International
Corresponding Item Departments: Department of Applied Informatics
Appears in Collections:Conference proceedings

Files in This Item:
File Description SizeFormat
2007_ERA_Karamitopoulos.pdf191,6 kBAdobe PDF
View/Open
Show full item record

Page view(s)

206
checked on Jun 16, 2026

Download(s)

61
checked on Jun 16, 2026

Google ScholarTM

Check


This item is licensed under a Creative Commons License Creative Commons