Please use this identifier to cite or link to this item:
https://ruomoplus.lib.uom.gr/handle/8000/1129| Title: | PCA-based Similarity Search: Pre-processing & Distance Measures | Authors: | Karamitopoulos, Leonidas Evangelidis, Georgios |
Author Department Affiliations: | Department of Applied Informatics | Author School Affiliations: | School of Information Sciences | Subjects: | FRASCATI__Natural sciences__Computer and information sciences | Issue Date: | 2007 | Publisher: | Springer | Volume Title: | Proceedings of the 2nd International Scientific Conference, eRA: The Contribution of Information Technology to Science, Economy, Society and Education, Athens, Greece | Start page: | 318 | End page: | 327 | Abstract: | Time series appear frequently in several domains such as in multimedia, business,industry or medicine. A multivariate time series dataset is a set of co-evolving timeseries that relates to a specific object (e.g. the motion of a person). The increasing needfor analyzing efficiently the huge amount of this information leads to the application ofdata mining techniques. At the core of these techniques lies the concept of similaritysince most of them require searching for similar patterns, such as in query by content,clustering or classification. Nevertheless, when dealing with multivariate time seriesdatasets, similarity should be sought between the whole datasets and not only betweenthe individual time series, since there are usually important correlations among themthat shouldn’t be lost. In this paper, we discuss the application of Principal ComponentAnalysis (PCA) on multivariate time series datasets for the purpose of similarity search.PCA is applied in order to reduce the high dimensionality of such data while retaining asmuch as possible of the variation present in the data. We provide a thorough descriptionof the pre-processing phase with respect to PCA assumptions and limitations, as wellas, to the most frequently appeared distortions in data. Furthermore, we experimentallyexplore the potential usefulness of incorporating Piecewise Aggregate Approximationinto this phase. Finally, we discuss the various aspects of the proposed PCA-basedsimilarity (dissimilarity) measures. |
URI: | http://ikaros.teipir.gr/era/era2/fullpap/B26.doc https://ruomoplus.lib.uom.gr/handle/8000/1129 |
Rights: | Attribution-NonCommercial-ShareAlike 4.0 International | Corresponding Item Departments: | Department of Applied Informatics |
| Appears in Collections: | Conference proceedings |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 2007_ERA_Karamitopoulos.pdf | 191,6 kB | Adobe PDF | View/Open |
Page view(s)
206
checked on Jun 16, 2026
Download(s)
61
checked on Jun 16, 2026
Google ScholarTM
Check
This item is licensed under a Creative Commons License