Supervised classification of curves via a combined use of functional data analysis and tree-based methods

Published:2022-05-30 Issue:1 Volume:38 Page:419-459
ISSN:0943-4062
Container-title:Computational Statistics
language:en
Short-container-title:Comput Stat

Author:

Maturo Fabrizio, Verde Rosanna

Abstract

Technological advancement led to the development of tools to collect vast amounts of data usually recorded at temporal stamps or arriving over time, e.g. data from sensors. Common ways of analysing this kind of data also involve supervised classification techniques; however, despite constant improvements in the literature, learning from high-dimensional data is always a challenging task due to many issues such as, for example, dealing with the curse of dimensionality and looking for a trade-off between complexity and accuracy. Nowadays, research in functional data analysis (FDA) and statistical learning is very lively to address these drawbacks adequately. This study offers a supervised classification strategy that combines FDA and tree-based procedures. Specifically, we introduce functional classification trees, functional bagging, and functional random forest exploiting the functional principal components decomposition as a tool to extract new features and build functional classifiers. In addition, we introduce new tools to support the understanding of the classification rules, such as the functional empirical separation prototype, functional predicted separation prototype, and the leaves’ functional deviance. Furthermore, we suggest some possible solutions for choosing the number of functional principal components and functional classification trees to be implemented in the supervised classification procedure. This research aims to provide an approach to improve the accuracy of the functional classifier, serve the interpretation of the functional classification rules, and overcome the classical drawbacks due to the high-dimensionality of the data. An application on a real dataset regarding daily electrical power demand shows the functioning of the supervised classification proposal. A simulation study with nine scenarios highlights the performance of this approach and compares it with other functional classification methods. The results demonstrate that this line of research is exciting and promising; indeed, in addition to the benefits of the suggested interpretative tools, we exceed the previously established accuracy records on a dataset available online.
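
The core idea in the abstract, using functional principal component scores as features for a tree-based classifier, can be illustrated with a minimal sketch. This is not the authors' implementation: the simulated sine curves, the SVD-based discretised FPCA, the function names (`simulate`, `fpca_basis`, `fit_stump`, `predict_stump`), and the use of a depth-one tree (a stump) in place of a full functional classification tree are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
grid = np.linspace(0.0, 1.0, 50)  # common evaluation grid (assumed)

def simulate(n_per_class):
    """Hypothetical two-class curves: +sin and -sin with random amplitude."""
    amp = 1.0 + 0.2 * rng.standard_normal((2 * n_per_class, 1))
    sign = np.repeat([1.0, -1.0], n_per_class)[:, None]
    noise = 0.1 * rng.standard_normal((2 * n_per_class, grid.size))
    curves = sign * amp * np.sin(2 * np.pi * grid) + noise
    labels = np.repeat([0, 1], n_per_class)
    return curves, labels

def fpca_basis(curves, n_components):
    """Discretised FPCA: mean curve plus leading right singular vectors."""
    mean_curve = curves.mean(axis=0)
    _, _, vt = np.linalg.svd(curves - mean_curve, full_matrices=False)
    return mean_curve, vt[:n_components]

def gini(y):
    """Gini impurity of a binary label vector."""
    if y.size == 0:
        return 0.0
    p = y.mean()
    return 2.0 * p * (1.0 - p)

def fit_stump(scores, labels):
    """Depth-one tree: choose the score and threshold minimising weighted Gini."""
    best = None
    for j in range(scores.shape[1]):
        values = np.unique(scores[:, j])
        for thr in (values[:-1] + values[1:]) / 2.0:
            left = labels[scores[:, j] <= thr]
            right = labels[scores[:, j] > thr]
            impurity = (left.size * gini(left) + right.size * gini(right)) / labels.size
            if best is None or impurity < best[0]:
                best = (impurity, j, thr, int(left.mean() >= 0.5), int(right.mean() >= 0.5))
    return best[1:]

def predict_stump(stump, scores):
    """Assign the majority label of the leaf each observation falls into."""
    j, thr, left_label, right_label = stump
    return np.where(scores[:, j] <= thr, left_label, right_label)

# Fit the basis and the stump on training curves only.
train_curves, train_labels = simulate(40)
test_curves, test_labels = simulate(20)
mean_curve, components = fpca_basis(train_curves, n_components=2)
train_scores = (train_curves - mean_curve) @ components.T
test_scores = (test_curves - mean_curve) @ components.T

stump = fit_stump(train_scores, train_labels)
accuracy = float(np.mean(predict_stump(stump, test_scores) == test_labels))
print(f"test accuracy: {accuracy:.2f}")
```

Bagging or a random forest in this setting would repeat the tree-fitting step on bootstrap resamples of the score matrix and aggregate the predictions by majority vote; the sketch stops at a single split to stay short.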

Funder

Università degli Studi della Campania Luigi Vanvitelli

Publisher

Springer Science and Business Media LLC

Subject

Computational Mathematics; Statistics, Probability and Uncertainty; Statistics and Probability

Link

https://link.springer.com/content/pdf/10.1007/s00180-022-01236-1.pdf

References: 44 articles.

1. Aguilera A, Aguilera-Morillo M (2013) Penalized PCA approaches for B-spline expansions of smooth functional data. Applied Mathematics and Computation. https://doi.org/10.1016/j.amc.2013.02.009

2. Aguilera-Morillo M, Aguilera A, Escabias M, Valderrama MJ (2012) Penalized spline approaches for functional logit regression. Test 22(2):251–277. https://doi.org/10.1007/s11749-012-0307-1

3. Balakrishnan S, Madigan D (2006) Decision trees for functional variables. In: Sixth International Conference on Data Mining (ICDM’06), IEEE, https://doi.org/10.1109/icdm.2006.49

4. Belli E, Vantini S (2020) Measure inducing classification and regression trees for functional data. arXiv preprint arXiv:2011.00046

5. Bongiorno E, Goia A (2019) Describing the concentration of income populations by functional principal component analysis on Lorenz curves. Journal of Multivariate Analysis 170:10–24

Cited by 9 articles.

1. A Functional Data Classifier Based on Bonferroni Mean Fuzzy K-Nearest Centroid Neighbor;2024 6th International Conference on Communications, Information System and Computer Engineering (CISCE);2024-05-10

2. Functional Local Mean K-Nearest Neighbor: introducing a novel metric for improved algorithm performance;2024 International Conference on Intelligent Systems and Computer Vision (ISCV);2024-05-08

3. Exploring intertemporal decision-making dynamics through functional data analysis: investigating variations in different discount function's dimensions;Quality & Quantity;2024-04-04

4. Curve Classification Based on Mean-Variance Feature Weighting and Its Application;Computers, Materials & Continua;2024

5. Flu vaccination coverage in Italy in the COVID-19 era: A fuzzy functional k-means (FFKM) approach;Journal of Infection and Public Health;2023-11