Benchmark and application of unsupervised classification approaches for univariate data-Reference-Cited by-同舟云学术

Benchmark and application of unsupervised classification approaches for univariate data

Published:2021-03-12 Issue:1 Volume:4 Page:
ISSN:2399-3650
Container-title:Communications Physics
language:en
Short-container-title:Commun Phys

Author:

El Abbassi Maria^ORCID,Overbeck Jan^ORCID,Braun Oliver^ORCID,Calame Michel^ORCID,van der Zant Herre S. J.^ORCID,Perrin Mickael L.^ORCID

Abstract

AbstractUnsupervised machine learning, and in particular data clustering, is a powerful approach for the analysis of datasets and identification of characteristic features occurring throughout a dataset. It is gaining popularity across scientific disciplines and is particularly useful for applications without a priori knowledge of the data structure. Here, we introduce an approach for unsupervised data classification of any dataset consisting of a series of univariate measurements. It is therefore ideally suited for a wide range of measurement types. We apply it to the field of nanoelectronics and spectroscopy to identify meaningful structures in data sets. We also provide guidelines for the estimation of the optimum number of clusters. In addition, we have performed an extensive benchmark of novel and existing machine learning approaches and observe significant performance differences. Careful selection of the feature space construction method and clustering algorithms for a specific measurement type can therefore greatly improve classification accuracies.

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy

Link

http://www.nature.com/articles/s42005-021-00549-9.pdf

Reference69 articles.

1. International Data Corporation (IDC). Worldwide Spending on Artificial Intelligence Systems Will Be Nearly $98 Billion in 2023 https://www.idc.com/getdoc.jsp?containerId=prUS45481219 (2019).

2. Schmidhuber, J. Deep learning in neural networks: an overview. Neural Netw. 61, 85–117 (2015).

3. Sun, Y., Wang, X. & Tang, X. Deep learning face representation from predicting 10,000 classes. In Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1891–1898 (IEEE Computer Society, 2014).

4. Liu, Z., Luo, P., Wang, X. & Tang, X. Deep learning face attributes in the wild. In 2015 IEEE International Conference on Computer Vision (ICCV) 3730–3738 (IEEE Computer Society, 2015).

5. Mikolov, T., Karafiát, M., Burget, L., Cernocký, J. & Khudanpur, S. Recurrent neural network based language model. In Proc. 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010 (eds. Kobayashi, T., Hirose, K. & Nakamura, S.) Vol. 2, 1045–1048 (Interspeech, 2010).

Cited by 25 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. YogurtNet: Enhanced machine learning approach for voltage drop prediction;Journal of Physics: Conference Series;2024-07-01

2. Making the Most of Nothing: One-Class Classification for Single-Molecule Transport Studies;ACS Nanoscience Au;2024-06-06

3. Essential spectral pixels-based improvement of UMAP classifying hyperspectral imaging data to identify minor compounds in food matrix;Talanta;2024-06

4. Influence of Peripheral Alkyl Groups on Junction Configurations in Single-Molecule Electronics;The Journal of Physical Chemistry C;2024-01-16

5. Gaussian Mixture Model-Based Cloud- Phase Estimation From GEO- KOMPSAT-2A Observations;IEEE Transactions on Geoscience and Remote Sensing;2024