Abstract
Principal component analysis (PCA) is a dimensionality reduction technique that is known for being simple and easy to interpret. Principal components are often interpreted as low-dimensional patterns in high-dimensional data. However, this simple interpretation of PCA relies on several unstated assumptions that are difficult to satisfy. When these assumptions are violated, non-oscillatory data may have oscillatory principal components. Here, we show that two common properties of data violate these assumptions and cause oscillatory principal components: smoothness, and shifts in time or space. These two properties implicate almost all neuroscience data. We show how the oscillations that they produce, which we call “phantom oscillations”, impact data analysis. We also show that traditional cross-validation does not detect phantom oscillations, so we suggest procedures that do. Our findings are supported by a collection of mathematical proofs. Collectively, our work demonstrates that patterns which emerge from high-dimensional data analysis may not faithfully represent the underlying data.
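The smoothness effect can be illustrated with a minimal sketch. It is not the authors' code and rests on stated assumptions: the data are taken to be white noise smoothed with a Gaussian kernel of standard deviation `width`, so the covariance between two timepoints is a Gaussian function of their lag with standard deviation `width * sqrt(2)`. PCA in the large-sample limit is then the eigendecomposition of that covariance matrix. The function names and parameter values below are hypothetical choices made for illustration.

```python
import numpy as np

def smooth_covariance_pcs(n_time=100, width=5.0, n_pcs=4):
    """Leading principal components of Gaussian-smoothed white noise,
    computed from its population covariance (large-sample limit of PCA)."""
    t = np.arange(n_time)
    lag = t[:, None] - t[None, :]
    # Assumed model: smoothing white noise with a Gaussian of std `width`
    # gives a covariance that is Gaussian in the lag, with std width*sqrt(2).
    cov = np.exp(-lag**2 / (4.0 * width**2))
    evals, evecs = np.linalg.eigh(cov)          # eigenvalues in ascending order
    order = np.argsort(evals)[::-1][:n_pcs]     # indices of the largest ones
    return evals[order], evecs[:, order].T      # one component per row

def count_sign_changes(v):
    """Number of sign changes along a vector, ignoring exact zeros."""
    s = np.sign(v)
    s = s[s != 0]
    return int(np.sum(s[:-1] != s[1:]))

if __name__ == "__main__":
    evals, pcs = smooth_covariance_pcs()
    for k, pc in enumerate(pcs, start=1):
        print(f"PC{k}: {count_sign_changes(pc)} sign changes")
```

Although the modeled signal contains no oscillation at any particular frequency, each successive component in this sketch crosses zero one more time than the previous one, which gives the components a sinusoid-like appearance.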
Publisher
Cold Spring Harbor Laboratory