Movatterモバイル変換


[0]ホーム

URL:


Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
Thehttps:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log inShow account info
Access keysNCBI HomepageMyNCBI HomepageMain ContentMain Navigation
pubmed logo
Advanced Clipboard
User Guide

Full text links

Free PMC article
Full text links

Actions

.2017 May;167(3-4):462-475.
doi: 10.1007/s10955-017-1770-6. Epub 2017 Mar 27.

PCA meets RG

Affiliations

PCA meets RG

Serena Bradde et al. J Stat Phys.2017 May.

Abstract

A system with many degrees of freedom can be characterized by a covariance matrix; principal components analysis (PCA) focuses on the eigenvalues of this matrix, hoping to find a lower dimensional description. But when the spectrum is nearly continuous, any distinction between components that we keep and those that we ignore becomes arbitrary; it then is natural to ask what happens as we vary this arbitrary cutoff. We argue that this problem is analogous to the momentum shell renormalization group (RG). Following this analogy, we can define relevant and irrelevant operators, where the role of dimensionality is played by properties of the eigenvalue density. These results also suggest an approach to the analysis of real data. As an example, we study neural activity in the vertebrate retina as it responds to naturalistic movies, and find evidence of behavior controlled by a nontrivial fixed point. Applied to financial data, our analysis separates modes dominated by sampling noise from a smaller but still macroscopic number of modes described by a non-Gaussian distribution.

PubMed Disclaimer

Figures

FIG. 1
FIG. 1
Analysis of neural activity in the retina. States are defined by the patterns of spiking and silence in successive time bins from 160 neurons, as described in the text. Left: The spectrum of eigenvalues of the correlation matrix, with states constructed from different numbers of time bins, plotted vs fractional mode number in descending order; results from randomized data shown for comparison. Right: Normalized fourth moments for each of the 8 × 160 variables (cyan dots), as a function of the fraction of modes remaining after the coarse graining procedure; blue circles show medians ± one quartile. More precisely, what we plot isψi4/ψi22, with the coarse grained variablesψi defined by Eqs (28, 29). The dashed line is the value of this normalized moment for Gaussian random variables. The plot suggests that the fourth moments flow to a non–trivial fixed value, well above the Gaussian prediction.
FIG. 2
FIG. 2
Analysis of daily returns onN = 2048 assets in the NYSE for a period ofT = 2356 days. Left: Eigenvalues of the correlation matrix, in descending order, as a function of fractional mode number; results are shown for all the data (blue), as well as cases in which we remove the largest 1% (green), 4% (yellow), or 13% (red) of the eigenvalues. Solid lines are theoretical expectations from the Marchenko–Pastur distribution [29]. Right: The flow of the normalized fourth moments for each of theN = 2048 variablesψi4 when we integrate out a fraction of high eigenmodes. The normalization procedure after eingenmodes integration is described in the main text. Colors code different cases where we track all the modes, or first remove different fractions of the large variance modes, as in the analysis on the left. The dashed line is the prediction for Gaussian random variables. We see that the fourth moments have a nonmonotonic behavior when all eigenvalues are included, while they flow rapidly to the Gaussian fixed point when the top 10% of eigenvalues are removed.
See this image and copyright information in PMC

References

    1. Pearson K. On lines and planes of closest fit to systems of points in space. Phil Mag. 1901;2:559–572.
    1. Shlens J. A tutorial on principal components analysis. 2014 arXiv:1404.1100 [cs.LG]
    1. Hopfield JJ. Neural networks and physical systems with emergent collective computational abilities. Proc Natl Acad Sci (USA) 1982;79:2554–2558. - PMC - PubMed
    1. Wilson KG. Problems in physics with many scales of length. Sci Am. 1979;241:158–179.
    1. Jona–Lasinio G. The renormalization group: A probabilistic view. Il Nuovo Cimento. 1975;26B:99–119.

Grants and funding

LinkOut - more resources

Full text links
Free PMC article
Cite
Send To

NCBI Literature Resources

MeSHPMCBookshelfDisclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.


[8]ページ先頭

©2009-2026 Movatter.jp