Abstract
A penalized approach is proposed for performing large numbers of parallel nonparametric analyses of either of two types: Restricted likelihood ratio tests of a parametric regression model versus a general smooth alternative, and nonparametric regression. Compared with naïvely performing each analysis in turn, our techniques reduce computation time dramatically. Viewing the large collection of scatterplot smooths produced by our methods as functional data, we develop a clustering approach to summarize and visualize these results.Our approach is applicable to ultra-high-dimensional data, particularly data acquired by neuroimaging; we illustrate it with an analysis of developmental trajectories of functional connectivity at each of approximately 70,000 brain locations. Supplementary materials, including an appendix and an R package, are available online.
Original language | English |
---|---|
Pages (from-to) | 232-248 |
Number of pages | 17 |
Journal | Journal of Computational and Graphical Statistics |
Volume | 23 |
Issue number | 1 |
DOIs | |
State | Published - 2014 |
Externally published | Yes |
Bibliographical note
Funding Information:The authors thank Eva Petkova, Ciprian Crainiceanu, Davide Imperati, Michael Milham, Clare Kelly, Babak Ardekani, and Xavier Castellanos for very helpful discussions; and the Editor, Associate Editor, and referees for valuable comments on the initial manuscript. The first author’s research is supported in part by National Science Foundation grant DMS-0907017 and National Institutes of Health grant 1R01MH095836-01A1.
Keywords
- Functional data clustering
- Neuroimaging
- Penalized splines
- Restricted likelihood ratio test
- Smoothing parameter selection
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty
- Discrete Mathematics and Combinatorics