Continuous-Time Multivariate Analysis

Biplab Paul, Philip T. Reiss, Erjia Cui, Noemi Foà

Research output: Contribution to journalArticlepeer-review

Abstract

The starting point for much of multivariate analysis (MVA) is an n × p data matrix whose n rows represent observations and whose p columns represent variables. Some multivariate datasets, however, may be best conceptualized not as n discrete p-variate observations, but as p curves or functions defined on a common time interval. Here we introduce a framework for extending techniques of multivariate analysis to such settings. The proposed continuous-time multivariate analysis (CTMVA) framework rests on the assumption that the curves can be represented as linear combinations of basis functions such as B-splines, as in the Ramsay-Silverman representation of functional data; but whereas functional data analysis extends MVA to the case of observations that are curves rather than vectors—heuristically, n × p data with p infinite—we are instead concerned with what happens when n is infinite. We present continuous-time extensions of the classical MVA methods of covariance and correlation estimation, principal component analysis, Fisher’s linear discriminant analysis, and k-means clustering. We show that CTMVA can improve on the performance of classical MVA, in particular for correlation estimation and clustering, and can be applied in some settings where classical MVA cannot, including variables observed at disparate time points. CTMVA is illustrated with a novel perspective on a well-known Canadian weather dataset, and with applications to datasets involving international development, brain signals, and air quality. The proposed methods are implemented in the publicly available R package ctmva. Supplementary materials for this article are available online.

Original languageEnglish
Pages (from-to)384-394
Number of pages11
JournalJournal of Computational and Graphical Statistics
Volume34
Issue number1
DOIs
StatePublished - 2025

Bibliographical note

Publisher Copyright:
© 2024 The Author(s). Published with license by Taylor & Francis Group, LLC.

Keywords

  • B-splines
  • Correlation matrix
  • Fisher’s linear discriminant analysis
  • Functional data
  • k-means clustering
  • Principal component analysis

ASJC Scopus subject areas

  • Statistics and Probability
  • Discrete Mathematics and Combinatorics
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Continuous-Time Multivariate Analysis'. Together they form a unique fingerprint.

Cite this