Multivariate multidistance tests for high-dimensional low sample size case-control studies

Stat Med. 2015 Apr 30;34(9):1511-26. doi: 10.1002/sim.6418. Epub 2015 Jan 29.

Abstract

A class of multivariate tests for case-control studies with high-dimensional low sample size data and with complex dependence structure, which are common in medical imaging and molecular biology, is proposed. The tests can be applied when the number of variables is much larger than the number of subjects and when the underlying population distributions are heavy-tailed or skewed. As a motivating application, we consider a case-control study where phase-contrast cinematic cardiovascular magnetic resonance imaging has been used to compare many cardiovascular characteristics of young healthy smokers and young healthy non-smokers. The tests are based on the combination of tests on interpoint distances. It is theoretically proved that the tests are exact, unbiased and consistent. It is shown that the tests are very powerful under normal, heavy-tailed and skewed distributions. The tests can also be applied to case-control studies with high-dimensional low sample size data from other medical imaging techniques (like computed tomography or X-ray radiography), chemometrics and microarray data (proteomics and transcriptomics).

Keywords: combined tests; hypothesis testing; magnetic resonance imaging; nonparametric tests.

MeSH terms

  • Adult
  • Biometry / methods
  • Cardiovascular Diseases / diagnosis
  • Cardiovascular Diseases / physiopathology
  • Case-Control Studies*
  • Data Interpretation, Statistical*
  • Humans
  • Magnetic Resonance Imaging
  • Male
  • Middle Aged
  • Multivariate Analysis*
  • Sample Size*
  • Smoking / adverse effects