Publication: A Robust Hotelling Test Statistic for One Sample Case in High Dimensional Data
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The Hotelling T-2 statistic is used to test the hypothesis about the location parameter of multivariate Gaussian distribution, and it is significantly sensitive to outliers. Also, we cannot calculate it when the sample size is less than the number of variables because this statistic needs the inverse of the covariance matrix, and the sample covariance matrix is singular in high dimensional data. Although a new approach, based on shrinkage estimation, was proposed to solve this singularity problem, this estimator is still sensitive to outliers. On the other hand, a robust one sample Hotelling T-2 statistic was proposed by using the minimum covariance determinant (MCD) estimates instead of classical ones. Since the MCD estimates cannot be calculated when n < p, this statistic cannot be used in high-dimensional data. This study proposes to use the minimum regularized covariance determinant (MRCD) estimator instead of classical or MCD. The MRCD estimator is a robust location and scatter estimator, which can be calculated in high-dimensional data. We obtain the asymptotic distribution of the proposed test statistic using Monte Carlo simulations and examine the power and robustness properties of the test statistic with simulated datasets. As a result, we show that the approximate distribution of the test statistic is proper, and the proposed robust test statistic can be used to test the hypothesis about the location parameter of contaminated high dimensional data. Finally, we construct an R function in the MVTests package to perform our proposed test statistic.
Description
Bulut, Hasan/0000-0002-6924-9651;
Citation
WoS Q
Q3
Scopus Q
Q2
Source
Communications in Statistics-Theory and Methods
Volume
52
Issue
13
Start Page
4590
End Page
4604
