Mathematics, Vol. 11, Pages 4374: Testing Equality of Several Distributions at High Dimensions: A Maximum-Mean-Discrepancy-Based Approach

1 year ago 27

Mathematics, Vol. 11, Pages 4374: Testing Equality of Several Distributions at High Dimensions: A Maximum-Mean-Discrepancy-Based Approach

Mathematics doi: 10.3390/math11204374

Authors: Zhi Peng Ong Aixiang Andy Chen Tianming Zhu Jin-Ting Zhang

With the development of modern data collection techniques, researchers often encounter high-dimensional data across various research fields. An important problem is to determine whether several groups of these high-dimensional data originate from the same population. To address this, this paper presents a novel k-sample test for equal distributions for high-dimensional data, utilizing the Maximum Mean Discrepancy (MMD). The test statistic is constructed using a V-statistic-based estimator of the squared MMD derived for several samples. The asymptotic null and alternative distributions of the test statistic are derived. To approximate the null distribution accurately, three simple methods are described. To evaluate the performance of the proposed test, two simulation studies and a real data example are presented, demonstrating the effectiveness and reliability of the test in practical applications.

Read Entire Article