Ball Divergence: Nonparametric two sample test

Type: Article

Publication Date: 2018-05-03

Citations: 46

DOI: https://doi.org/10.1214/17-aos1579

Abstract

In this paper, we first introduce Ball Divergence, a novel measure of the difference between two probability measures in separable Banach spaces, and show that the Ball Divergence of two probability measures is zero if and only if these two probability measures are identical without any moment assumption. Using Ball Divergence, we present a metric rank test procedure to detect the equality of distribution measures underlying independent samples. It is therefore robust to outliers or heavy-tail data. We show that this multivariate two sample test statistic is consistent with the Ball Divergence, and it converges to a mixture of χ2 distributions under the null hypothesis and a normal distribution under the alternative hypothesis. Importantly, we prove its consistency against a general alternative hypothesis. Moreover, this result does not depend on the ratio of the two imbalanced sample sizes, ensuring that can be applied to imbalanced data. Numerical studies confirm that our test is superior to several existing tests in terms of Type I error and power. We conclude our paper with two applications of our method: one is for virtual screening in drug development process and the other is for genome wide expression analysis in hormone replacement therapy.

Locations

  • The Annals of Statistics - View - PDF
  • PubMed Central - View
  • Europe PMC (PubMed Central) - View - PDF
  • PubMed - View

Similar Works

Action Title Year Authors
+ Ball: An R package for detecting distribution difference and association in metric spaces 2018 Jin Zhu
Wenliang Pan
Wei Zheng
Xueqin Wang
+ On High Dimensional Behaviour of Some Two-Sample Tests Based on Ball Divergence 2022 Bilol Banerjee
Anil K. Ghosh
+ <b>Ball</b>: An <i>R</i> Package for Detecting Distribution Difference and Association in Metric Spaces 2021 Jin Zhu
Wenliang Pan
Wei Xing Zheng
Xueqin Wang
+ PDF Chat Two-sample test for equal distributions in separate metric space: New maximum mean discrepancy based approaches 2022 Jin‐Ting Zhang
Łukasz Smaga
+ High-Dimensional Behaviour of Some Two-Sample Tests Based on Ball Divergence 2023 Bilol Banerjee
Anil K. Ghosh
+ PDF Chat Direct Divergence Approximation between Probability Distributions and Its Applications in Machine Learning 2013 Masashi Sugiyama
Song Liu
Marthinus Christoffel du Plessis
Masao Yamanaka
Makoto Yamada
Taiji Suzuki
Takafumi Kanamori
+ A hyperbolic divergence based nonparametric test for two‐sample multivariate distributions 2022 Roulin Wang
Wei Fan
Xueqin Wang
+ PDF Chat On Wasserstein Two-Sample Testing and Related Families of Nonparametric Tests 2017 Aaditya Ramdas
Nicolás García Trillos
Marco Cuturi
+ PDF Chat Ball Covariance: A Generic Measure of Dependence in Banach Space 2019 Wenliang Pan
Xueqin Wang
Heping Zhang
Hongtu Zhu
Jin Zhu
+ A new maximum mean discrepancy based two-sample test for equal distributions in separable metric spaces 2024 Bu Zhou
Zhi Peng Ong
Jin‐Ting Zhang
+ A robust and nonparametric two-sample test in high dimensions 2020 Tao Qiu
Wangli Xu
Li Zhu
+ A Maximum Value for the Kullback–Leibler Divergence between Quantized Distributions 2024 Vincenzo Bonnici
+ Testing equality of several distributions in separable metric spaces: A maximum mean discrepancy based approach 2022 Jin‐Ting Zhang
Jia Guo
Bu Zhou
+ Exact Expressions for Kullback–Leibler Divergence for Univariate Distributions 2024 Victor Mooto Nawa
Saralees Nadarajah
+ PDF Chat On the Second-Order Asymptotics of the Hoeffding Test and Other Divergence Tests 2024 K. V. Harsha
Jithin Ravi
Tobias Koch
+ PDF Chat A Ball Divergence Based Measure For Conditional Independence Testing 2024 Bilol Banerjee
Bhaswar B. Bhattacharya
Anil K. Ghosh
+ TwoSampleTest.HD: A Two-Sample Test for the Equality of Distributions for High-Dimensional Data 2018 Marta Cousido Rocha
Elena de Uña Álvarez
Jeffrey D. Hart
+ Some Universal Insights on Divergences for Statistics, Machine Learning and Artificial Intelligence 2018 Michel Broniatowski
Wolfgang Stummer
+ On the sampling distribution of an $\ell^2$ norm of the Empirical Distribution Function, with applications to two-sample nonparametric testing 2012 François Caron
Chris Holmes
Emmanuel Rio
+ Metric Distributional Discrepancy in Metric Space 2021 Wenliang Pan
Yujue Li
Jianwu Liu
Weixiong Mai