Direct Divergence Approximation between Probability Distributions and Its Applications in Machine Learning

Direct Divergence Approximation between Probability Distributions and Its Applications in Machine Learning
Direct Divergence Approximation between Probability Distributions and Its Applications in Machine Learning

ㆍ 저자명: Sugiyama. Masashi,Liu. Song,du Plessis. Marthinus Christoffel,Yamanaka. Masao,Yamada. Makoto,Suzuki. Taiji,Kanamori. Takafumi
ㆍ 간행물명: Journal of computing science and engineering
ㆍ 권/호정보: 2013년|7권 2호|pp.99-111 (13 pages)
ㆍ 발행정보: 한국정보과학회
ㆍ 파일정보: 정기간행물|ENG|
PDF텍스트
ㆍ 주제분야: 기타

이 논문은 한국과학기술정보연구원과 논문 연계를 통해 무료로 제공되는 원문입니다.

서지반출

기타언어초록

Approximating a divergence between two probability distributions from their samples is a fundamental challenge in statistics, information theory, and machine learning. A divergence approximator can be used for various purposes, such as two-sample homogeneity testing, change-point detection, and class-balance estimation. Furthermore, an approximator of a divergence between the joint distribution and the product of marginals can be used for independence testing, which has a wide range of applications, including feature selection and extraction, clustering, object matching, independent component analysis, and causal direction estimation. In this paper, we review recent advances in divergence approximation. Our emphasis is that directly approximating the divergence without estimating probability distributions is more sensible than a naive two-step approach of first estimating probability distributions and then approximating the divergence. Furthermore, despite the overwhelming popularity of the Kullback-Leibler divergence as a divergence measure, we argue that alternatives such as the Pearson divergence, the relative Pearson divergence, and the $L^2$-distance are more useful in practice because of their computationally efficient approximability, high numerical stability, and superior robustness against outliers.

키워드

Machine learning Probability distributions Kullback-Leibler divergence Pearson divergence $L^2$-distance

다운URL