Jose Blanchet , Yang Kang (2021) Sample Out-of-Sample Inference Based on Wasserstein Distance. Operations Research 69(3):985-1013. https://doi.org/10.1287/opre.2020.2028
Abstract
We present a novel inference approach that we call sample out-of-sample inference. The approach can be used widely, ranging from semisupervised learning to stress testing, and it is fundamental in the application of data-driven distributionally robust optimization. Our method enables measuring the impact of plausible out-of-sample scenarios in a given performance measure of interest, such as a financial loss. The methodology is inspired by empirical likelihood (EL), but we optimize the empirical Wasserstein distance (instead of the empirical likelihood) induced by observations. From a methodological standpoint, our analysis of the asymptotic behavior of the induced Wasserstein-distance profile function shows dramatic qualitative differences relative to EL. For instance, in contrast to EL, which typically yields chi-squared weak convergence limits, our asymptotic distributions are often not chi-squared. Also, the …
Authors
Jose Blanchet, Yang Kang
Publication date
2021/5
Journal
Operations Research
Volume
69
Issue
3
Pages
985-1013
Publisher
INFORMS