Abstract
In the adaptive ProbeTopK problem, given a collection of mutually independent random variables X1,…,Xn, our goal is to design an adaptive probing policy to sample these variables in a sequence of T stages, with the objective of maximizing the expected sum of the K highest rewards sampled. In spite of its stylized formulation, this setting captures numerous technical hurdles inherent to stochastic optimization, related to both information structure and efficient computation. For these reasons, special cases and variants of this problem have served as a test bed for a multitude of algorithmic methods, and concurrently as a popular teaching tool in courses and tutorials dedicated to recent trends in optimization under uncertainty. The main contribution of this paper consists in proposing a novel method for upper-bounding the expected reward of optimal adaptive probing policies, based on a simple Min-Max problem. Equipped with this method, we devise a purely combinatorial algorithms for deterministically computing feasible sets whose vicinity to the adaptive optimum is analyzed through prophet inequality ideas. Consequently, this approach allows us to establish improved constructive adaptivity gaps for the ProbeTopK problem in its broadest form, where X1,…,Xn are general random variables, making further advancements when X1,…,Xn are continuous.
Original language | English |
---|---|
Journal | Mathematical Programming |
DOIs | |
State | Accepted/In press - 2025 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© The Author(s) 2025.
Keywords
- Adaptivity gap
- Prophet inequality
- Stochastic probing
ASJC Scopus subject areas
- Software
- General Mathematics