Abstract
We consider a multistage stochastic optimization problem, studying how a single server should prioritize stochastically departing customers. In this setting, our objective is to determine an adaptive service policy that maximizes the expected total reward collected along a discrete planning horizon, in the presence of customers who are independently departing between one stage and the next with known stationary probabilities. Despite its deceiving structural simplicity, we are unaware of nontrivial results regarding the rigorous design of optimal or truly near-optimal policies at present time. Our main contribution resides in proposing a quasi-polynomial-time approximation scheme for serving impatient customers. Specifically, letting n be the number of underlying customers, our algorithm identifies in O(nOɛ (log2n) ) time a service policy whose expected reward is within factor 1 - ɛ of the optimal adaptive reward. Our method for deriving this approximation scheme synthesizes various stochastic analyses in order to investigate how the adaptive optimum is affected by alterations to several instance parameters, including the reward values, the departure probabilities, and the collection of customers itself.
| Original language | English |
|---|---|
| Pages (from-to) | 2744-2760 |
| Number of pages | 17 |
| Journal | Operations Research |
| Volume | 73 |
| Issue number | 5 |
| DOIs | |
| State | Published - 1 Sep 2025 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2024 INFORMS.
Keywords
- dynamic programming
- probabilistic coupling
- quasi-PTAS
ASJC Scopus subject areas
- Computer Science Applications
- Management Science and Operations Research