Frohner, N., & Raidl, G. R. (2025). Learning Value Functions for Same-Day Delivery Problems in the Tardiness Regime. In Computer Aided Systems Theory – EUROCAST 2024 (pp. 263–271). https://doi.org/10.1007/978-3-031-82949-9_24
E192-01 - Forschungsbereich Algorithms and Complexity
-
Published in:
Computer Aided Systems Theory – EUROCAST 2024
-
ISBN:
978-3-031-82949-9
-
Volume:
15172
-
Date (published):
24-Apr-2025
-
Event name:
International Conference on Computer Aided Systems Theory
en
Event date:
25-Feb-2024 - 1-Mar-2024
-
Event place:
Spain
-
Number of Pages:
9
-
Peer reviewed:
Yes
-
Keywords:
Dynamic Vehicle Routing with Stochastic Customers; Same-Day Delivery; Surrogate Function Optimization; Value Function Approximation
en
Abstract:
Same-day delivery problems are a class of stochastic decision making problems concerned with delivering orders placed dynamically by stochastic customers on the same day given a fleet of vehicles. We consider a variant where all orders have to be served with the objective to minimize a tardiness penalty function and where their spatiotemporal distribution is known. A well-known baseline approach to increase performance compared to myopic optimization is by sampling and optimizing scenarios in the short-horizon and deriving a consensus solution from the resulting plans. Its drawback is the computational effort required, which may not make it suitable for near real-time decision making. Extending recent methodology from the literature, we replace this online sampling by an offline training of a short-horizon value function using a neural network, which is then used in the online point-in-time optimization, combining current reward plus estimated future value of a solution candidate. In a first computational study on a single-vehicle instance class with unavoidable tardiness, we show that this leads to comparable performance as the sampling approach, while greatly reducing the online decision time.