Uncertainty-bounded reinforcement learning for revenue optimization in air cargo: a prescriptive learning approach

We propose a prescriptive learning approach for revenue management in air-cargo that combines machine learning prediction with decision making using deep reinforcement learning. This approach, named RL-Cargo, addresses a problem that is unique to the air-cargo business, namely the wide discrepancy b...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Knowledge and information systems 2022-09, Vol.64 (9), p.2515-2541
Hauptverfasser:	Rizzo, Stefano Giovanni, Chen, Yixian, Pang, Linsey, Lucas, Ji, Kaoudi, Zoi, Quiane, Jorge, Chawla, Sanjay
Format:	Artikel
Sprache:	eng
Schlagworte:	Air cargo Airline operations Airlines Cargo capacity Computer Science Data Mining and Knowledge Discovery Database Management Decision making Deep learning Dynamic programming Feature extraction Information Storage and Retrieval Information Systems and Communication Service Information Systems Applications (incl.Internet) IT in Business Machine learning Optimization Predictive analytics Regular Paper Revenue Uncertainty
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	We propose a prescriptive learning approach for revenue management in air-cargo that combines machine learning prediction with decision making using deep reinforcement learning. This approach, named RL-Cargo, addresses a problem that is unique to the air-cargo business, namely the wide discrepancy between the quantity (weight or volume) that a shipper will book and the actual amount received at departure time by the airline. The discrepancy results in sub-optimal and inefficient behavior by both the shipper and the airline resulting in an overall loss of potential revenue for the airline. In the proposed approach, booking features and extracted disguised missing values are exploited to provide a prediction on the received volume, while a DQN method using uncertainty bounds from the prediction intervals is proposed for decision making. We have validated the benefits of RL-Cargo using a real dataset of 1000 flights to compare classical Dynamic Programming and Deep Reinforcement Learning techniques on offloading costs and revenue generation. Our results suggest that prescriptive learning which combines prediction with decision making provides a principled approach for managing the air cargo revenue ecosystem. Furthermore, the proposed approach can be abstracted to many other application domains where decision making needs to be carried out in face of both data and behavioral uncertainty.
ISSN:	0219-1377 0219-3116
DOI:	10.1007/s10115-022-01713-5