Predicting vessel speed in the Arctic without knowing ice conditions using AIS data and decision trees

•Analysis of enhanced AIS data and identification of patterns in vessel traffic in the Kara Sea [2017–2018].•Accuracy of random forest, XGBoost, and LightGBM is tested for vessels’ speeds prediction.•Considered approaches can predict vessels’ speeds (MAE ~3.5 knots) without local ice information.•Vi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Maritime transport research 2021, Vol.2, p.100024, Article 100024
Hauptverfasser: Rao, Prithvi S, Kim, Ekaterina, Smestad, Bjørnar Brende, Asbjørnslett, Bjørn Egil, Bhattacharyya, Anirban
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Analysis of enhanced AIS data and identification of patterns in vessel traffic in the Kara Sea [2017–2018].•Accuracy of random forest, XGBoost, and LightGBM is tested for vessels’ speeds prediction.•Considered approaches can predict vessels’ speeds (MAE ~3.5 knots) without local ice information.•Visibility metrics (daylight data) and distance from the landmark places enhance predictive capabilities of the models. The vessel speed is one of the important parameters that govern safety, emergency, and transport planning in the Arctic. While previous studies have traditionally relied on physics-based simulations to predict vessel's speed in ice-covered waters, most have not fully explored data-driven approaches and powerful supervised machine learning tools to aid speed prediction. This study offers a perspective of applying supervised machine learning models to predict MV SOG using historical Automatic Identification System (AIS) data and without explicit knowledge of local ice conditions. This paper presents a case-study from the region of the Eastern Barents Sea and the Southern Kara Sea. We first analyzed the vessel traffic situation for the years 2017 and 2018, and then used this knowledge to build statistical models to predict vessel speeds. Finally, we evaluated the models’ performance on a test dataset from January 2019. Performance of three models (Random Forest, XGBoost, and LightGBM) have been tested with a variety of date-time handling techniques, and data input mode being permuted to arrive at the most optimal model. The results demonstrate the ability of the models to predict the vessel's speed based on its geographical location, time of the year and other engineered features such as daylight information and route. With the proposed approach we were able to achieve mean absolute error 3.5 knots in average on a test dataset without explicit knowledge of local ice conditions around the vessel, with the majority of the errors being in the Kara Strait region and the Sabetta Channel.
ISSN:2666-822X
2666-822X
DOI:10.1016/j.martra.2021.100024