Game Data Mining Competition on Churn Prediction and Survival Analysis using Commercial Game Log Data

Game companies avoid sharing their game data with external researchers. Only a few research groups have been granted limited access to game data so far. The reluctance of these companies to make data publicly available limits the wide use and development of data mining techniques and artificial inte...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2018-12
Hauptverfasser: Lee, EunJo, Jang, Yoonjae, Yoon, DuMim, Jeon, JiHoon, Yang, Seong-il, Sang-Kwang, Lee, Dae-Wook, Kim, Pei Pei Chen, Guitart, Anna, Bertens, Paul, Periáñez, África, Hadiji, Fabian, Müller, Marc, Joo, Youngjun, Lee, Jiyeon, Hwang, Inchon, Kyung-Joong, Kim
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Game companies avoid sharing their game data with external researchers. Only a few research groups have been granted limited access to game data so far. The reluctance of these companies to make data publicly available limits the wide use and development of data mining techniques and artificial intelligence research specific to the game industry. In this work, we developed and implemented an international competition on game data mining using commercial game log data from one of the major game companies in South Korea: NCSOFT. Our approach enabled researchers to develop and apply state-of-the-art data mining techniques to game log data by making the data open. For the competition, data were collected from Blade & Soul, an action role-playing game, from NCSOFT. The data comprised approximately 100 GB of game logs from 10,000 players. The main aim of the competition was to predict whether a player would churn and when the player would churn during two periods between which the business model was changed to a free-to-play model from a monthly subscription. The results of the competition revealed that highly ranked competitors used deep learning, tree boosting, and linear regression.
ISSN:2331-8422
DOI:10.48550/arxiv.1802.02301