A deep reinforcement learning agent for geometry online tutoring

In this paper, we apply deep reinforcement learning (DRL) for geometry reasoning and develop Dragon to facilitate online tutoring. Its success is contingent on a flexible data model to capture diverse concepts and heterogeneous relations, as well as an effective DRL agent to generate near-optimal an...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Knowledge and information systems 2023-04, Vol.65 (4), p.1611-1625
Hauptverfasser:	Xiao, Ziyang, Zhang, Dongxiang
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Computer Science Core curriculum Data augmentation Data Mining and Knowledge Discovery Database Management Datasets Deep learning Expansion Geometry Heuristic methods Information Storage and Retrieval Information Systems and Communication Service Information Systems Applications (incl.Internet) IT in Business Knowledge Knowledge bases (artificial intelligence) Machine learning Neural networks Optimization Optimization techniques Regular Paper Theorems Tutoring
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper, we apply deep reinforcement learning (DRL) for geometry reasoning and develop Dragon to facilitate online tutoring. Its success is contingent on a flexible data model to capture diverse concepts and heterogeneous relations, as well as an effective DRL agent to generate near-optimal and human-readable solutions. We use proximal policy optimization (PPO) as the backbone DRL architecture, customized with effective state representation and integrated with a bunch of optimization tricks including attention mechanism, action mask, data augmentation and curriculum learning. In our experimental study, we craft so far the largest scale dataset with geometry problems and a knowledge base with 46 theorems. We implement various heuristic algorithms and DRL models as baselines for performance comparison. The results show that our agent achieves near-optimal solution and is superior over multiple competitive baselines. To benefit the community, we opensource the dataset and implementation at https://github.com/AIEdu-xzy/geometry-solver .
ISSN:	0219-1377 0219-3116
DOI:	10.1007/s10115-022-01804-3