BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model
The rapid advancement of large language models (LLMs) has revolutionized role-playing, enabling the development of general role-playing models. However, current role-playing training has two significant issues: (I) Using a predefined role profile to prompt dialogue training for specific scenarios us...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The rapid advancement of large language models (LLMs) has revolutionized
role-playing, enabling the development of general role-playing models. However,
current role-playing training has two significant issues: (I) Using a
predefined role profile to prompt dialogue training for specific scenarios
usually leads to inconsistencies and even conflicts between the dialogue and
the profile, resulting in training biases. (II) The model learns to imitate the
role based solely on the profile, neglecting profile-dialogue alignment at the
sentence level. In this work, we propose a simple yet effective framework
called BEYOND DIALOGUE, designed to overcome these hurdles. This framework
innovatively introduces "beyond dialogue" tasks to align dialogue with profile
traits based on each specific scenario, thereby eliminating biases during
training. Furthermore, by adopting an innovative prompting mechanism that
generates reasoning outcomes for training, the framework allows the model to
achieve fine-grained alignment between profile and dialogue at the sentence
level. The aforementioned methods are fully automated and low-cost.
Additionally, the integration of automated dialogue and objective evaluation
methods forms a comprehensive framework, paving the way for general
role-playing. Experimental results demonstrate that our model excels in
adhering to and reflecting various dimensions of role profiles, outperforming
most proprietary general and specialized role-playing baselines. All code and
datasets are available at https://github.com/yuyouyu32/BeyondDialogue. |
---|---|
DOI: | 10.48550/arxiv.2408.10903 |