Towards Personalized Federated Multi-Scenario Multi-Task Recommendation
In modern recommender systems, especially in e-commerce, predicting multiple targets such as click-through rate (CTR) and post-view conversion rate (CTCVR) is common. Multi-task recommender systems are increasingly popular in both research and practice, as they leverage shared knowledge across diver...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In modern recommender systems, especially in e-commerce, predicting multiple
targets such as click-through rate (CTR) and post-view conversion rate (CTCVR)
is common. Multi-task recommender systems are increasingly popular in both
research and practice, as they leverage shared knowledge across diverse
business scenarios to enhance performance. However, emerging real-world
scenarios and data privacy concerns complicate the development of a unified
multi-task recommendation model.
In this paper, we propose PF-MSMTrec, a novel framework for personalized
federated multi-scenario multi-task recommendation. In this framework, each
scenario is assigned to a dedicated client utilizing the Multi-gate
Mixture-of-Experts (MMoE) structure. To address the unique challenges of
multiple optimization conflicts, we introduce a bottom-up joint learning
mechanism. First, we design a parameter template to decouple the expert network
parameters, distinguishing scenario-specific parameters as shared knowledge for
federated parameter aggregation. Second, we implement personalized federated
learning for each expert network during a federated communication round, using
three modules: federated batch normalization, conflict coordination, and
personalized aggregation. Finally, we conduct an additional round of
personalized federated parameter aggregation on the task tower network to
obtain prediction results for multiple tasks. Extensive experiments on two
public datasets demonstrate that our proposed method outperforms
state-of-the-art approaches. The source code and datasets will be released as
open-source for public access. |
---|---|
DOI: | 10.48550/arxiv.2406.18938 |