DELTA: Dynamic Embedding Learning with Truncated Conscious Attention for CTR Prediction
Click-Through Rate (CTR) prediction is a pivotal task in product and content recommendation, where learning effective feature embeddings is of great significance. However, traditional methods typically learn fixed feature representations without dynamically refining feature representations according...
Gespeichert in:
Hauptverfasser: | , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Click-Through Rate (CTR) prediction is a pivotal task in product and content
recommendation, where learning effective feature embeddings is of great
significance. However, traditional methods typically learn fixed feature
representations without dynamically refining feature representations according
to the context information, leading to suboptimal performance. Some recent
approaches attempt to address this issue by learning bit-wise weights or
augmented embeddings for feature representations, but suffer from uninformative
or redundant features in the context. To tackle this problem, inspired by the
Global Workspace Theory in conscious processing, which posits that only a
specific subset of the product features are pertinent while the rest can be
noisy and even detrimental to human-click behaviors, we propose a CTR model
that enables Dynamic Embedding Learning with Truncated Conscious Attention for
CTR prediction, termed DELTA. DELTA contains two key components: (I) conscious
truncation module (CTM), which utilizes curriculum learning to apply adaptive
truncation on attention weights to select the most critical feature in the
context; (II) explicit embedding optimization (EEO), which applies an auxiliary
task during training that directly and independently propagates the gradient
from the loss layer to the embedding layer, thereby optimizing the embedding
explicitly via linear feature crossing. Extensive experiments on five
challenging CTR datasets demonstrate that DELTA achieves new state-of-art
performance among current CTR methods. |
---|---|
DOI: | 10.48550/arxiv.2305.04891 |