Learning Classifiers under Delayed Feedback with a Time Window Assumption
We consider training a binary classifier under delayed feedback (\emph{DF learning}). For example, in the conversion prediction in online ads, we initially receive negative samples that clicked the ads but did not buy an item; subsequently, some samples among them buy an item then change to positive...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | We consider training a binary classifier under delayed feedback (\emph{DF
learning}). For example, in the conversion prediction in online ads, we
initially receive negative samples that clicked the ads but did not buy an
item; subsequently, some samples among them buy an item then change to
positive. In the setting of DF learning, we observe samples over time, then
learn a classifier at some point. We initially receive negative samples;
subsequently, some samples among them change to positive. This problem is
conceivable in various real-world applications such as online advertisements,
where the user action takes place long after the first click. Owing to the
delayed feedback, naive classification of the positive and negative samples
returns a biased classifier. One solution is to use samples that have been
observed for more than a certain time window assuming these samples are
correctly labeled. However, existing studies reported that simply using a
subset of all samples based on the time window assumption does not perform
well, and that using all samples along with the time window assumption improves
empirical performance. We extend these existing studies and propose a method
with the unbiased and convex empirical risk that is constructed from all
samples under the time window assumption. To demonstrate the soundness of the
proposed method, we provide experimental results on a synthetic and open
dataset that is the real traffic log datasets in online advertising. |
---|---|
DOI: | 10.48550/arxiv.2009.13092 |