Maximizing the Success Probability of Policy Allocations in Online Systems

The effectiveness of advertising in e-commerce largely depends on the ability of merchants to bid on and win impressions for their targeted users. The bidding procedure is highly complex due to various factors such as market competition, user behavior, and the diverse objectives of advertisers. In t...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Betlei, Artem, Vladimirova, Mariia, Sebbar, Mehdi, Urien, Nicolas, Rahier, Thibaud, Heymann, Benjamin
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Cognitive science Computer science Computer Science - Computer Science and Game Theory Computer Science - Information Retrieval Computer Science - Learning Other Statistics Statistics Statistics - Machine Learning
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Betlei, Artem Vladimirova, Mariia Sebbar, Mehdi Urien, Nicolas Rahier, Thibaud Heymann, Benjamin
description	The effectiveness of advertising in e-commerce largely depends on the ability of merchants to bid on and win impressions for their targeted users. The bidding procedure is highly complex due to various factors such as market competition, user behavior, and the diverse objectives of advertisers. In this paper we consider the problem at the level of user timelines instead of individual bid requests, manipulating full policies (i.e. pre-defined bidding strategies) and not bid values. In order to optimally allocate policies to users, typical multiple treatments allocation methods solve knapsack-like problems which aim at maximizing an expected value under constraints. In the industrial contexts such as online advertising, we argue that optimizing for the probability of success is a more suited objective than expected value maximization, and we introduce the SuccessProbaMax algorithm that aims at finding the policy allocation which is the most likely to outperform a fixed reference policy. Finally, we conduct comprehensive experiments both on synthetic and real-world data to evaluate its performance. The results demonstrate that our proposed algorithm outperforms conventional expected-value maximization algorithms in terms of success rate.
doi_str_mv	10.48550/arxiv.2312.16267
format	Conference Proceeding
fullrecord	<record><control><sourceid>hal_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2312_16267</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>oai_HAL_hal_04413174v1</sourcerecordid><originalsourceid>FETCH-LOGICAL-a1017-eb0d9488a98096091d1b5c9389178c321df4d3f2989bf39090fd6221ae4a219d3</originalsourceid><addsrcrecordid>eNo9kE1LxDAYhHPxIKs_wJO5emjNm6RtciyLukplF9RzSfPhBtJGmrps_fXuh3gaGJ4ZmEHoBkjORVGQezXu_S6nDGgOJS2rS_Tyqva-9z9--MTT1uK3b61tSngzxk51PvhpxtHhTQxez7gOIWo1-Tgk7Ae8HoIfDpk5TbZPV-jCqZDs9Z8u0Mfjw_tylTXrp-dl3WQKCFSZ7YiRXAglBZElkWCgK7RkQkIlNKNgHDfMUSlk55gkkjhTUgrKckVBGrZAd-ferQrt1-h7Nc5tVL5d1U179AjnwKDiOziwt2f2tPyfPj7Qnh5gv_sqVK4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Maximizing the Success Probability of Policy Allocations in Online Systems</title><source>arXiv.org</source><creator>Betlei, Artem ; Vladimirova, Mariia ; Sebbar, Mehdi ; Urien, Nicolas ; Rahier, Thibaud ; Heymann, Benjamin</creator><creatorcontrib>Betlei, Artem ; Vladimirova, Mariia ; Sebbar, Mehdi ; Urien, Nicolas ; Rahier, Thibaud ; Heymann, Benjamin</creatorcontrib><description>The effectiveness of advertising in e-commerce largely depends on the ability of merchants to bid on and win impressions for their targeted users. The bidding procedure is highly complex due to various factors such as market competition, user behavior, and the diverse objectives of advertisers. In this paper we consider the problem at the level of user timelines instead of individual bid requests, manipulating full policies (i.e. pre-defined bidding strategies) and not bid values. In order to optimally allocate policies to users, typical multiple treatments allocation methods solve knapsack-like problems which aim at maximizing an expected value under constraints. In the industrial contexts such as online advertising, we argue that optimizing for the probability of success is a more suited objective than expected value maximization, and we introduce the SuccessProbaMax algorithm that aims at finding the policy allocation which is the most likely to outperform a fixed reference policy. Finally, we conduct comprehensive experiments both on synthetic and real-world data to evaluate its performance. The results demonstrate that our proposed algorithm outperforms conventional expected-value maximization algorithms in terms of success rate.</description><identifier>DOI: 10.48550/arxiv.2312.16267</identifier><language>eng</language><publisher>arXiv</publisher><subject>Cognitive science ; Computer science ; Computer Science - Computer Science and Game Theory ; Computer Science - Information Retrieval ; Computer Science - Learning ; Other Statistics ; Statistics ; Statistics - Machine Learning</subject><creationdate>2023</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><rights>Attribution</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0002-0318-5333 ; 0000-0003-0251-6177</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,309,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2312.16267$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2312.16267$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://hal.science/hal-04413174$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Betlei, Artem</creatorcontrib><creatorcontrib>Vladimirova, Mariia</creatorcontrib><creatorcontrib>Sebbar, Mehdi</creatorcontrib><creatorcontrib>Urien, Nicolas</creatorcontrib><creatorcontrib>Rahier, Thibaud</creatorcontrib><creatorcontrib>Heymann, Benjamin</creatorcontrib><title>Maximizing the Success Probability of Policy Allocations in Online Systems</title><description>The effectiveness of advertising in e-commerce largely depends on the ability of merchants to bid on and win impressions for their targeted users. The bidding procedure is highly complex due to various factors such as market competition, user behavior, and the diverse objectives of advertisers. In this paper we consider the problem at the level of user timelines instead of individual bid requests, manipulating full policies (i.e. pre-defined bidding strategies) and not bid values. In order to optimally allocate policies to users, typical multiple treatments allocation methods solve knapsack-like problems which aim at maximizing an expected value under constraints. In the industrial contexts such as online advertising, we argue that optimizing for the probability of success is a more suited objective than expected value maximization, and we introduce the SuccessProbaMax algorithm that aims at finding the policy allocation which is the most likely to outperform a fixed reference policy. Finally, we conduct comprehensive experiments both on synthetic and real-world data to evaluate its performance. The results demonstrate that our proposed algorithm outperforms conventional expected-value maximization algorithms in terms of success rate.</description><subject>Cognitive science</subject><subject>Computer science</subject><subject>Computer Science - Computer Science and Game Theory</subject><subject>Computer Science - Information Retrieval</subject><subject>Computer Science - Learning</subject><subject>Other Statistics</subject><subject>Statistics</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2023</creationdate><recordtype>conference_proceeding</recordtype><sourceid>GOX</sourceid><recordid>eNo9kE1LxDAYhHPxIKs_wJO5emjNm6RtciyLukplF9RzSfPhBtJGmrps_fXuh3gaGJ4ZmEHoBkjORVGQezXu_S6nDGgOJS2rS_Tyqva-9z9--MTT1uK3b61tSngzxk51PvhpxtHhTQxez7gOIWo1-Tgk7Ae8HoIfDpk5TbZPV-jCqZDs9Z8u0Mfjw_tylTXrp-dl3WQKCFSZ7YiRXAglBZElkWCgK7RkQkIlNKNgHDfMUSlk55gkkjhTUgrKckVBGrZAd-ferQrt1-h7Nc5tVL5d1U179AjnwKDiOziwt2f2tPyfPj7Qnh5gv_sqVK4</recordid><startdate>20231226</startdate><enddate>20231226</enddate><creator>Betlei, Artem</creator><creator>Vladimirova, Mariia</creator><creator>Sebbar, Mehdi</creator><creator>Urien, Nicolas</creator><creator>Rahier, Thibaud</creator><creator>Heymann, Benjamin</creator><general>arXiv</general><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope><scope>1XC</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0002-0318-5333</orcidid><orcidid>https://orcid.org/0000-0003-0251-6177</orcidid></search><sort><creationdate>20231226</creationdate><title>Maximizing the Success Probability of Policy Allocations in Online Systems</title><author>Betlei, Artem ; Vladimirova, Mariia ; Sebbar, Mehdi ; Urien, Nicolas ; Rahier, Thibaud ; Heymann, Benjamin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a1017-eb0d9488a98096091d1b5c9389178c321df4d3f2989bf39090fd6221ae4a219d3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Cognitive science</topic><topic>Computer science</topic><topic>Computer Science - Computer Science and Game Theory</topic><topic>Computer Science - Information Retrieval</topic><topic>Computer Science - Learning</topic><topic>Other Statistics</topic><topic>Statistics</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Betlei, Artem</creatorcontrib><creatorcontrib>Vladimirova, Mariia</creatorcontrib><creatorcontrib>Sebbar, Mehdi</creatorcontrib><creatorcontrib>Urien, Nicolas</creatorcontrib><creatorcontrib>Rahier, Thibaud</creatorcontrib><creatorcontrib>Heymann, Benjamin</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Betlei, Artem</au><au>Vladimirova, Mariia</au><au>Sebbar, Mehdi</au><au>Urien, Nicolas</au><au>Rahier, Thibaud</au><au>Heymann, Benjamin</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Maximizing the Success Probability of Policy Allocations in Online Systems</atitle><date>2023-12-26</date><risdate>2023</risdate><abstract>The effectiveness of advertising in e-commerce largely depends on the ability of merchants to bid on and win impressions for their targeted users. The bidding procedure is highly complex due to various factors such as market competition, user behavior, and the diverse objectives of advertisers. In this paper we consider the problem at the level of user timelines instead of individual bid requests, manipulating full policies (i.e. pre-defined bidding strategies) and not bid values. In order to optimally allocate policies to users, typical multiple treatments allocation methods solve knapsack-like problems which aim at maximizing an expected value under constraints. In the industrial contexts such as online advertising, we argue that optimizing for the probability of success is a more suited objective than expected value maximization, and we introduce the SuccessProbaMax algorithm that aims at finding the policy allocation which is the most likely to outperform a fixed reference policy. Finally, we conduct comprehensive experiments both on synthetic and real-world data to evaluate its performance. The results demonstrate that our proposed algorithm outperforms conventional expected-value maximization algorithms in terms of success rate.</abstract><pub>arXiv</pub><doi>10.48550/arxiv.2312.16267</doi><orcidid>https://orcid.org/0000-0002-0318-5333</orcidid><orcidid>https://orcid.org/0000-0003-0251-6177</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2312.16267
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2312_16267
source	arXiv.org
subjects	Cognitive science Computer science Computer Science - Computer Science and Game Theory Computer Science - Information Retrieval Computer Science - Learning Other Statistics Statistics Statistics - Machine Learning
title	Maximizing the Success Probability of Policy Allocations in Online Systems
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T18%3A12%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-hal_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Maximizing%20the%20Success%20Probability%20of%20Policy%20Allocations%20in%20Online%20Systems&rft.au=Betlei,%20Artem&rft.date=2023-12-26&rft_id=info:doi/10.48550/arxiv.2312.16267&rft_dat=%3Chal_GOX%3Eoai_HAL_hal_04413174v1%3C/hal_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true