Towards Target High-Utility Itemsets
For applied intelligence, utility-driven pattern discovery algorithms can identify insightful and useful patterns in databases. However, in these techniques for pattern discovery, the number of patterns can be huge, and the user is often only interested in a few of those patterns. Hence, targeted hi...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Miao, Jinbao Gan, Wensheng Wan, Shicheng Wu, Yongdong Fournier-Viger, Philippe |
description | For applied intelligence, utility-driven pattern discovery algorithms can
identify insightful and useful patterns in databases. However, in these
techniques for pattern discovery, the number of patterns can be huge, and the
user is often only interested in a few of those patterns. Hence, targeted
high-utility itemset mining has emerged as a key research topic, where the aim
is to find a subset of patterns that meet a targeted pattern constraint instead
of all patterns. This is a challenging task because efficiently finding
tailored patterns in a very large search space requires a targeted mining
algorithm. A first algorithm called TargetUM has been proposed, which adopts an
approach similar to post-processing using a tree structure, but the running
time and memory consumption are unsatisfactory in many situations. In this
paper, we address this issue by proposing a novel list-based algorithm with
pattern matching mechanism, named THUIM (Targeted High-Utility Itemset Mining),
which can quickly match high-utility itemsets during the mining process to
select the targeted patterns. Extensive experiments were conducted on different
datasets to compare the performance of the proposed algorithm with
state-of-the-art algorithms. Results show that THUIM performs very well in
terms of runtime and memory consumption, and has good scalability compared to
TargetUM. |
doi_str_mv | 10.48550/arxiv.2206.06157 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2206_06157</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2206_06157</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-14b84bf06a413f3492ec68fb911ca51fb0630c5940340076121c3ce1b374b48d3</originalsourceid><addsrcrecordid>eNotzrFuwjAUQFEvHaqUD-jUDKxJ34uf7WREES1ISCxhjmzHTi2BQI7Vkr9HBaa7XR3G3hFKqoWATx2v4besKpAlSBTqlS2785-Ow5R3Oo4u5Zsw_hSHFI4hzfk2udPk0vTGXrw-Tm7xbMa6r3XXbord_nvbrnaFlkoVSKYm40FqQu45NZWzsvamQbRaoDcgOVjREHACUBIrtNw6NFyRoXrgGft4bO_M_hLDSce5_-f2dy6_AXboOGQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Towards Target High-Utility Itemsets</title><source>arXiv.org</source><creator>Miao, Jinbao ; Gan, Wensheng ; Wan, Shicheng ; Wu, Yongdong ; Fournier-Viger, Philippe</creator><creatorcontrib>Miao, Jinbao ; Gan, Wensheng ; Wan, Shicheng ; Wu, Yongdong ; Fournier-Viger, Philippe</creatorcontrib><description>For applied intelligence, utility-driven pattern discovery algorithms can
identify insightful and useful patterns in databases. However, in these
techniques for pattern discovery, the number of patterns can be huge, and the
user is often only interested in a few of those patterns. Hence, targeted
high-utility itemset mining has emerged as a key research topic, where the aim
is to find a subset of patterns that meet a targeted pattern constraint instead
of all patterns. This is a challenging task because efficiently finding
tailored patterns in a very large search space requires a targeted mining
algorithm. A first algorithm called TargetUM has been proposed, which adopts an
approach similar to post-processing using a tree structure, but the running
time and memory consumption are unsatisfactory in many situations. In this
paper, we address this issue by proposing a novel list-based algorithm with
pattern matching mechanism, named THUIM (Targeted High-Utility Itemset Mining),
which can quickly match high-utility itemsets during the mining process to
select the targeted patterns. Extensive experiments were conducted on different
datasets to compare the performance of the proposed algorithm with
state-of-the-art algorithms. Results show that THUIM performs very well in
terms of runtime and memory consumption, and has good scalability compared to
TargetUM.</description><identifier>DOI: 10.48550/arxiv.2206.06157</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Databases</subject><creationdate>2022-06</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2206.06157$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2206.06157$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Miao, Jinbao</creatorcontrib><creatorcontrib>Gan, Wensheng</creatorcontrib><creatorcontrib>Wan, Shicheng</creatorcontrib><creatorcontrib>Wu, Yongdong</creatorcontrib><creatorcontrib>Fournier-Viger, Philippe</creatorcontrib><title>Towards Target High-Utility Itemsets</title><description>For applied intelligence, utility-driven pattern discovery algorithms can
identify insightful and useful patterns in databases. However, in these
techniques for pattern discovery, the number of patterns can be huge, and the
user is often only interested in a few of those patterns. Hence, targeted
high-utility itemset mining has emerged as a key research topic, where the aim
is to find a subset of patterns that meet a targeted pattern constraint instead
of all patterns. This is a challenging task because efficiently finding
tailored patterns in a very large search space requires a targeted mining
algorithm. A first algorithm called TargetUM has been proposed, which adopts an
approach similar to post-processing using a tree structure, but the running
time and memory consumption are unsatisfactory in many situations. In this
paper, we address this issue by proposing a novel list-based algorithm with
pattern matching mechanism, named THUIM (Targeted High-Utility Itemset Mining),
which can quickly match high-utility itemsets during the mining process to
select the targeted patterns. Extensive experiments were conducted on different
datasets to compare the performance of the proposed algorithm with
state-of-the-art algorithms. Results show that THUIM performs very well in
terms of runtime and memory consumption, and has good scalability compared to
TargetUM.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Databases</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzrFuwjAUQFEvHaqUD-jUDKxJ34uf7WREES1ISCxhjmzHTi2BQI7Vkr9HBaa7XR3G3hFKqoWATx2v4besKpAlSBTqlS2785-Ow5R3Oo4u5Zsw_hSHFI4hzfk2udPk0vTGXrw-Tm7xbMa6r3XXbord_nvbrnaFlkoVSKYm40FqQu45NZWzsvamQbRaoDcgOVjREHACUBIrtNw6NFyRoXrgGft4bO_M_hLDSce5_-f2dy6_AXboOGQ</recordid><startdate>20220609</startdate><enddate>20220609</enddate><creator>Miao, Jinbao</creator><creator>Gan, Wensheng</creator><creator>Wan, Shicheng</creator><creator>Wu, Yongdong</creator><creator>Fournier-Viger, Philippe</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20220609</creationdate><title>Towards Target High-Utility Itemsets</title><author>Miao, Jinbao ; Gan, Wensheng ; Wan, Shicheng ; Wu, Yongdong ; Fournier-Viger, Philippe</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-14b84bf06a413f3492ec68fb911ca51fb0630c5940340076121c3ce1b374b48d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Databases</topic><toplevel>online_resources</toplevel><creatorcontrib>Miao, Jinbao</creatorcontrib><creatorcontrib>Gan, Wensheng</creatorcontrib><creatorcontrib>Wan, Shicheng</creatorcontrib><creatorcontrib>Wu, Yongdong</creatorcontrib><creatorcontrib>Fournier-Viger, Philippe</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Miao, Jinbao</au><au>Gan, Wensheng</au><au>Wan, Shicheng</au><au>Wu, Yongdong</au><au>Fournier-Viger, Philippe</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Towards Target High-Utility Itemsets</atitle><date>2022-06-09</date><risdate>2022</risdate><abstract>For applied intelligence, utility-driven pattern discovery algorithms can
identify insightful and useful patterns in databases. However, in these
techniques for pattern discovery, the number of patterns can be huge, and the
user is often only interested in a few of those patterns. Hence, targeted
high-utility itemset mining has emerged as a key research topic, where the aim
is to find a subset of patterns that meet a targeted pattern constraint instead
of all patterns. This is a challenging task because efficiently finding
tailored patterns in a very large search space requires a targeted mining
algorithm. A first algorithm called TargetUM has been proposed, which adopts an
approach similar to post-processing using a tree structure, but the running
time and memory consumption are unsatisfactory in many situations. In this
paper, we address this issue by proposing a novel list-based algorithm with
pattern matching mechanism, named THUIM (Targeted High-Utility Itemset Mining),
which can quickly match high-utility itemsets during the mining process to
select the targeted patterns. Extensive experiments were conducted on different
datasets to compare the performance of the proposed algorithm with
state-of-the-art algorithms. Results show that THUIM performs very well in
terms of runtime and memory consumption, and has good scalability compared to
TargetUM.</abstract><doi>10.48550/arxiv.2206.06157</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2206.06157 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2206_06157 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence Computer Science - Databases |
title | Towards Target High-Utility Itemsets |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T09%3A56%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Towards%20Target%20High-Utility%20Itemsets&rft.au=Miao,%20Jinbao&rft.date=2022-06-09&rft_id=info:doi/10.48550/arxiv.2206.06157&rft_dat=%3Carxiv_GOX%3E2206_06157%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |