GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array

In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of modern education and computer science 2021-08, Vol.13 (4), p.28-41
Hauptverfasser: Sumathi, P., Murugan, S.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 41
container_issue 4
container_start_page 28
container_title International journal of modern education and computer science
container_volume 13
creator Sumathi, P.
Murugan, S.
description In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the frequently bought items together. Many transactional data were collected every day, and finding frequent itemsets from the massive datasets using the classical algorithms requires more processing time and I/O cost. A GPU accelerated Novel algorithm for finding the frequent patterns using Vertical Data Format (GNVDF) has been introduced in this research article. It uses a novel pattern formation. In this, the candidate i-itemsets is divided into two buckets viz., Bucket-1 and Bucket-2. Bucket-1 contain all the possible items to form candidate-(i+1) itemsets. Bucket-2 has the items that cannot include in the candidate-(i+1) itemsets. It compactly employs a jagged array to minimize the memory requirement and remove common transactions among the frequent 1-itemsets. It also utilizes a vertical representation of data for efficiently extracting the frequent itemsets by scanning the database only once. Further, it is GPU-accelerated for speeding up the execution of the algorithm. The proposed algorithm was implemented with and without GPU usage and compared. The comparison result revealed that GNVDF with GPU acceleration is faster by 90 to 135 times than the method without GPU.
doi_str_mv 10.5815/ijmecs.2021.04.03
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2798547014</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2798547014</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1833-b6b40f0722911d27db2c26b1c2c7cb7c409a679336be673bcfc720b4826d72be3</originalsourceid><addsrcrecordid>eNo9kN1LwzAUxYMoOOb-AN8CPnfmq0nrW9nsVMbcgxu-hSRNu45-zCQThv-8HRPvy70cDucefgDcYzSNExw_1vvWGj8liOApYlNEr8CIIBFHCIvP6_-b41sw8X6PhuEpIygdgZ_FajvPn2AGF-tNpIyxjXUq2AKu-m_bwKypeleHXQvL3sG87oq6q2Du7NfRdgGuVQjWdR5u_FnfWhdqoxo4V0HBvHetCjA7HFyvzA6qroBvqqqG8Mw5dboDN6VqvJ387THY5M8fs5do-b54nWXLyOCE0khzzVCJBCEpxgURhSaGcI0NMcJoYRhKFRcppVxbLqg2pREEaZYQXgiiLR2Dh0vu0GOo7YPc90fXDS8lEWkSM4EwG1z44jKu997ZUh5c3Sp3khjJM2Z5wSzPmCViElH6C52DcMg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2798547014</pqid></control><display><type>article</type><title>GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Sumathi, P. ; Murugan, S.</creator><creatorcontrib>Sumathi, P. ; Murugan, S. ; Research Scholar, Nehru Memorial College (Affiliated to Bharathidasan University), Puthanampatti, Tiruchirappalli-Dt, Tamil Nadu, India - 621 007</creatorcontrib><description>In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the frequently bought items together. Many transactional data were collected every day, and finding frequent itemsets from the massive datasets using the classical algorithms requires more processing time and I/O cost. A GPU accelerated Novel algorithm for finding the frequent patterns using Vertical Data Format (GNVDF) has been introduced in this research article. It uses a novel pattern formation. In this, the candidate i-itemsets is divided into two buckets viz., Bucket-1 and Bucket-2. Bucket-1 contain all the possible items to form candidate-(i+1) itemsets. Bucket-2 has the items that cannot include in the candidate-(i+1) itemsets. It compactly employs a jagged array to minimize the memory requirement and remove common transactions among the frequent 1-itemsets. It also utilizes a vertical representation of data for efficiently extracting the frequent itemsets by scanning the database only once. Further, it is GPU-accelerated for speeding up the execution of the algorithm. The proposed algorithm was implemented with and without GPU usage and compared. The comparison result revealed that GNVDF with GPU acceleration is faster by 90 to 135 times than the method without GPU.</description><identifier>ISSN: 2075-0161</identifier><identifier>EISSN: 2075-017X</identifier><identifier>DOI: 10.5815/ijmecs.2021.04.03</identifier><language>eng</language><publisher>Hong Kong: Modern Education and Computer Science Press</publisher><subject>Algorithms ; Arrays ; Buckets ; Data mining ; Electronic commerce ; Format ; Massive data points ; Mathematics</subject><ispartof>International journal of modern education and computer science, 2021-08, Vol.13 (4), p.28-41</ispartof><rights>2021. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the associated terms available at http://www.mecs-press.org/ijcnis/terms.html</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Sumathi, P.</creatorcontrib><creatorcontrib>Murugan, S.</creatorcontrib><creatorcontrib>Research Scholar, Nehru Memorial College (Affiliated to Bharathidasan University), Puthanampatti, Tiruchirappalli-Dt, Tamil Nadu, India - 621 007</creatorcontrib><title>GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array</title><title>International journal of modern education and computer science</title><description>In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the frequently bought items together. Many transactional data were collected every day, and finding frequent itemsets from the massive datasets using the classical algorithms requires more processing time and I/O cost. A GPU accelerated Novel algorithm for finding the frequent patterns using Vertical Data Format (GNVDF) has been introduced in this research article. It uses a novel pattern formation. In this, the candidate i-itemsets is divided into two buckets viz., Bucket-1 and Bucket-2. Bucket-1 contain all the possible items to form candidate-(i+1) itemsets. Bucket-2 has the items that cannot include in the candidate-(i+1) itemsets. It compactly employs a jagged array to minimize the memory requirement and remove common transactions among the frequent 1-itemsets. It also utilizes a vertical representation of data for efficiently extracting the frequent itemsets by scanning the database only once. Further, it is GPU-accelerated for speeding up the execution of the algorithm. The proposed algorithm was implemented with and without GPU usage and compared. The comparison result revealed that GNVDF with GPU acceleration is faster by 90 to 135 times than the method without GPU.</description><subject>Algorithms</subject><subject>Arrays</subject><subject>Buckets</subject><subject>Data mining</subject><subject>Electronic commerce</subject><subject>Format</subject><subject>Massive data points</subject><subject>Mathematics</subject><issn>2075-0161</issn><issn>2075-017X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNo9kN1LwzAUxYMoOOb-AN8CPnfmq0nrW9nsVMbcgxu-hSRNu45-zCQThv-8HRPvy70cDucefgDcYzSNExw_1vvWGj8liOApYlNEr8CIIBFHCIvP6_-b41sw8X6PhuEpIygdgZ_FajvPn2AGF-tNpIyxjXUq2AKu-m_bwKypeleHXQvL3sG87oq6q2Du7NfRdgGuVQjWdR5u_FnfWhdqoxo4V0HBvHetCjA7HFyvzA6qroBvqqqG8Mw5dboDN6VqvJ387THY5M8fs5do-b54nWXLyOCE0khzzVCJBCEpxgURhSaGcI0NMcJoYRhKFRcppVxbLqg2pREEaZYQXgiiLR2Dh0vu0GOo7YPc90fXDS8lEWkSM4EwG1z44jKu997ZUh5c3Sp3khjJM2Z5wSzPmCViElH6C52DcMg</recordid><startdate>20210808</startdate><enddate>20210808</enddate><creator>Sumathi, P.</creator><creator>Murugan, S.</creator><general>Modern Education and Computer Science Press</general><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>3V.</scope><scope>7XB</scope><scope>88B</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BVBZV</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M0N</scope><scope>M0P</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20210808</creationdate><title>GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array</title><author>Sumathi, P. ; Murugan, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1833-b6b40f0722911d27db2c26b1c2c7cb7c409a679336be673bcfc720b4826d72be3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Arrays</topic><topic>Buckets</topic><topic>Data mining</topic><topic>Electronic commerce</topic><topic>Format</topic><topic>Massive data points</topic><topic>Mathematics</topic><toplevel>online_resources</toplevel><creatorcontrib>Sumathi, P.</creatorcontrib><creatorcontrib>Murugan, S.</creatorcontrib><creatorcontrib>Research Scholar, Nehru Memorial College (Affiliated to Bharathidasan University), Puthanampatti, Tiruchirappalli-Dt, Tamil Nadu, India - 621 007</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Education Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>East &amp; South Asia Database</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Computing Database</collection><collection>Education Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of modern education and computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sumathi, P.</au><au>Murugan, S.</au><aucorp>Research Scholar, Nehru Memorial College (Affiliated to Bharathidasan University), Puthanampatti, Tiruchirappalli-Dt, Tamil Nadu, India - 621 007</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array</atitle><jtitle>International journal of modern education and computer science</jtitle><date>2021-08-08</date><risdate>2021</risdate><volume>13</volume><issue>4</issue><spage>28</spage><epage>41</epage><pages>28-41</pages><issn>2075-0161</issn><eissn>2075-017X</eissn><abstract>In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the frequently bought items together. Many transactional data were collected every day, and finding frequent itemsets from the massive datasets using the classical algorithms requires more processing time and I/O cost. A GPU accelerated Novel algorithm for finding the frequent patterns using Vertical Data Format (GNVDF) has been introduced in this research article. It uses a novel pattern formation. In this, the candidate i-itemsets is divided into two buckets viz., Bucket-1 and Bucket-2. Bucket-1 contain all the possible items to form candidate-(i+1) itemsets. Bucket-2 has the items that cannot include in the candidate-(i+1) itemsets. It compactly employs a jagged array to minimize the memory requirement and remove common transactions among the frequent 1-itemsets. It also utilizes a vertical representation of data for efficiently extracting the frequent itemsets by scanning the database only once. Further, it is GPU-accelerated for speeding up the execution of the algorithm. The proposed algorithm was implemented with and without GPU usage and compared. The comparison result revealed that GNVDF with GPU acceleration is faster by 90 to 135 times than the method without GPU.</abstract><cop>Hong Kong</cop><pub>Modern Education and Computer Science Press</pub><doi>10.5815/ijmecs.2021.04.03</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2075-0161
ispartof International journal of modern education and computer science, 2021-08, Vol.13 (4), p.28-41
issn 2075-0161
2075-017X
language eng
recordid cdi_proquest_journals_2798547014
source EZB-FREE-00999 freely available EZB journals
subjects Algorithms
Arrays
Buckets
Data mining
Electronic commerce
Format
Massive data points
Mathematics
title GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T01%3A17%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=GNVDF:%20A%20GPU-accelerated%20Novel%20Algorithm%20for%20Finding%20Frequent%20Patterns%20Using%20Vertical%20Data%20Format%20Approach%20and%20Jagged%20Array&rft.jtitle=International%20journal%20of%20modern%20education%20and%20computer%20science&rft.au=Sumathi,%20P.&rft.aucorp=Research%20Scholar,%20Nehru%20Memorial%20College%20(Affiliated%20to%20Bharathidasan%20University),%20Puthanampatti,%20Tiruchirappalli-Dt,%20Tamil%20Nadu,%20India%20-%20621%20007&rft.date=2021-08-08&rft.volume=13&rft.issue=4&rft.spage=28&rft.epage=41&rft.pages=28-41&rft.issn=2075-0161&rft.eissn=2075-017X&rft_id=info:doi/10.5815/ijmecs.2021.04.03&rft_dat=%3Cproquest_cross%3E2798547014%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2798547014&rft_id=info:pmid/&rfr_iscdi=true