GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array
In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the...
Gespeichert in:
Veröffentlicht in: | International journal of modern education and computer science 2021-08, Vol.13 (4), p.28-41 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 41 |
---|---|
container_issue | 4 |
container_start_page | 28 |
container_title | International journal of modern education and computer science |
container_volume | 13 |
creator | Sumathi, P. Murugan, S. |
description | In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the frequently bought items together. Many transactional data were collected every day, and finding frequent itemsets from the massive datasets using the classical algorithms requires more processing time and I/O cost. A GPU accelerated Novel algorithm for finding the frequent patterns using Vertical Data Format (GNVDF) has been introduced in this research article. It uses a novel pattern formation. In this, the candidate i-itemsets is divided into two buckets viz., Bucket-1 and Bucket-2. Bucket-1 contain all the possible items to form candidate-(i+1) itemsets. Bucket-2 has the items that cannot include in the candidate-(i+1) itemsets. It compactly employs a jagged array to minimize the memory requirement and remove common transactions among the frequent 1-itemsets. It also utilizes a vertical representation of data for efficiently extracting the frequent itemsets by scanning the database only once. Further, it is GPU-accelerated for speeding up the execution of the algorithm. The proposed algorithm was implemented with and without GPU usage and compared. The comparison result revealed that GNVDF with GPU acceleration is faster by 90 to 135 times than the method without GPU. |
doi_str_mv | 10.5815/ijmecs.2021.04.03 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2798547014</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2798547014</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1833-b6b40f0722911d27db2c26b1c2c7cb7c409a679336be673bcfc720b4826d72be3</originalsourceid><addsrcrecordid>eNo9kN1LwzAUxYMoOOb-AN8CPnfmq0nrW9nsVMbcgxu-hSRNu45-zCQThv-8HRPvy70cDucefgDcYzSNExw_1vvWGj8liOApYlNEr8CIIBFHCIvP6_-b41sw8X6PhuEpIygdgZ_FajvPn2AGF-tNpIyxjXUq2AKu-m_bwKypeleHXQvL3sG87oq6q2Du7NfRdgGuVQjWdR5u_FnfWhdqoxo4V0HBvHetCjA7HFyvzA6qroBvqqqG8Mw5dboDN6VqvJ387THY5M8fs5do-b54nWXLyOCE0khzzVCJBCEpxgURhSaGcI0NMcJoYRhKFRcppVxbLqg2pREEaZYQXgiiLR2Dh0vu0GOo7YPc90fXDS8lEWkSM4EwG1z44jKu997ZUh5c3Sp3khjJM2Z5wSzPmCViElH6C52DcMg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2798547014</pqid></control><display><type>article</type><title>GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Sumathi, P. ; Murugan, S.</creator><creatorcontrib>Sumathi, P. ; Murugan, S. ; Research Scholar, Nehru Memorial College (Affiliated to Bharathidasan University), Puthanampatti, Tiruchirappalli-Dt, Tamil Nadu, India - 621 007</creatorcontrib><description>In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the frequently bought items together. Many transactional data were collected every day, and finding frequent itemsets from the massive datasets using the classical algorithms requires more processing time and I/O cost. A GPU accelerated Novel algorithm for finding the frequent patterns using Vertical Data Format (GNVDF) has been introduced in this research article. It uses a novel pattern formation. In this, the candidate i-itemsets is divided into two buckets viz., Bucket-1 and Bucket-2. Bucket-1 contain all the possible items to form candidate-(i+1) itemsets. Bucket-2 has the items that cannot include in the candidate-(i+1) itemsets. It compactly employs a jagged array to minimize the memory requirement and remove common transactions among the frequent 1-itemsets. It also utilizes a vertical representation of data for efficiently extracting the frequent itemsets by scanning the database only once. Further, it is GPU-accelerated for speeding up the execution of the algorithm. The proposed algorithm was implemented with and without GPU usage and compared. The comparison result revealed that GNVDF with GPU acceleration is faster by 90 to 135 times than the method without GPU.</description><identifier>ISSN: 2075-0161</identifier><identifier>EISSN: 2075-017X</identifier><identifier>DOI: 10.5815/ijmecs.2021.04.03</identifier><language>eng</language><publisher>Hong Kong: Modern Education and Computer Science Press</publisher><subject>Algorithms ; Arrays ; Buckets ; Data mining ; Electronic commerce ; Format ; Massive data points ; Mathematics</subject><ispartof>International journal of modern education and computer science, 2021-08, Vol.13 (4), p.28-41</ispartof><rights>2021. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the associated terms available at http://www.mecs-press.org/ijcnis/terms.html</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Sumathi, P.</creatorcontrib><creatorcontrib>Murugan, S.</creatorcontrib><creatorcontrib>Research Scholar, Nehru Memorial College (Affiliated to Bharathidasan University), Puthanampatti, Tiruchirappalli-Dt, Tamil Nadu, India - 621 007</creatorcontrib><title>GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array</title><title>International journal of modern education and computer science</title><description>In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the frequently bought items together. Many transactional data were collected every day, and finding frequent itemsets from the massive datasets using the classical algorithms requires more processing time and I/O cost. A GPU accelerated Novel algorithm for finding the frequent patterns using Vertical Data Format (GNVDF) has been introduced in this research article. It uses a novel pattern formation. In this, the candidate i-itemsets is divided into two buckets viz., Bucket-1 and Bucket-2. Bucket-1 contain all the possible items to form candidate-(i+1) itemsets. Bucket-2 has the items that cannot include in the candidate-(i+1) itemsets. It compactly employs a jagged array to minimize the memory requirement and remove common transactions among the frequent 1-itemsets. It also utilizes a vertical representation of data for efficiently extracting the frequent itemsets by scanning the database only once. Further, it is GPU-accelerated for speeding up the execution of the algorithm. The proposed algorithm was implemented with and without GPU usage and compared. The comparison result revealed that GNVDF with GPU acceleration is faster by 90 to 135 times than the method without GPU.</description><subject>Algorithms</subject><subject>Arrays</subject><subject>Buckets</subject><subject>Data mining</subject><subject>Electronic commerce</subject><subject>Format</subject><subject>Massive data points</subject><subject>Mathematics</subject><issn>2075-0161</issn><issn>2075-017X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNo9kN1LwzAUxYMoOOb-AN8CPnfmq0nrW9nsVMbcgxu-hSRNu45-zCQThv-8HRPvy70cDucefgDcYzSNExw_1vvWGj8liOApYlNEr8CIIBFHCIvP6_-b41sw8X6PhuEpIygdgZ_FajvPn2AGF-tNpIyxjXUq2AKu-m_bwKypeleHXQvL3sG87oq6q2Du7NfRdgGuVQjWdR5u_FnfWhdqoxo4V0HBvHetCjA7HFyvzA6qroBvqqqG8Mw5dboDN6VqvJ387THY5M8fs5do-b54nWXLyOCE0khzzVCJBCEpxgURhSaGcI0NMcJoYRhKFRcppVxbLqg2pREEaZYQXgiiLR2Dh0vu0GOo7YPc90fXDS8lEWkSM4EwG1z44jKu997ZUh5c3Sp3khjJM2Z5wSzPmCViElH6C52DcMg</recordid><startdate>20210808</startdate><enddate>20210808</enddate><creator>Sumathi, P.</creator><creator>Murugan, S.</creator><general>Modern Education and Computer Science Press</general><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>3V.</scope><scope>7XB</scope><scope>88B</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BVBZV</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M0N</scope><scope>M0P</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20210808</creationdate><title>GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array</title><author>Sumathi, P. ; Murugan, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1833-b6b40f0722911d27db2c26b1c2c7cb7c409a679336be673bcfc720b4826d72be3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Arrays</topic><topic>Buckets</topic><topic>Data mining</topic><topic>Electronic commerce</topic><topic>Format</topic><topic>Massive data points</topic><topic>Mathematics</topic><toplevel>online_resources</toplevel><creatorcontrib>Sumathi, P.</creatorcontrib><creatorcontrib>Murugan, S.</creatorcontrib><creatorcontrib>Research Scholar, Nehru Memorial College (Affiliated to Bharathidasan University), Puthanampatti, Tiruchirappalli-Dt, Tamil Nadu, India - 621 007</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Education Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>East & South Asia Database</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Computing Database</collection><collection>Education Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of modern education and computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sumathi, P.</au><au>Murugan, S.</au><aucorp>Research Scholar, Nehru Memorial College (Affiliated to Bharathidasan University), Puthanampatti, Tiruchirappalli-Dt, Tamil Nadu, India - 621 007</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array</atitle><jtitle>International journal of modern education and computer science</jtitle><date>2021-08-08</date><risdate>2021</risdate><volume>13</volume><issue>4</issue><spage>28</spage><epage>41</epage><pages>28-41</pages><issn>2075-0161</issn><eissn>2075-017X</eissn><abstract>In the modern digital world, online shopping becomes essential in human lives. Online shopping stores like Amazon show up the "Frequently Bought Together" for their customers in their portal to increase sales. Discovering frequent patterns is a fundamental task in Data Mining that find the frequently bought items together. Many transactional data were collected every day, and finding frequent itemsets from the massive datasets using the classical algorithms requires more processing time and I/O cost. A GPU accelerated Novel algorithm for finding the frequent patterns using Vertical Data Format (GNVDF) has been introduced in this research article. It uses a novel pattern formation. In this, the candidate i-itemsets is divided into two buckets viz., Bucket-1 and Bucket-2. Bucket-1 contain all the possible items to form candidate-(i+1) itemsets. Bucket-2 has the items that cannot include in the candidate-(i+1) itemsets. It compactly employs a jagged array to minimize the memory requirement and remove common transactions among the frequent 1-itemsets. It also utilizes a vertical representation of data for efficiently extracting the frequent itemsets by scanning the database only once. Further, it is GPU-accelerated for speeding up the execution of the algorithm. The proposed algorithm was implemented with and without GPU usage and compared. The comparison result revealed that GNVDF with GPU acceleration is faster by 90 to 135 times than the method without GPU.</abstract><cop>Hong Kong</cop><pub>Modern Education and Computer Science Press</pub><doi>10.5815/ijmecs.2021.04.03</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2075-0161 |
ispartof | International journal of modern education and computer science, 2021-08, Vol.13 (4), p.28-41 |
issn | 2075-0161 2075-017X |
language | eng |
recordid | cdi_proquest_journals_2798547014 |
source | EZB-FREE-00999 freely available EZB journals |
subjects | Algorithms Arrays Buckets Data mining Electronic commerce Format Massive data points Mathematics |
title | GNVDF: A GPU-accelerated Novel Algorithm for Finding Frequent Patterns Using Vertical Data Format Approach and Jagged Array |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T01%3A17%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=GNVDF:%20A%20GPU-accelerated%20Novel%20Algorithm%20for%20Finding%20Frequent%20Patterns%20Using%20Vertical%20Data%20Format%20Approach%20and%20Jagged%20Array&rft.jtitle=International%20journal%20of%20modern%20education%20and%20computer%20science&rft.au=Sumathi,%20P.&rft.aucorp=Research%20Scholar,%20Nehru%20Memorial%20College%20(Affiliated%20to%20Bharathidasan%20University),%20Puthanampatti,%20Tiruchirappalli-Dt,%20Tamil%20Nadu,%20India%20-%20621%20007&rft.date=2021-08-08&rft.volume=13&rft.issue=4&rft.spage=28&rft.epage=41&rft.pages=28-41&rft.issn=2075-0161&rft.eissn=2075-017X&rft_id=info:doi/10.5815/ijmecs.2021.04.03&rft_dat=%3Cproquest_cross%3E2798547014%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2798547014&rft_id=info:pmid/&rfr_iscdi=true |