A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior
Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' acad...
Published in: | SN computer science 2022-07, Vol.3 (5), p.393, Article 393 |
---|---|
Main authors: | Dang, Tran Khanh; Nguyen, Huu Huong Xuan |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | |
---|---|
container_issue | 5 |
container_start_page | 393 |
container_title | SN computer science |
container_volume | 3 |
creator | Dang, Tran Khanh; Nguyen, Huu Huong Xuan |
description | Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' academic performance, and a few research studies have developed a new hybrid approach, a combined classification and prediction method in this field. This study has assessed students’ performance using a hybrid method including a decision tree and multiple linear regression to predict their possibility of graduation. Specifically, the decision tree model is used to classify the ‘Adequate’ and ‘Fair’ classes. Then, multiple linear regression models were used to predict future Cumulative Grade Point Average (CGPA). After evaluating the statistics, the first and second coursework scores exhibit a significant impact on the results. Other attributes such as time spent at the campus or the number of times that students failed in the previous semester should be considered in this context. The decision tree model’s accuracy is 0.47 and the Correlation Coefficient of the multiple linear models is 0.52. The result of this research is an equation with a specific weighted score toward the final results. This, in turn, would ensure early and appropriate actions from education to increase the academic achievement of such students. |
doi_str_mv | 10.1007/s42979-022-01251-5 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2938260562</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2938260562</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1155-7313086ca95ef2145f933338e87fa46434b818a3d623441ca34dc6f5bd74380b3</originalsourceid><addsrcrecordid>eNp9kMtOAjEUhhujiQR5AVdNXI_2OpcleMNkjERh3ZTOGSiBDraDCTvfwZWv55NYwERXdtMm_b7_5PwInVNySQnJroJgRVYkhLGEUCZpIo9Qh6UpTfKCZMd_3qeoF8KCEMIkESKVHfTRx8Pt1NsK99dr32gzx5Ng3QzfgLHBNg6PPQDWrsKPm2Vr10vApXWgPX6GmYewZ-rG45GHypp25760mwpcG77eP_EIfPxdaWcAD3SACke-jL7bkSPf7EP2AwYw12-28WfopNbLAL2fu4smd7fj62FSPt0_XPfLxFAqZZJxykmeGl1IqBkVsi54PDnkWa1FKriY5jTXvEoZF4IazUVl0lpOq0zwnEx5F10ccuPirxsIrVo0G-_iSMUKnrOUyKh2ETtQxjcheKjV2tuV9ltFidr1rw79q9i_2vevZJT4QQoRdjPwv9H_WN_QGolh</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2938260562</pqid></control><display><type>article</type><title>A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior</title><source>SpringerLink Journals - AutoHoldings</source><source>ProQuest Central</source><creator>Dang, Tran Khanh ; Nguyen, Huu Huong Xuan</creator><creatorcontrib>Dang, Tran Khanh ; Nguyen, Huu Huong Xuan</creatorcontrib><description>Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' academic performance, and a few research studies have developed a new hybrid approach, a combined classification and prediction method in this field. This study has assessed students’ performance using a hybrid method including a decision tree and multiple linear regression to predict their possibility of graduation. 
Specifically, the decision tree model is used to classify the ‘Adequate’ and ‘Fair’ classes. Then, multiple linear regression models were used to predict future Cumulative Grade Point Average (CGPA). After evaluating the statistics, the first and second coursework scores exhibit a significant impact on the results. Other attributes such as time spent at the campus or the number of times that students failed in the previous semester should be considered in this context. The decision tree model’s accuracy is 0.47 and the Correlation Coefficient of the multiple linear models is 0.52. The result of this research is an equation with a specific weighted score toward the final results. This, in turn, would ensure early and appropriate actions from education to increase the academic achievement of such students.</description><identifier>ISSN: 2661-8907</identifier><identifier>ISSN: 2662-995X</identifier><identifier>EISSN: 2661-8907</identifier><identifier>DOI: 10.1007/s42979-022-01251-5</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Academic achievement ; Accuracy ; Algorithms ; Behavior ; Computer Imaging ; Computer Science ; Computer Systems Organization and Communication Networks ; Correlation coefficients ; Data Structures and Information Theory ; Datasets ; Decision trees ; Demographics ; Distance learning ; Education ; Family income ; Future Data and Security Engineering 2021 ; Gender ; Information Systems and Communication Service ; Learning ; Machine learning ; Model accuracy ; Neural networks ; Original Research ; Pattern Recognition and Graphics ; Performance prediction ; Regression analysis ; Regression models ; Secondary schools ; Software Engineering/Programming and Operating Systems ; Students ; Success ; Support vector machines ; Teachers ; Vision</subject><ispartof>SN computer science, 2022-07, Vol.3 (5), p.393, Article 393</ispartof><rights>The Author(s), under exclusive licence to Springer Nature 
Singapore Pte Ltd 2022</rights><rights>The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1155-7313086ca95ef2145f933338e87fa46434b818a3d623441ca34dc6f5bd74380b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s42979-022-01251-5$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2938260562?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,776,780,21367,27901,27902,33721,41464,42533,43781,51294</link.rule.ids></links><search><creatorcontrib>Dang, Tran Khanh</creatorcontrib><creatorcontrib>Nguyen, Huu Huong Xuan</creatorcontrib><title>A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior</title><title>SN computer science</title><addtitle>SN COMPUT. SCI</addtitle><description>Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' academic performance, and a few research studies have developed a new hybrid approach, a combined classification and prediction method in this field. This study has assessed students’ performance using a hybrid method including a decision tree and multiple linear regression to predict their possibility of graduation. Specifically, the decision tree model is used to classify the ‘Adequate’ and ‘Fair’ classes. Then, multiple linear regression models were used to predict future Cumulative Grade Point Average (CGPA). 
After evaluating the statistics, the first and second coursework scores exhibit a significant impact on the results. Other attributes such as time spent at the campus or the number of times that students failed in the previous semester should be considered in this context. The decision tree model’s accuracy is 0.47 and the Correlation Coefficient of the multiple linear models is 0.52. The result of this research is an equation with a specific weighted score toward the final results. This, in turn, would ensure early and appropriate actions from education to increase the academic achievement of such students.</description><subject>Academic achievement</subject><subject>Accuracy</subject><subject>Algorithms</subject><subject>Behavior</subject><subject>Computer Imaging</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Correlation coefficients</subject><subject>Data Structures and Information Theory</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Demographics</subject><subject>Distance learning</subject><subject>Education</subject><subject>Family income</subject><subject>Future Data and Security Engineering 2021</subject><subject>Gender</subject><subject>Information Systems and Communication Service</subject><subject>Learning</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Neural networks</subject><subject>Original Research</subject><subject>Pattern Recognition and Graphics</subject><subject>Performance prediction</subject><subject>Regression analysis</subject><subject>Regression models</subject><subject>Secondary schools</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Students</subject><subject>Success</subject><subject>Support vector 
machines</subject><subject>Teachers</subject><subject>Vision</subject><issn>2661-8907</issn><issn>2662-995X</issn><issn>2661-8907</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp9kMtOAjEUhhujiQR5AVdNXI_2OpcleMNkjERh3ZTOGSiBDraDCTvfwZWv55NYwERXdtMm_b7_5PwInVNySQnJroJgRVYkhLGEUCZpIo9Qh6UpTfKCZMd_3qeoF8KCEMIkESKVHfTRx8Pt1NsK99dr32gzx5Ng3QzfgLHBNg6PPQDWrsKPm2Vr10vApXWgPX6GmYewZ-rG45GHypp25760mwpcG77eP_EIfPxdaWcAD3SACke-jL7bkSPf7EP2AwYw12-28WfopNbLAL2fu4smd7fj62FSPt0_XPfLxFAqZZJxykmeGl1IqBkVsi54PDnkWa1FKriY5jTXvEoZF4IazUVl0lpOq0zwnEx5F10ccuPirxsIrVo0G-_iSMUKnrOUyKh2ETtQxjcheKjV2tuV9ltFidr1rw79q9i_2vevZJT4QQoRdjPwv9H_WN_QGolh</recordid><startdate>20220725</startdate><enddate>20220725</enddate><creator>Dang, Tran Khanh</creator><creator>Nguyen, Huu Huong Xuan</creator><general>Springer Nature Singapore</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>20220725</creationdate><title>A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior</title><author>Dang, Tran Khanh ; Nguyen, Huu Huong Xuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1155-7313086ca95ef2145f933338e87fa46434b818a3d623441ca34dc6f5bd74380b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Academic 
achievement</topic><topic>Accuracy</topic><topic>Algorithms</topic><topic>Behavior</topic><topic>Computer Imaging</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Correlation coefficients</topic><topic>Data Structures and Information Theory</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Demographics</topic><topic>Distance learning</topic><topic>Education</topic><topic>Family income</topic><topic>Future Data and Security Engineering 2021</topic><topic>Gender</topic><topic>Information Systems and Communication Service</topic><topic>Learning</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Neural networks</topic><topic>Original Research</topic><topic>Pattern Recognition and Graphics</topic><topic>Performance prediction</topic><topic>Regression analysis</topic><topic>Regression models</topic><topic>Secondary schools</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Students</topic><topic>Success</topic><topic>Support vector machines</topic><topic>Teachers</topic><topic>Vision</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dang, Tran Khanh</creatorcontrib><creatorcontrib>Nguyen, Huu Huong Xuan</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer 
Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>SN computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dang, Tran Khanh</au><au>Nguyen, Huu Huong Xuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior</atitle><jtitle>SN computer science</jtitle><stitle>SN COMPUT. SCI</stitle><date>2022-07-25</date><risdate>2022</risdate><volume>3</volume><issue>5</issue><spage>393</spage><pages>393-</pages><artnum>393</artnum><issn>2661-8907</issn><issn>2662-995X</issn><eissn>2661-8907</eissn><abstract>Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' academic performance, and a few research studies have developed a new hybrid approach, a combined classification and prediction method in this field. This study has assessed students’ performance using a hybrid method including a decision tree and multiple linear regression to predict their possibility of graduation. Specifically, the decision tree model is used to classify the ‘Adequate’ and ‘Fair’ classes. Then, multiple linear regression models were used to predict future Cumulative Grade Point Average (CGPA). 
After evaluating the statistics, the first and second coursework scores exhibit a significant impact on the results. Other attributes such as time spent at the campus or the number of times that students failed in the previous semester should be considered in this context. The decision tree model’s accuracy is 0.47 and the Correlation Coefficient of the multiple linear models is 0.52. The result of this research is an equation with a specific weighted score toward the final results. This, in turn, would ensure early and appropriate actions from education to increase the academic achievement of such students.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s42979-022-01251-5</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2661-8907 |
ispartof | SN computer science, 2022-07, Vol.3 (5), p.393, Article 393 |
issn | 2661-8907; 2662-995X; 2661-8907 |
language | eng |
recordid | cdi_proquest_journals_2938260562 |
source | SpringerLink Journals - AutoHoldings; ProQuest Central |
subjects | Academic achievement; Accuracy; Algorithms; Behavior; Computer Imaging; Computer Science; Computer Systems Organization and Communication Networks; Correlation coefficients; Data Structures and Information Theory; Datasets; Decision trees; Demographics; Distance learning; Education; Family income; Future Data and Security Engineering 2021; Gender; Information Systems and Communication Service; Learning; Machine learning; Model accuracy; Neural networks; Original Research; Pattern Recognition and Graphics; Performance prediction; Regression analysis; Regression models; Secondary schools; Software Engineering/Programming and Operating Systems; Students; Success; Support vector machines; Teachers; Vision |
title | A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T09%3A49%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Hybrid%20Approach%20Using%20Decision%20Tree%20and%20Multiple%20Linear%20Regression%20for%20Predicting%20Students%E2%80%99%20Performance%20Based%20on%20Learning%20Progress%20and%20Behavior&rft.jtitle=SN%20computer%20science&rft.au=Dang,%20Tran%20Khanh&rft.date=2022-07-25&rft.volume=3&rft.issue=5&rft.spage=393&rft.pages=393-&rft.artnum=393&rft.issn=2661-8907&rft.eissn=2661-8907&rft_id=info:doi/10.1007/s42979-022-01251-5&rft_dat=%3Cproquest_cross%3E2938260562%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2938260562&rft_id=info:pmid/&rfr_iscdi=true |