A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior

Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' acad...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SN computer science 2022-07, Vol.3 (5), p.393, Article 393
Hauptverfasser: Dang, Tran Khanh, Nguyen, Huu Huong Xuan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 5
container_start_page 393
container_title SN computer science
container_volume 3
creator Dang, Tran Khanh
Nguyen, Huu Huong Xuan
description Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' academic performance, and a few research studies have developed a new hybrid approach, a combined classification and prediction method in this field. This study has assessed students’ performance using a hybrid method including a decision tree and multiple linear regression to predict their possibility of graduation. Specifically, the decision tree model is used to classify the ‘Adequate’ and ‘Fair’ classes. Then, multiple linear regression models were used to predict future Cumulative Grade Point Average (CGPA). After evaluating the statistics, the first and second coursework scores exhibit a significant impact on the results. Other attributes such as time spent at the campus or the number of times that students failed in the previous semester should be considered in this context. The decision tree model’s accuracy is 0.47 and the Correlation Coefficient of the multiple linear models is 0.52. The result of this research is an equation with a specific weighted score toward the final results. This, in turn, would ensure early and appropriate actions from education to increase the academic achievement of such students.
doi_str_mv 10.1007/s42979-022-01251-5
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2938260562</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2938260562</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1155-7313086ca95ef2145f933338e87fa46434b818a3d623441ca34dc6f5bd74380b3</originalsourceid><addsrcrecordid>eNp9kMtOAjEUhhujiQR5AVdNXI_2OpcleMNkjERh3ZTOGSiBDraDCTvfwZWv55NYwERXdtMm_b7_5PwInVNySQnJroJgRVYkhLGEUCZpIo9Qh6UpTfKCZMd_3qeoF8KCEMIkESKVHfTRx8Pt1NsK99dr32gzx5Ng3QzfgLHBNg6PPQDWrsKPm2Vr10vApXWgPX6GmYewZ-rG45GHypp25760mwpcG77eP_EIfPxdaWcAD3SACke-jL7bkSPf7EP2AwYw12-28WfopNbLAL2fu4smd7fj62FSPt0_XPfLxFAqZZJxykmeGl1IqBkVsi54PDnkWa1FKriY5jTXvEoZF4IazUVl0lpOq0zwnEx5F10ccuPirxsIrVo0G-_iSMUKnrOUyKh2ETtQxjcheKjV2tuV9ltFidr1rw79q9i_2vevZJT4QQoRdjPwv9H_WN_QGolh</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2938260562</pqid></control><display><type>article</type><title>A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior</title><source>SpringerLink Journals - AutoHoldings</source><source>ProQuest Central</source><creator>Dang, Tran Khanh ; Nguyen, Huu Huong Xuan</creator><creatorcontrib>Dang, Tran Khanh ; Nguyen, Huu Huong Xuan</creatorcontrib><description>Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' academic performance, and a few research studies have developed a new hybrid approach, a combined classification and prediction method in this field. This study has assessed students’ performance using a hybrid method including a decision tree and multiple linear regression to predict their possibility of graduation. Specifically, the decision tree model is used to classify the ‘Adequate’ and ‘Fair’ classes. Then, multiple linear regression models were used to predict future Cumulative Grade Point Average (CGPA). After evaluating the statistics, the first and second coursework scores exhibit a significant impact on the results. Other attributes such as time spent at the campus or the number of times that students failed in the previous semester should be considered in this context. The decision tree model’s accuracy is 0.47 and the Correlation Coefficient of the multiple linear models is 0.52. The result of this research is an equation with a specific weighted score toward the final results. This, in turn, would ensure early and appropriate actions from education to increase the academic achievement of such students.</description><identifier>ISSN: 2661-8907</identifier><identifier>ISSN: 2662-995X</identifier><identifier>EISSN: 2661-8907</identifier><identifier>DOI: 10.1007/s42979-022-01251-5</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Academic achievement ; Accuracy ; Algorithms ; Behavior ; Computer Imaging ; Computer Science ; Computer Systems Organization and Communication Networks ; Correlation coefficients ; Data Structures and Information Theory ; Datasets ; Decision trees ; Demographics ; Distance learning ; Education ; Family income ; Future Data and Security Engineering 2021 ; Gender ; Information Systems and Communication Service ; Learning ; Machine learning ; Model accuracy ; Neural networks ; Original Research ; Pattern Recognition and Graphics ; Performance prediction ; Regression analysis ; Regression models ; Secondary schools ; Software Engineering/Programming and Operating Systems ; Students ; Success ; Support vector machines ; Teachers ; Vision</subject><ispartof>SN computer science, 2022-07, Vol.3 (5), p.393, Article 393</ispartof><rights>The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2022</rights><rights>The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1155-7313086ca95ef2145f933338e87fa46434b818a3d623441ca34dc6f5bd74380b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s42979-022-01251-5$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2938260562?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,776,780,21367,27901,27902,33721,41464,42533,43781,51294</link.rule.ids></links><search><creatorcontrib>Dang, Tran Khanh</creatorcontrib><creatorcontrib>Nguyen, Huu Huong Xuan</creatorcontrib><title>A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior</title><title>SN computer science</title><addtitle>SN COMPUT. SCI</addtitle><description>Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' academic performance, and a few research studies have developed a new hybrid approach, a combined classification and prediction method in this field. This study has assessed students’ performance using a hybrid method including a decision tree and multiple linear regression to predict their possibility of graduation. Specifically, the decision tree model is used to classify the ‘Adequate’ and ‘Fair’ classes. Then, multiple linear regression models were used to predict future Cumulative Grade Point Average (CGPA). After evaluating the statistics, the first and second coursework scores exhibit a significant impact on the results. Other attributes such as time spent at the campus or the number of times that students failed in the previous semester should be considered in this context. The decision tree model’s accuracy is 0.47 and the Correlation Coefficient of the multiple linear models is 0.52. The result of this research is an equation with a specific weighted score toward the final results. This, in turn, would ensure early and appropriate actions from education to increase the academic achievement of such students.</description><subject>Academic achievement</subject><subject>Accuracy</subject><subject>Algorithms</subject><subject>Behavior</subject><subject>Computer Imaging</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Correlation coefficients</subject><subject>Data Structures and Information Theory</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Demographics</subject><subject>Distance learning</subject><subject>Education</subject><subject>Family income</subject><subject>Future Data and Security Engineering 2021</subject><subject>Gender</subject><subject>Information Systems and Communication Service</subject><subject>Learning</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Neural networks</subject><subject>Original Research</subject><subject>Pattern Recognition and Graphics</subject><subject>Performance prediction</subject><subject>Regression analysis</subject><subject>Regression models</subject><subject>Secondary schools</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Students</subject><subject>Success</subject><subject>Support vector machines</subject><subject>Teachers</subject><subject>Vision</subject><issn>2661-8907</issn><issn>2662-995X</issn><issn>2661-8907</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp9kMtOAjEUhhujiQR5AVdNXI_2OpcleMNkjERh3ZTOGSiBDraDCTvfwZWv55NYwERXdtMm_b7_5PwInVNySQnJroJgRVYkhLGEUCZpIo9Qh6UpTfKCZMd_3qeoF8KCEMIkESKVHfTRx8Pt1NsK99dr32gzx5Ng3QzfgLHBNg6PPQDWrsKPm2Vr10vApXWgPX6GmYewZ-rG45GHypp25760mwpcG77eP_EIfPxdaWcAD3SACke-jL7bkSPf7EP2AwYw12-28WfopNbLAL2fu4smd7fj62FSPt0_XPfLxFAqZZJxykmeGl1IqBkVsi54PDnkWa1FKriY5jTXvEoZF4IazUVl0lpOq0zwnEx5F10ccuPirxsIrVo0G-_iSMUKnrOUyKh2ETtQxjcheKjV2tuV9ltFidr1rw79q9i_2vevZJT4QQoRdjPwv9H_WN_QGolh</recordid><startdate>20220725</startdate><enddate>20220725</enddate><creator>Dang, Tran Khanh</creator><creator>Nguyen, Huu Huong Xuan</creator><general>Springer Nature Singapore</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>20220725</creationdate><title>A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior</title><author>Dang, Tran Khanh ; Nguyen, Huu Huong Xuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1155-7313086ca95ef2145f933338e87fa46434b818a3d623441ca34dc6f5bd74380b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Academic achievement</topic><topic>Accuracy</topic><topic>Algorithms</topic><topic>Behavior</topic><topic>Computer Imaging</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Correlation coefficients</topic><topic>Data Structures and Information Theory</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Demographics</topic><topic>Distance learning</topic><topic>Education</topic><topic>Family income</topic><topic>Future Data and Security Engineering 2021</topic><topic>Gender</topic><topic>Information Systems and Communication Service</topic><topic>Learning</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Neural networks</topic><topic>Original Research</topic><topic>Pattern Recognition and Graphics</topic><topic>Performance prediction</topic><topic>Regression analysis</topic><topic>Regression models</topic><topic>Secondary schools</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Students</topic><topic>Success</topic><topic>Support vector machines</topic><topic>Teachers</topic><topic>Vision</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dang, Tran Khanh</creatorcontrib><creatorcontrib>Nguyen, Huu Huong Xuan</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>SN computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dang, Tran Khanh</au><au>Nguyen, Huu Huong Xuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior</atitle><jtitle>SN computer science</jtitle><stitle>SN COMPUT. SCI</stitle><date>2022-07-25</date><risdate>2022</risdate><volume>3</volume><issue>5</issue><spage>393</spage><pages>393-</pages><artnum>393</artnum><issn>2661-8907</issn><issn>2662-995X</issn><eissn>2661-8907</eissn><abstract>Analyzing factors related to learning progress such as coursework scores, how many times students were occasion, plagiarism or failure, and time spent at the library helps to determine factors in the reduction of dropouts. Many researchers have used traditional methods to predict students' academic performance, and a few research studies have developed a new hybrid approach, a combined classification and prediction method in this field. This study has assessed students’ performance using a hybrid method including a decision tree and multiple linear regression to predict their possibility of graduation. Specifically, the decision tree model is used to classify the ‘Adequate’ and ‘Fair’ classes. Then, multiple linear regression models were used to predict future Cumulative Grade Point Average (CGPA). After evaluating the statistics, the first and second coursework scores exhibit a significant impact on the results. Other attributes such as time spent at the campus or the number of times that students failed in the previous semester should be considered in this context. The decision tree model’s accuracy is 0.47 and the Correlation Coefficient of the multiple linear models is 0.52. The result of this research is an equation with a specific weighted score toward the final results. This, in turn, would ensure early and appropriate actions from education to increase the academic achievement of such students.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s42979-022-01251-5</doi></addata></record>
fulltext fulltext
identifier ISSN: 2661-8907
ispartof SN computer science, 2022-07, Vol.3 (5), p.393, Article 393
issn 2661-8907
2662-995X
2661-8907
language eng
recordid cdi_proquest_journals_2938260562
source SpringerLink Journals - AutoHoldings; ProQuest Central
subjects Academic achievement
Accuracy
Algorithms
Behavior
Computer Imaging
Computer Science
Computer Systems Organization and Communication Networks
Correlation coefficients
Data Structures and Information Theory
Datasets
Decision trees
Demographics
Distance learning
Education
Family income
Future Data and Security Engineering 2021
Gender
Information Systems and Communication Service
Learning
Machine learning
Model accuracy
Neural networks
Original Research
Pattern Recognition and Graphics
Performance prediction
Regression analysis
Regression models
Secondary schools
Software Engineering/Programming and Operating Systems
Students
Success
Support vector machines
Teachers
Vision
title A Hybrid Approach Using Decision Tree and Multiple Linear Regression for Predicting Students’ Performance Based on Learning Progress and Behavior
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T09%3A49%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Hybrid%20Approach%20Using%20Decision%20Tree%20and%20Multiple%20Linear%20Regression%20for%20Predicting%20Students%E2%80%99%20Performance%20Based%20on%20Learning%20Progress%20and%20Behavior&rft.jtitle=SN%20computer%20science&rft.au=Dang,%20Tran%20Khanh&rft.date=2022-07-25&rft.volume=3&rft.issue=5&rft.spage=393&rft.pages=393-&rft.artnum=393&rft.issn=2661-8907&rft.eissn=2661-8907&rft_id=info:doi/10.1007/s42979-022-01251-5&rft_dat=%3Cproquest_cross%3E2938260562%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2938260562&rft_id=info:pmid/&rfr_iscdi=true