Making kernel-based vector quantization robust and effective for incomplete educational data clustering

Nowadays, knowledge discovered from educational data sets plays an important role in educational decision making support. One kind of such knowledge that enables us to get insights into our students’ characteristics is cluster models generated by a clustering task. Each cluster model presents the gr...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Vietnam journal of computer science 2016-05, Vol.3 (2), p.93-102
Hauptverfasser: Vo, Thi Ngoc Chau, Nguyen, Hua Phung, Vo, Thi Ngoc Tran
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 102
container_issue 2
container_start_page 93
container_title Vietnam journal of computer science
container_volume 3
creator Vo, Thi Ngoc Chau
Nguyen, Hua Phung
Vo, Thi Ngoc Tran
description Nowadays, knowledge discovered from educational data sets plays an important role in educational decision making support. One kind of such knowledge that enables us to get insights into our students’ characteristics is cluster models generated by a clustering task. Each cluster model presents the groups of similar students by several aspects such as study performance, behavior, skill, etc. Many recent educational data clustering works used the existing algorithms like k -means, expectation–maximization, spectral clustering, etc. Nevertheless, none of them considered the incompleteness of the educational data gathered in an academic credit system although incomplete data handling was figured out well with several different general-purpose solutions. Unfortunately, early in-trouble student detection normally faces data incompleteness as we have collected and processed the study results of the second-, third-, and fourth-year students who have not yet accomplished the program as of that moment. In this situation, the clustering task becomes an inevitable incomplete educational data clustering task. Hence, our work focuses on an incomplete educational data clustering approach to the aforementioned task. Following kernel-based vector quantization, we define a robust effective simple solution, named VQ_fk_nps, which is able to not only handle ubiquitous data incompleteness in an iterative manner using the nearest prototype strategy but also optimize the clusters in the feature space to reach the resulting clusters with arbitrary shapes in the data space. As shown through the experimental results on real educational data sets, the clusters from our solution have better cluster quality as compared to some existing approaches.
doi_str_mv 10.1007/s40595-016-0060-6
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1816009898</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>4042246611</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2376-51f0f7fe94d837df9dc88ebbe9bfbea6c6a3749fefc055f4fc74955ae04e4f403</originalsourceid><addsrcrecordid>eNp1kE1LxDAQhoMouOj-AG8BL16qk37k4yiLX7DiRc8hTSdLd7ttN2kX9NebtSIieJoZeN53Zl5CLhhcMwBxE3IoVJEA4wkAh4QfkVnKFE-kVPz4p5fylMxDWAMAU4xnCmZk9Ww2dbuiG_QtNklpAlZ0j3boPN2Nph3qDzPUXUt9V45hoKatKDoXgXqP1EWqbm237RsckGI12i_aNLQyg6G2iRr0ccE5OXGmCTj_rmfk7f7udfGYLF8enha3y8SmmeBJwRw44VDllcxE5VRlpcSyRFW6Eg233GQiVw6dhaJwubNxKgqDkGPucsjOyNXk2_tuN2IY9LYOFpvGtNiNQTPJOICSSkb08g-67kYfb4-UkIXiqRAiUmyirO9C8Oh07-ut8e-agT6kr6f0dUxfH9LXPGrSSRP6w-_ofzn_K_oEFyaJsg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1785962777</pqid></control><display><type>article</type><title>Making kernel-based vector quantization robust and effective for incomplete educational data clustering</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Springer Nature OA Free Journals</source><creator>Vo, Thi Ngoc Chau ; Nguyen, Hua Phung ; Vo, Thi Ngoc Tran</creator><creatorcontrib>Vo, Thi Ngoc Chau ; Nguyen, Hua Phung ; Vo, Thi Ngoc Tran</creatorcontrib><description>Nowadays, knowledge discovered from educational data sets plays an important role in educational decision making support. One kind of such knowledge that enables us to get insights into our students’ characteristics is cluster models generated by a clustering task. Each cluster model presents the groups of similar students by several aspects such as study performance, behavior, skill, etc. Many recent educational data clustering works used the existing algorithms like k -means, expectation–maximization, spectral clustering, etc. Nevertheless, none of them considered the incompleteness of the educational data gathered in an academic credit system although incomplete data handling was figured out well with several different general-purpose solutions. Unfortunately, early in-trouble student detection normally faces data incompleteness as we have collected and processed the study results of the second-, third-, and fourth-year students who have not yet accomplished the program as of that moment. In this situation, the clustering task becomes an inevitable incomplete educational data clustering task. Hence, our work focuses on an incomplete educational data clustering approach to the aforementioned task. Following kernel-based vector quantization, we define a robust effective simple solution, named VQ_fk_nps, which is able to not only handle ubiquitous data incompleteness in an iterative manner using the nearest prototype strategy but also optimize the clusters in the feature space to reach the resulting clusters with arbitrary shapes in the data space. As shown through the experimental results on real educational data sets, the clusters from our solution have better cluster quality as compared to some existing approaches.</description><identifier>ISSN: 2196-8888</identifier><identifier>EISSN: 2196-8896</identifier><identifier>DOI: 10.1007/s40595-016-0060-6</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Artificial Intelligence ; Clustering ; Clusters ; Computational Intelligence ; Computer Applications ; Computer Science ; Computer Systems Organization and Communication Networks ; e-Commerce/e-business ; Education ; Information Systems and Communication Service ; Mathematical models ; Regular Paper ; Students ; Tasks ; Vector quantization</subject><ispartof>Vietnam journal of computer science, 2016-05, Vol.3 (2), p.93-102</ispartof><rights>The Author(s) 2016</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2376-51f0f7fe94d837df9dc88ebbe9bfbea6c6a3749fefc055f4fc74955ae04e4f403</citedby><cites>FETCH-LOGICAL-c2376-51f0f7fe94d837df9dc88ebbe9bfbea6c6a3749fefc055f4fc74955ae04e4f403</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s40595-016-0060-6$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://doi.org/10.1007/s40595-016-0060-6$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,862,27911,27912,41107,42176,51563</link.rule.ids></links><search><creatorcontrib>Vo, Thi Ngoc Chau</creatorcontrib><creatorcontrib>Nguyen, Hua Phung</creatorcontrib><creatorcontrib>Vo, Thi Ngoc Tran</creatorcontrib><title>Making kernel-based vector quantization robust and effective for incomplete educational data clustering</title><title>Vietnam journal of computer science</title><addtitle>Vietnam J Comput Sci</addtitle><description>Nowadays, knowledge discovered from educational data sets plays an important role in educational decision making support. One kind of such knowledge that enables us to get insights into our students’ characteristics is cluster models generated by a clustering task. Each cluster model presents the groups of similar students by several aspects such as study performance, behavior, skill, etc. Many recent educational data clustering works used the existing algorithms like k -means, expectation–maximization, spectral clustering, etc. Nevertheless, none of them considered the incompleteness of the educational data gathered in an academic credit system although incomplete data handling was figured out well with several different general-purpose solutions. Unfortunately, early in-trouble student detection normally faces data incompleteness as we have collected and processed the study results of the second-, third-, and fourth-year students who have not yet accomplished the program as of that moment. In this situation, the clustering task becomes an inevitable incomplete educational data clustering task. Hence, our work focuses on an incomplete educational data clustering approach to the aforementioned task. Following kernel-based vector quantization, we define a robust effective simple solution, named VQ_fk_nps, which is able to not only handle ubiquitous data incompleteness in an iterative manner using the nearest prototype strategy but also optimize the clusters in the feature space to reach the resulting clusters with arbitrary shapes in the data space. As shown through the experimental results on real educational data sets, the clusters from our solution have better cluster quality as compared to some existing approaches.</description><subject>Artificial Intelligence</subject><subject>Clustering</subject><subject>Clusters</subject><subject>Computational Intelligence</subject><subject>Computer Applications</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>e-Commerce/e-business</subject><subject>Education</subject><subject>Information Systems and Communication Service</subject><subject>Mathematical models</subject><subject>Regular Paper</subject><subject>Students</subject><subject>Tasks</subject><subject>Vector quantization</subject><issn>2196-8888</issn><issn>2196-8896</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp1kE1LxDAQhoMouOj-AG8BL16qk37k4yiLX7DiRc8hTSdLd7ttN2kX9NebtSIieJoZeN53Zl5CLhhcMwBxE3IoVJEA4wkAh4QfkVnKFE-kVPz4p5fylMxDWAMAU4xnCmZk9Ww2dbuiG_QtNklpAlZ0j3boPN2Nph3qDzPUXUt9V45hoKatKDoXgXqP1EWqbm237RsckGI12i_aNLQyg6G2iRr0ccE5OXGmCTj_rmfk7f7udfGYLF8enha3y8SmmeBJwRw44VDllcxE5VRlpcSyRFW6Eg233GQiVw6dhaJwubNxKgqDkGPucsjOyNXk2_tuN2IY9LYOFpvGtNiNQTPJOICSSkb08g-67kYfb4-UkIXiqRAiUmyirO9C8Oh07-ut8e-agT6kr6f0dUxfH9LXPGrSSRP6w-_ofzn_K_oEFyaJsg</recordid><startdate>20160501</startdate><enddate>20160501</enddate><creator>Vo, Thi Ngoc Chau</creator><creator>Nguyen, Hua Phung</creator><creator>Vo, Thi Ngoc Tran</creator><general>Springer Berlin Heidelberg</general><general>World Scientific Publishing Co. Pte., Ltd</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20160501</creationdate><title>Making kernel-based vector quantization robust and effective for incomplete educational data clustering</title><author>Vo, Thi Ngoc Chau ; Nguyen, Hua Phung ; Vo, Thi Ngoc Tran</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2376-51f0f7fe94d837df9dc88ebbe9bfbea6c6a3749fefc055f4fc74955ae04e4f403</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Artificial Intelligence</topic><topic>Clustering</topic><topic>Clusters</topic><topic>Computational Intelligence</topic><topic>Computer Applications</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>e-Commerce/e-business</topic><topic>Education</topic><topic>Information Systems and Communication Service</topic><topic>Mathematical models</topic><topic>Regular Paper</topic><topic>Students</topic><topic>Tasks</topic><topic>Vector quantization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Vo, Thi Ngoc Chau</creatorcontrib><creatorcontrib>Nguyen, Hua Phung</creatorcontrib><creatorcontrib>Vo, Thi Ngoc Tran</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><jtitle>Vietnam journal of computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Vo, Thi Ngoc Chau</au><au>Nguyen, Hua Phung</au><au>Vo, Thi Ngoc Tran</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Making kernel-based vector quantization robust and effective for incomplete educational data clustering</atitle><jtitle>Vietnam journal of computer science</jtitle><stitle>Vietnam J Comput Sci</stitle><date>2016-05-01</date><risdate>2016</risdate><volume>3</volume><issue>2</issue><spage>93</spage><epage>102</epage><pages>93-102</pages><issn>2196-8888</issn><eissn>2196-8896</eissn><abstract>Nowadays, knowledge discovered from educational data sets plays an important role in educational decision making support. One kind of such knowledge that enables us to get insights into our students’ characteristics is cluster models generated by a clustering task. Each cluster model presents the groups of similar students by several aspects such as study performance, behavior, skill, etc. Many recent educational data clustering works used the existing algorithms like k -means, expectation–maximization, spectral clustering, etc. Nevertheless, none of them considered the incompleteness of the educational data gathered in an academic credit system although incomplete data handling was figured out well with several different general-purpose solutions. Unfortunately, early in-trouble student detection normally faces data incompleteness as we have collected and processed the study results of the second-, third-, and fourth-year students who have not yet accomplished the program as of that moment. In this situation, the clustering task becomes an inevitable incomplete educational data clustering task. Hence, our work focuses on an incomplete educational data clustering approach to the aforementioned task. Following kernel-based vector quantization, we define a robust effective simple solution, named VQ_fk_nps, which is able to not only handle ubiquitous data incompleteness in an iterative manner using the nearest prototype strategy but also optimize the clusters in the feature space to reach the resulting clusters with arbitrary shapes in the data space. As shown through the experimental results on real educational data sets, the clusters from our solution have better cluster quality as compared to some existing approaches.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s40595-016-0060-6</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2196-8888
ispartof Vietnam journal of computer science, 2016-05, Vol.3 (2), p.93-102
issn 2196-8888
2196-8896
language eng
recordid cdi_proquest_miscellaneous_1816009898
source DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Springer Nature OA Free Journals
subjects Artificial Intelligence
Clustering
Clusters
Computational Intelligence
Computer Applications
Computer Science
Computer Systems Organization and Communication Networks
e-Commerce/e-business
Education
Information Systems and Communication Service
Mathematical models
Regular Paper
Students
Tasks
Vector quantization
title Making kernel-based vector quantization robust and effective for incomplete educational data clustering
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T22%3A42%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Making%20kernel-based%20vector%20quantization%20robust%20and%20effective%20for%20incomplete%20educational%20data%20clustering&rft.jtitle=Vietnam%20journal%20of%20computer%20science&rft.au=Vo,%20Thi%20Ngoc%20Chau&rft.date=2016-05-01&rft.volume=3&rft.issue=2&rft.spage=93&rft.epage=102&rft.pages=93-102&rft.issn=2196-8888&rft.eissn=2196-8896&rft_id=info:doi/10.1007/s40595-016-0060-6&rft_dat=%3Cproquest_cross%3E4042246611%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1785962777&rft_id=info:pmid/&rfr_iscdi=true