Subject Cross Validation in Human Activity Recognition

K-fold Cross Validation is commonly used to evaluate classifiers and tune their hyperparameters. However, it assumes that data points are Independent and Identically Distributed (i.i.d.) so that samples used in the training and test sets can be selected randomly and uniformly. In Human Activity Reco...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Dehghani, Akbar, Glatard, Tristan, Shihab, Emad
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Learning Statistics - Machine Learning
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Dehghani, Akbar Glatard, Tristan Shihab, Emad
description	K-fold Cross Validation is commonly used to evaluate classifiers and tune their hyperparameters. However, it assumes that data points are Independent and Identically Distributed (i.i.d.) so that samples used in the training and test sets can be selected randomly and uniformly. In Human Activity Recognition datasets, we note that the samples produced by the same subjects are likely to be correlated due to diverse factors. Hence, k-fold cross validation may overestimate the performance of activity recognizers, in particular when overlapping sliding windows are used. In this paper, we investigate the effect of Subject Cross Validation on the performance of Human Activity Recognition, both with non-overlapping and with overlapping sliding windows. Results show that k-fold cross validation artificially increases the performance of recognizers by about 10%, and even by 16% when overlapping windows are used. In addition, we do not observe any performance gain from the use of overlapping windows. We conclude that Human Activity Recognition systems should be evaluated by Subject Cross Validation, and that overlapping windows are not worth their extra computational cost.
doi_str_mv	10.48550/arxiv.1904.02666
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1904_02666</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1904_02666</sourcerecordid><originalsourceid>FETCH-LOGICAL-a1156-d690db385384d6ea0bacfec4c6ff8c73dd72419ea8fea6c47a21cb9c7c847d1e3</originalsourceid><addsrcrecordid>eNotj82KwjAURrNxIToP4Mq8QGvSpDfpUoqOA4IwP27L7U0yZNBW2ir69qLj6iw-OHyHsZkUqbZ5LhbYXeMllYXQqcgAYMzg61z_eRp42bV9z_d4iA6H2DY8NnxzPmLDlzTESxxu_NNT-9vExzplo4CH3r-9OGE_69V3uUm2u_ePcrlNUMocEgeFcLWyubLagUdRIwVPmiAES0Y5ZzItC482eATSBjNJdUGGrDZOejVh83_v83l16uIRu1v1KKieBeoOfzFBkg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Subject Cross Validation in Human Activity Recognition</title><source>arXiv.org</source><creator>Dehghani, Akbar ; Glatard, Tristan ; Shihab, Emad</creator><creatorcontrib>Dehghani, Akbar ; Glatard, Tristan ; Shihab, Emad</creatorcontrib><description>K-fold Cross Validation is commonly used to evaluate classifiers and tune their hyperparameters. However, it assumes that data points are Independent and Identically Distributed (i.i.d.) so that samples used in the training and test sets can be selected randomly and uniformly. In Human Activity Recognition datasets, we note that the samples produced by the same subjects are likely to be correlated due to diverse factors. Hence, k-fold cross validation may overestimate the performance of activity recognizers, in particular when overlapping sliding windows are used. In this paper, we investigate the effect of Subject Cross Validation on the performance of Human Activity Recognition, both with non-overlapping and with overlapping sliding windows. Results show that k-fold cross validation artificially increases the performance of recognizers by about 10%, and even by 16% when overlapping windows are used. In addition, we do not observe any performance gain from the use of overlapping windows. We conclude that Human Activity Recognition systems should be evaluated by Subject Cross Validation, and that overlapping windows are not worth their extra computational cost.</description><identifier>DOI: 10.48550/arxiv.1904.02666</identifier><language>eng</language><subject>Computer Science - Learning ; Statistics - Machine Learning</subject><creationdate>2019-04</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a1156-d690db385384d6ea0bacfec4c6ff8c73dd72419ea8fea6c47a21cb9c7c847d1e3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1904.02666$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1904.02666$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Dehghani, Akbar</creatorcontrib><creatorcontrib>Glatard, Tristan</creatorcontrib><creatorcontrib>Shihab, Emad</creatorcontrib><title>Subject Cross Validation in Human Activity Recognition</title><description>K-fold Cross Validation is commonly used to evaluate classifiers and tune their hyperparameters. However, it assumes that data points are Independent and Identically Distributed (i.i.d.) so that samples used in the training and test sets can be selected randomly and uniformly. In Human Activity Recognition datasets, we note that the samples produced by the same subjects are likely to be correlated due to diverse factors. Hence, k-fold cross validation may overestimate the performance of activity recognizers, in particular when overlapping sliding windows are used. In this paper, we investigate the effect of Subject Cross Validation on the performance of Human Activity Recognition, both with non-overlapping and with overlapping sliding windows. Results show that k-fold cross validation artificially increases the performance of recognizers by about 10%, and even by 16% when overlapping windows are used. In addition, we do not observe any performance gain from the use of overlapping windows. We conclude that Human Activity Recognition systems should be evaluated by Subject Cross Validation, and that overlapping windows are not worth their extra computational cost.</description><subject>Computer Science - Learning</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj82KwjAURrNxIToP4Mq8QGvSpDfpUoqOA4IwP27L7U0yZNBW2ir69qLj6iw-OHyHsZkUqbZ5LhbYXeMllYXQqcgAYMzg61z_eRp42bV9z_d4iA6H2DY8NnxzPmLDlzTESxxu_NNT-9vExzplo4CH3r-9OGE_69V3uUm2u_ePcrlNUMocEgeFcLWyubLagUdRIwVPmiAES0Y5ZzItC482eATSBjNJdUGGrDZOejVh83_v83l16uIRu1v1KKieBeoOfzFBkg</recordid><startdate>20190404</startdate><enddate>20190404</enddate><creator>Dehghani, Akbar</creator><creator>Glatard, Tristan</creator><creator>Shihab, Emad</creator><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20190404</creationdate><title>Subject Cross Validation in Human Activity Recognition</title><author>Dehghani, Akbar ; Glatard, Tristan ; Shihab, Emad</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a1156-d690db385384d6ea0bacfec4c6ff8c73dd72419ea8fea6c47a21cb9c7c847d1e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Learning</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Dehghani, Akbar</creatorcontrib><creatorcontrib>Glatard, Tristan</creatorcontrib><creatorcontrib>Shihab, Emad</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Dehghani, Akbar</au><au>Glatard, Tristan</au><au>Shihab, Emad</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Subject Cross Validation in Human Activity Recognition</atitle><date>2019-04-04</date><risdate>2019</risdate><abstract>K-fold Cross Validation is commonly used to evaluate classifiers and tune their hyperparameters. However, it assumes that data points are Independent and Identically Distributed (i.i.d.) so that samples used in the training and test sets can be selected randomly and uniformly. In Human Activity Recognition datasets, we note that the samples produced by the same subjects are likely to be correlated due to diverse factors. Hence, k-fold cross validation may overestimate the performance of activity recognizers, in particular when overlapping sliding windows are used. In this paper, we investigate the effect of Subject Cross Validation on the performance of Human Activity Recognition, both with non-overlapping and with overlapping sliding windows. Results show that k-fold cross validation artificially increases the performance of recognizers by about 10%, and even by 16% when overlapping windows are used. In addition, we do not observe any performance gain from the use of overlapping windows. We conclude that Human Activity Recognition systems should be evaluated by Subject Cross Validation, and that overlapping windows are not worth their extra computational cost.</abstract><doi>10.48550/arxiv.1904.02666</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1904.02666
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1904_02666
source	arXiv.org
subjects	Computer Science - Learning Statistics - Machine Learning
title	Subject Cross Validation in Human Activity Recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T03%3A14%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Subject%20Cross%20Validation%20in%20Human%20Activity%20Recognition&rft.au=Dehghani,%20Akbar&rft.date=2019-04-04&rft_id=info:doi/10.48550/arxiv.1904.02666&rft_dat=%3Carxiv_GOX%3E1904_02666%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true