Enhancing Encrypted Internet Traffic Classification Through Advanced Data Augmentation Techniques

The increasing popularity of online services has made Internet Traffic Classification a critical field of study. However, the rapid development of internet protocols and encryption limits usable data availability. This paper addresses the challenges of classifying encrypted internet traffic, focusin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2024-07
Hauptverfasser: Zion, Yehonatan, Porat Aharon, Dubin, Ran, Dvir, Amit, Chen Hajaj
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Zion, Yehonatan
Porat Aharon
Dubin, Ran
Dvir, Amit
Chen Hajaj
description The increasing popularity of online services has made Internet Traffic Classification a critical field of study. However, the rapid development of internet protocols and encryption limits usable data availability. This paper addresses the challenges of classifying encrypted internet traffic, focusing on the scarcity of open-source datasets and limitations of existing ones. We propose two Data Augmentation (DA) techniques to synthetically generate data based on real samples: Average augmentation and MTU augmentation. Both augmentations are aimed to improve the performance of the classifier, each from a different perspective: The Average augmentation aims to increase dataset size by generating new synthetic samples, while the MTU augmentation enhances classifier robustness to varying Maximum Transmission Units (MTUs). Our experiments, conducted on two well-known academic datasets and a commercial dataset, demonstrate the effectiveness of these approaches in improving model performance and mitigating constraints associated with limited and homogeneous datasets. Our findings underscore the potential of data augmentation in addressing the challenges of modern internet traffic classification. Specifically, we show that our augmentation techniques significantly enhance encrypted traffic classification models. This improvement can positively impact user Quality of Experience (QoE) by more accurately classifying traffic as video streaming (e.g., YouTube) or chat (e.g., Google Chat). Additionally, it can enhance Quality of Service (QoS) for file downloading activities (e.g., Google Docs).
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3084094283</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3084094283</sourcerecordid><originalsourceid>FETCH-proquest_journals_30840942833</originalsourceid><addsrcrecordid>eNqNi98KgjAcRkcQJOU7DLoW1qZll2JG3XsfY043sd9sf4LevgU-QFffB-ecFUooY4eszCndoNS5kRBCjydaFCxBvAHFQWgYcAPCfmYvO3wHLy1Ij1vL-14LXE_cOR0f99oAbpU1YVC46t6xjcGFe46rMDwl-EWRQoF-Bel2aN3zycl02S3aX5u2vmWzNT_uH6MJFiJ6MFLm5JzTkrH_rC91pkW7</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3084094283</pqid></control><display><type>article</type><title>Enhancing Encrypted Internet Traffic Classification Through Advanced Data Augmentation Techniques</title><source>Free E- Journals</source><creator>Zion, Yehonatan ; Porat Aharon ; Dubin, Ran ; Dvir, Amit ; Chen Hajaj</creator><creatorcontrib>Zion, Yehonatan ; Porat Aharon ; Dubin, Ran ; Dvir, Amit ; Chen Hajaj</creatorcontrib><description>The increasing popularity of online services has made Internet Traffic Classification a critical field of study. However, the rapid development of internet protocols and encryption limits usable data availability. This paper addresses the challenges of classifying encrypted internet traffic, focusing on the scarcity of open-source datasets and limitations of existing ones. We propose two Data Augmentation (DA) techniques to synthetically generate data based on real samples: Average augmentation and MTU augmentation. Both augmentations are aimed to improve the performance of the classifier, each from a different perspective: The Average augmentation aims to increase dataset size by generating new synthetic samples, while the MTU augmentation enhances classifier robustness to varying Maximum Transmission Units (MTUs). Our experiments, conducted on two well-known academic datasets and a commercial dataset, demonstrate the effectiveness of these approaches in improving model performance and mitigating constraints associated with limited and homogeneous datasets. Our findings underscore the potential of data augmentation in addressing the challenges of modern internet traffic classification. Specifically, we show that our augmentation techniques significantly enhance encrypted traffic classification models. This improvement can positively impact user Quality of Experience (QoE) by more accurately classifying traffic as video streaming (e.g., YouTube) or chat (e.g., Google Chat). Additionally, it can enhance Quality of Service (QoS) for file downloading activities (e.g., Google Docs).</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Classification ; Critical field (superconductivity) ; Data augmentation ; Datasets ; Electronic documents ; Encryption ; Internet ; Performance enhancement ; User experience ; Video transmission</subject><ispartof>arXiv.org, 2024-07</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>777,781</link.rule.ids></links><search><creatorcontrib>Zion, Yehonatan</creatorcontrib><creatorcontrib>Porat Aharon</creatorcontrib><creatorcontrib>Dubin, Ran</creatorcontrib><creatorcontrib>Dvir, Amit</creatorcontrib><creatorcontrib>Chen Hajaj</creatorcontrib><title>Enhancing Encrypted Internet Traffic Classification Through Advanced Data Augmentation Techniques</title><title>arXiv.org</title><description>The increasing popularity of online services has made Internet Traffic Classification a critical field of study. However, the rapid development of internet protocols and encryption limits usable data availability. This paper addresses the challenges of classifying encrypted internet traffic, focusing on the scarcity of open-source datasets and limitations of existing ones. We propose two Data Augmentation (DA) techniques to synthetically generate data based on real samples: Average augmentation and MTU augmentation. Both augmentations are aimed to improve the performance of the classifier, each from a different perspective: The Average augmentation aims to increase dataset size by generating new synthetic samples, while the MTU augmentation enhances classifier robustness to varying Maximum Transmission Units (MTUs). Our experiments, conducted on two well-known academic datasets and a commercial dataset, demonstrate the effectiveness of these approaches in improving model performance and mitigating constraints associated with limited and homogeneous datasets. Our findings underscore the potential of data augmentation in addressing the challenges of modern internet traffic classification. Specifically, we show that our augmentation techniques significantly enhance encrypted traffic classification models. This improvement can positively impact user Quality of Experience (QoE) by more accurately classifying traffic as video streaming (e.g., YouTube) or chat (e.g., Google Chat). Additionally, it can enhance Quality of Service (QoS) for file downloading activities (e.g., Google Docs).</description><subject>Classification</subject><subject>Critical field (superconductivity)</subject><subject>Data augmentation</subject><subject>Datasets</subject><subject>Electronic documents</subject><subject>Encryption</subject><subject>Internet</subject><subject>Performance enhancement</subject><subject>User experience</subject><subject>Video transmission</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNi98KgjAcRkcQJOU7DLoW1qZll2JG3XsfY043sd9sf4LevgU-QFffB-ecFUooY4eszCndoNS5kRBCjydaFCxBvAHFQWgYcAPCfmYvO3wHLy1Ij1vL-14LXE_cOR0f99oAbpU1YVC46t6xjcGFe46rMDwl-EWRQoF-Bel2aN3zycl02S3aX5u2vmWzNT_uH6MJFiJ6MFLm5JzTkrH_rC91pkW7</recordid><startdate>20240723</startdate><enddate>20240723</enddate><creator>Zion, Yehonatan</creator><creator>Porat Aharon</creator><creator>Dubin, Ran</creator><creator>Dvir, Amit</creator><creator>Chen Hajaj</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240723</creationdate><title>Enhancing Encrypted Internet Traffic Classification Through Advanced Data Augmentation Techniques</title><author>Zion, Yehonatan ; Porat Aharon ; Dubin, Ran ; Dvir, Amit ; Chen Hajaj</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_30840942833</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Classification</topic><topic>Critical field (superconductivity)</topic><topic>Data augmentation</topic><topic>Datasets</topic><topic>Electronic documents</topic><topic>Encryption</topic><topic>Internet</topic><topic>Performance enhancement</topic><topic>User experience</topic><topic>Video transmission</topic><toplevel>online_resources</toplevel><creatorcontrib>Zion, Yehonatan</creatorcontrib><creatorcontrib>Porat Aharon</creatorcontrib><creatorcontrib>Dubin, Ran</creatorcontrib><creatorcontrib>Dvir, Amit</creatorcontrib><creatorcontrib>Chen Hajaj</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zion, Yehonatan</au><au>Porat Aharon</au><au>Dubin, Ran</au><au>Dvir, Amit</au><au>Chen Hajaj</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Enhancing Encrypted Internet Traffic Classification Through Advanced Data Augmentation Techniques</atitle><jtitle>arXiv.org</jtitle><date>2024-07-23</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>The increasing popularity of online services has made Internet Traffic Classification a critical field of study. However, the rapid development of internet protocols and encryption limits usable data availability. This paper addresses the challenges of classifying encrypted internet traffic, focusing on the scarcity of open-source datasets and limitations of existing ones. We propose two Data Augmentation (DA) techniques to synthetically generate data based on real samples: Average augmentation and MTU augmentation. Both augmentations are aimed to improve the performance of the classifier, each from a different perspective: The Average augmentation aims to increase dataset size by generating new synthetic samples, while the MTU augmentation enhances classifier robustness to varying Maximum Transmission Units (MTUs). Our experiments, conducted on two well-known academic datasets and a commercial dataset, demonstrate the effectiveness of these approaches in improving model performance and mitigating constraints associated with limited and homogeneous datasets. Our findings underscore the potential of data augmentation in addressing the challenges of modern internet traffic classification. Specifically, we show that our augmentation techniques significantly enhance encrypted traffic classification models. This improvement can positively impact user Quality of Experience (QoE) by more accurately classifying traffic as video streaming (e.g., YouTube) or chat (e.g., Google Chat). Additionally, it can enhance Quality of Service (QoS) for file downloading activities (e.g., Google Docs).</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-07
issn 2331-8422
language eng
recordid cdi_proquest_journals_3084094283
source Free E- Journals
subjects Classification
Critical field (superconductivity)
Data augmentation
Datasets
Electronic documents
Encryption
Internet
Performance enhancement
User experience
Video transmission
title Enhancing Encrypted Internet Traffic Classification Through Advanced Data Augmentation Techniques
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T01%3A24%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Enhancing%20Encrypted%20Internet%20Traffic%20Classification%20Through%20Advanced%20Data%20Augmentation%20Techniques&rft.jtitle=arXiv.org&rft.au=Zion,%20Yehonatan&rft.date=2024-07-23&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3084094283%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3084094283&rft_id=info:pmid/&rfr_iscdi=true