Web traffic anomaly detection using C-LSTM neural networks

•We propose a C-LSTM neural network for effectively detecting anomalies in web traffic data.•CNN extracts spatial features and LSTM models temporal characteristics.•It outperforms the machine learning methods for Yahoo's Webscope S5 dataset.•We reveal the internal operation of anomaly detection...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2018-09, Vol.106, p.66-76
Hauptverfasser: Kim, Tae-Young, Cho, Sung-Bae
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 76
container_issue
container_start_page 66
container_title Expert systems with applications
container_volume 106
creator Kim, Tae-Young
Cho, Sung-Bae
description •We propose a C-LSTM neural network for effectively detecting anomalies in web traffic data.•CNN extracts spatial features and LSTM models temporal characteristics.•It outperforms the machine learning methods for Yahoo's Webscope S5 dataset.•We reveal the internal operation of anomaly detection process by t-SNE algorithm. Web traffic refers to the amount of data that is sent and received by people visiting online websites. Web traffic anomalies represent abnormal changes in time series traffic, and it is important to perform detection quickly and accurately for the efficient operation of complex computer networks systems. In this paper, we propose a C-LSTM neural network for effectively modeling the spatial and temporal information contained in traffic data, which is a one-dimensional time series signal. We also provide a method for automatically extracting robust features of spatial-temporal information from raw data. Experiments demonstrate that our C-LSTM method can extract more complex features by combining a convolutional neural network (CNN), long short-term memory (LSTM), and deep neural network (DNN). The CNN layer is used to reduce the frequency variation in spatial information; the LSTM layer is suitable for modeling time information; and the DNN layer is used to map data into a more separable space. Our C-LSTM method also achieves nearly perfect anomaly detection performance for web traffic data, even for very similar signals that were previously considered to be very difficult to classify. Finally, the C-LSTM method outperforms other state-of-the-art machine learning techniques on Yahoo's well-known Webscope S5 dataset, achieving an overall accuracy of 98.6% and recall of 89.7% on the test dataset.
doi_str_mv 10.1016/j.eswa.2018.04.004
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2083808274</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417418302288</els_id><sourcerecordid>2083808274</sourcerecordid><originalsourceid>FETCH-LOGICAL-c394t-7878d013b1807f3a9b20533f387969d9cca7b3484535a44cc17f0dcf6e1bdebe3</originalsourceid><addsrcrecordid>eNp9kD1PwzAURS0EEqHwB5giMSc8x05sIxZU8SUVMVDEaDnOM3Jok2InVP33pCoz013uue_pEHJJIadAq-s2x7g1eQFU5sBzAH5EEioFyyqh2DFJQJUi41TwU3IWYwtABYBIyM0H1ukQjHPepqbr12a1Sxsc0A6-79Ix-u4znWeLt-VL2uEYzGqKYduHr3hOTpxZRbz4yxl5f7hfzp-yxevj8_xukVmm-JAJKWQDlNVUgnDMqLqAkjHHpFCVapS1RtSMS16y0nBuLRUOGusqpHWDNbIZuTrsbkL_PWIcdNuPoZtO6gIkkyALwadWcWjZ0McY0OlN8GsTdpqC3jvSrd470ntHGrieHE3Q7QHC6f8fj0FH67Gz2PgwGdBN7__DfwGkp26h</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2083808274</pqid></control><display><type>article</type><title>Web traffic anomaly detection using C-LSTM neural networks</title><source>Elsevier ScienceDirect Journals</source><creator>Kim, Tae-Young ; Cho, Sung-Bae</creator><creatorcontrib>Kim, Tae-Young ; Cho, Sung-Bae</creatorcontrib><description>•We propose a C-LSTM neural network for effectively detecting anomalies in web traffic data.•CNN extracts spatial features and LSTM models temporal characteristics.•It outperforms the machine learning methods for Yahoo's Webscope S5 dataset.•We reveal the internal operation of anomaly detection process by t-SNE algorithm. Web traffic refers to the amount of data that is sent and received by people visiting online websites. Web traffic anomalies represent abnormal changes in time series traffic, and it is important to perform detection quickly and accurately for the efficient operation of complex computer networks systems. In this paper, we propose a C-LSTM neural network for effectively modeling the spatial and temporal information contained in traffic data, which is a one-dimensional time series signal. We also provide a method for automatically extracting robust features of spatial-temporal information from raw data. Experiments demonstrate that our C-LSTM method can extract more complex features by combining a convolutional neural network (CNN), long short-term memory (LSTM), and deep neural network (DNN). The CNN layer is used to reduce the frequency variation in spatial information; the LSTM layer is suitable for modeling time information; and the DNN layer is used to map data into a more separable space. Our C-LSTM method also achieves nearly perfect anomaly detection performance for web traffic data, even for very similar signals that were previously considered to be very difficult to classify. Finally, the C-LSTM method outperforms other state-of-the-art machine learning techniques on Yahoo's well-known Webscope S5 dataset, achieving an overall accuracy of 98.6% and recall of 89.7% on the test dataset.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2018.04.004</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Anomalies ; Anomaly detection ; Artificial intelligence ; Artificial neural networks ; C-LSTM ; Computer networks ; Deep learning ; Feature extraction ; Frequency variation ; Internet ; Machine learning ; Modelling ; Neural networks ; Spatial data ; System effectiveness ; Time series ; Traffic information ; Web services ; Web traffic ; Websites</subject><ispartof>Expert systems with applications, 2018-09, Vol.106, p.66-76</ispartof><rights>2018 Elsevier Ltd</rights><rights>Copyright Elsevier BV Sep 15, 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c394t-7878d013b1807f3a9b20533f387969d9cca7b3484535a44cc17f0dcf6e1bdebe3</citedby><cites>FETCH-LOGICAL-c394t-7878d013b1807f3a9b20533f387969d9cca7b3484535a44cc17f0dcf6e1bdebe3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0957417418302288$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Kim, Tae-Young</creatorcontrib><creatorcontrib>Cho, Sung-Bae</creatorcontrib><title>Web traffic anomaly detection using C-LSTM neural networks</title><title>Expert systems with applications</title><description>•We propose a C-LSTM neural network for effectively detecting anomalies in web traffic data.•CNN extracts spatial features and LSTM models temporal characteristics.•It outperforms the machine learning methods for Yahoo's Webscope S5 dataset.•We reveal the internal operation of anomaly detection process by t-SNE algorithm. Web traffic refers to the amount of data that is sent and received by people visiting online websites. Web traffic anomalies represent abnormal changes in time series traffic, and it is important to perform detection quickly and accurately for the efficient operation of complex computer networks systems. In this paper, we propose a C-LSTM neural network for effectively modeling the spatial and temporal information contained in traffic data, which is a one-dimensional time series signal. We also provide a method for automatically extracting robust features of spatial-temporal information from raw data. Experiments demonstrate that our C-LSTM method can extract more complex features by combining a convolutional neural network (CNN), long short-term memory (LSTM), and deep neural network (DNN). The CNN layer is used to reduce the frequency variation in spatial information; the LSTM layer is suitable for modeling time information; and the DNN layer is used to map data into a more separable space. Our C-LSTM method also achieves nearly perfect anomaly detection performance for web traffic data, even for very similar signals that were previously considered to be very difficult to classify. Finally, the C-LSTM method outperforms other state-of-the-art machine learning techniques on Yahoo's well-known Webscope S5 dataset, achieving an overall accuracy of 98.6% and recall of 89.7% on the test dataset.</description><subject>Anomalies</subject><subject>Anomaly detection</subject><subject>Artificial intelligence</subject><subject>Artificial neural networks</subject><subject>C-LSTM</subject><subject>Computer networks</subject><subject>Deep learning</subject><subject>Feature extraction</subject><subject>Frequency variation</subject><subject>Internet</subject><subject>Machine learning</subject><subject>Modelling</subject><subject>Neural networks</subject><subject>Spatial data</subject><subject>System effectiveness</subject><subject>Time series</subject><subject>Traffic information</subject><subject>Web services</subject><subject>Web traffic</subject><subject>Websites</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAURS0EEqHwB5giMSc8x05sIxZU8SUVMVDEaDnOM3Jok2InVP33pCoz013uue_pEHJJIadAq-s2x7g1eQFU5sBzAH5EEioFyyqh2DFJQJUi41TwU3IWYwtABYBIyM0H1ukQjHPepqbr12a1Sxsc0A6-79Ix-u4znWeLt-VL2uEYzGqKYduHr3hOTpxZRbz4yxl5f7hfzp-yxevj8_xukVmm-JAJKWQDlNVUgnDMqLqAkjHHpFCVapS1RtSMS16y0nBuLRUOGusqpHWDNbIZuTrsbkL_PWIcdNuPoZtO6gIkkyALwadWcWjZ0McY0OlN8GsTdpqC3jvSrd470ntHGrieHE3Q7QHC6f8fj0FH67Gz2PgwGdBN7__DfwGkp26h</recordid><startdate>20180915</startdate><enddate>20180915</enddate><creator>Kim, Tae-Young</creator><creator>Cho, Sung-Bae</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20180915</creationdate><title>Web traffic anomaly detection using C-LSTM neural networks</title><author>Kim, Tae-Young ; Cho, Sung-Bae</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c394t-7878d013b1807f3a9b20533f387969d9cca7b3484535a44cc17f0dcf6e1bdebe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Anomalies</topic><topic>Anomaly detection</topic><topic>Artificial intelligence</topic><topic>Artificial neural networks</topic><topic>C-LSTM</topic><topic>Computer networks</topic><topic>Deep learning</topic><topic>Feature extraction</topic><topic>Frequency variation</topic><topic>Internet</topic><topic>Machine learning</topic><topic>Modelling</topic><topic>Neural networks</topic><topic>Spatial data</topic><topic>System effectiveness</topic><topic>Time series</topic><topic>Traffic information</topic><topic>Web services</topic><topic>Web traffic</topic><topic>Websites</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kim, Tae-Young</creatorcontrib><creatorcontrib>Cho, Sung-Bae</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kim, Tae-Young</au><au>Cho, Sung-Bae</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Web traffic anomaly detection using C-LSTM neural networks</atitle><jtitle>Expert systems with applications</jtitle><date>2018-09-15</date><risdate>2018</risdate><volume>106</volume><spage>66</spage><epage>76</epage><pages>66-76</pages><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•We propose a C-LSTM neural network for effectively detecting anomalies in web traffic data.•CNN extracts spatial features and LSTM models temporal characteristics.•It outperforms the machine learning methods for Yahoo's Webscope S5 dataset.•We reveal the internal operation of anomaly detection process by t-SNE algorithm. Web traffic refers to the amount of data that is sent and received by people visiting online websites. Web traffic anomalies represent abnormal changes in time series traffic, and it is important to perform detection quickly and accurately for the efficient operation of complex computer networks systems. In this paper, we propose a C-LSTM neural network for effectively modeling the spatial and temporal information contained in traffic data, which is a one-dimensional time series signal. We also provide a method for automatically extracting robust features of spatial-temporal information from raw data. Experiments demonstrate that our C-LSTM method can extract more complex features by combining a convolutional neural network (CNN), long short-term memory (LSTM), and deep neural network (DNN). The CNN layer is used to reduce the frequency variation in spatial information; the LSTM layer is suitable for modeling time information; and the DNN layer is used to map data into a more separable space. Our C-LSTM method also achieves nearly perfect anomaly detection performance for web traffic data, even for very similar signals that were previously considered to be very difficult to classify. Finally, the C-LSTM method outperforms other state-of-the-art machine learning techniques on Yahoo's well-known Webscope S5 dataset, achieving an overall accuracy of 98.6% and recall of 89.7% on the test dataset.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2018.04.004</doi><tpages>11</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2018-09, Vol.106, p.66-76
issn 0957-4174
1873-6793
language eng
recordid cdi_proquest_journals_2083808274
source Elsevier ScienceDirect Journals
subjects Anomalies
Anomaly detection
Artificial intelligence
Artificial neural networks
C-LSTM
Computer networks
Deep learning
Feature extraction
Frequency variation
Internet
Machine learning
Modelling
Neural networks
Spatial data
System effectiveness
Time series
Traffic information
Web services
Web traffic
Websites
title Web traffic anomaly detection using C-LSTM neural networks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T05%3A05%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Web%20traffic%20anomaly%20detection%20using%20C-LSTM%20neural%20networks&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Kim,%20Tae-Young&rft.date=2018-09-15&rft.volume=106&rft.spage=66&rft.epage=76&rft.pages=66-76&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2018.04.004&rft_dat=%3Cproquest_cross%3E2083808274%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2083808274&rft_id=info:pmid/&rft_els_id=S0957417418302288&rfr_iscdi=true