Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays

Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arr...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-12
Hauptverfasser: Jones, Michael, Kepner, Jeremy, Prout, Andrew, Davis, Timothy, Arcand, William, Bestor, David, Bergeron, William, Byun, Chansup, Gadepally, Vijay, Houle, Micheal, Hubbell, Matthew, Hayden Jananthan, Klein, Anna, Milechin, Lauren, Morales, Guillermo, Mullen, Julie, Patel, Ritesh, Pisharody, Sandeep, Reuther, Albert, Rosa, Antonio, Samsi, Siddharth, Yee, Charles, Michaleas, Peter
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Jones, Michael
Kepner, Jeremy
Prout, Andrew
Davis, Timothy
Arcand, William
Bestor, David
Bergeron, William
Byun, Chansup
Gadepally, Vijay
Houle, Micheal
Hubbell, Matthew
Hayden Jananthan
Klein, Anna
Milechin, Lauren
Morales, Guillermo
Mullen, Julie
Patel, Ritesh
Pisharody, Sandeep
Reuther, Albert
Rosa, Antonio
Samsi, Siddharth
Yee, Charles
Michaleas, Peter
description Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires integrating them into operational systems, which comes with its own unique challenges. This paper describes two examples of real-time operational implementations. First, is an operational GraphBLAS implementation that constructs anonymized hypersparse matrices on a high-bandwidth network tap. Second, is an operational D4M implementation that analyzes daily cloud gateway logs. The architectures of these implementations are presented. Detailed measurements of the resources and the performance are collected and analyzed. The implementations are capable of meeting their operational requirements using modest computational resources (a couple of processing cores). GraphBLAS is well-suited for low-level analysis of high-bandwidth connections with relatively structured network data. D4M is well-suited for higher-level analysis of more unstructured data. This work demonstrates that these technologies can be implemented in operational settings.
doi_str_mv 10.48550/arxiv.2309.02464
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2309_02464</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2861990050</sourcerecordid><originalsourceid>FETCH-LOGICAL-a954-1f725631efb771c8390523e5a384bdbc1d998df0c1942ecc4173668f78d2a553</originalsourceid><addsrcrecordid>eNotkF1PwjAYhRsTEwnyA7yyidfDfm7t5QQFE9BEuF9eulaLsM12oPv3TvDq3Dw5OedB6IaSsVBSknsIP_44ZpzoMWEiFRdowDiniRKMXaFRjFtCCEszJiUfoGZqm13d7W3V4trhNwu7ZO33Fr_Y9rsOn3gdwDlvcF7Bros-4kP01TueBWg-Hhb5Cs-7xobYQIgWL6EN3tiIoSrxVCxxHmNtPLT-aHEeAnTxGl062EU7-s8hWj09rifzZPE6e57kiwS0FAl1_b6UU-s2WUaN4ppIxq0ErsSm3Bhaaq1KRwzVglljBM14miqXqZJBf2yIbs-tJxtFE_weQlf8WSlOVnri7kw0of462NgW2_oQ-pOxYCqlWhMiCf8FHMJk1Q</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2861990050</pqid></control><display><type>article</type><title>Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays</title><source>World Web Journals</source><source>arXiv.org</source><creator>Jones, Michael ; Kepner, Jeremy ; Prout, Andrew ; Davis, Timothy ; Arcand, William ; Bestor, David ; Bergeron, William ; Byun, Chansup ; Gadepally, Vijay ; Houle, Micheal ; Hubbell, Matthew ; Hayden Jananthan ; Klein, Anna ; Milechin, Lauren ; Morales, Guillermo ; Mullen, Julie ; Patel, Ritesh ; Pisharody, Sandeep ; Reuther, Albert ; Rosa, Antonio ; Samsi, Siddharth ; Yee, Charles ; Michaleas, Peter</creator><creatorcontrib>Jones, Michael ; Kepner, Jeremy ; Prout, Andrew ; Davis, Timothy ; Arcand, William ; Bestor, David ; Bergeron, William ; Byun, Chansup ; Gadepally, Vijay ; Houle, Micheal ; Hubbell, Matthew ; Hayden Jananthan ; Klein, Anna ; Milechin, Lauren ; Morales, Guillermo ; Mullen, Julie ; Patel, Ritesh ; Pisharody, Sandeep ; Reuther, Albert ; Rosa, Antonio ; Samsi, Siddharth ; Yee, Charles ; Michaleas, Peter</creatorcontrib><description>Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires integrating them into operational systems, which comes with its own unique challenges. This paper describes two examples of real-time operational implementations. First, is an operational GraphBLAS implementation that constructs anonymized hypersparse matrices on a high-bandwidth network tap. Second, is an operational D4M implementation that analyzes daily cloud gateway logs. The architectures of these implementations are presented. Detailed measurements of the resources and the performance are collected and analyzed. The implementations are capable of meeting their operational requirements using modest computational resources (a couple of processing cores). GraphBLAS is well-suited for low-level analysis of high-bandwidth connections with relatively structured network data. D4M is well-suited for higher-level analysis of more unstructured data. This work demonstrates that these technologies can be implemented in operational settings.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2309.02464</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Arrays ; Bandwidths ; Cloud computing ; Communications traffic ; Computer Science - Networking and Internet Architecture ; Computer Science - Social and Information Networks ; Mathematical analysis ; Network analysis ; Real time operation ; Traffic analysis ; Unstructured data</subject><ispartof>arXiv.org, 2023-12</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27902</link.rule.ids><backlink>$$Uhttps://doi.org/10.48550/arXiv.2309.02464$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.1109/HPEC58863.2023.10363581$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink></links><search><creatorcontrib>Jones, Michael</creatorcontrib><creatorcontrib>Kepner, Jeremy</creatorcontrib><creatorcontrib>Prout, Andrew</creatorcontrib><creatorcontrib>Davis, Timothy</creatorcontrib><creatorcontrib>Arcand, William</creatorcontrib><creatorcontrib>Bestor, David</creatorcontrib><creatorcontrib>Bergeron, William</creatorcontrib><creatorcontrib>Byun, Chansup</creatorcontrib><creatorcontrib>Gadepally, Vijay</creatorcontrib><creatorcontrib>Houle, Micheal</creatorcontrib><creatorcontrib>Hubbell, Matthew</creatorcontrib><creatorcontrib>Hayden Jananthan</creatorcontrib><creatorcontrib>Klein, Anna</creatorcontrib><creatorcontrib>Milechin, Lauren</creatorcontrib><creatorcontrib>Morales, Guillermo</creatorcontrib><creatorcontrib>Mullen, Julie</creatorcontrib><creatorcontrib>Patel, Ritesh</creatorcontrib><creatorcontrib>Pisharody, Sandeep</creatorcontrib><creatorcontrib>Reuther, Albert</creatorcontrib><creatorcontrib>Rosa, Antonio</creatorcontrib><creatorcontrib>Samsi, Siddharth</creatorcontrib><creatorcontrib>Yee, Charles</creatorcontrib><creatorcontrib>Michaleas, Peter</creatorcontrib><title>Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays</title><title>arXiv.org</title><description>Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires integrating them into operational systems, which comes with its own unique challenges. This paper describes two examples of real-time operational implementations. First, is an operational GraphBLAS implementation that constructs anonymized hypersparse matrices on a high-bandwidth network tap. Second, is an operational D4M implementation that analyzes daily cloud gateway logs. The architectures of these implementations are presented. Detailed measurements of the resources and the performance are collected and analyzed. The implementations are capable of meeting their operational requirements using modest computational resources (a couple of processing cores). GraphBLAS is well-suited for low-level analysis of high-bandwidth connections with relatively structured network data. D4M is well-suited for higher-level analysis of more unstructured data. This work demonstrates that these technologies can be implemented in operational settings.</description><subject>Arrays</subject><subject>Bandwidths</subject><subject>Cloud computing</subject><subject>Communications traffic</subject><subject>Computer Science - Networking and Internet Architecture</subject><subject>Computer Science - Social and Information Networks</subject><subject>Mathematical analysis</subject><subject>Network analysis</subject><subject>Real time operation</subject><subject>Traffic analysis</subject><subject>Unstructured data</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNotkF1PwjAYhRsTEwnyA7yyidfDfm7t5QQFE9BEuF9eulaLsM12oPv3TvDq3Dw5OedB6IaSsVBSknsIP_44ZpzoMWEiFRdowDiniRKMXaFRjFtCCEszJiUfoGZqm13d7W3V4trhNwu7ZO33Fr_Y9rsOn3gdwDlvcF7Bros-4kP01TueBWg-Hhb5Cs-7xobYQIgWL6EN3tiIoSrxVCxxHmNtPLT-aHEeAnTxGl062EU7-s8hWj09rifzZPE6e57kiwS0FAl1_b6UU-s2WUaN4ppIxq0ErsSm3Bhaaq1KRwzVglljBM14miqXqZJBf2yIbs-tJxtFE_weQlf8WSlOVnri7kw0of462NgW2_oQ-pOxYCqlWhMiCf8FHMJk1Q</recordid><startdate>20231208</startdate><enddate>20231208</enddate><creator>Jones, Michael</creator><creator>Kepner, Jeremy</creator><creator>Prout, Andrew</creator><creator>Davis, Timothy</creator><creator>Arcand, William</creator><creator>Bestor, David</creator><creator>Bergeron, William</creator><creator>Byun, Chansup</creator><creator>Gadepally, Vijay</creator><creator>Houle, Micheal</creator><creator>Hubbell, Matthew</creator><creator>Hayden Jananthan</creator><creator>Klein, Anna</creator><creator>Milechin, Lauren</creator><creator>Morales, Guillermo</creator><creator>Mullen, Julie</creator><creator>Patel, Ritesh</creator><creator>Pisharody, Sandeep</creator><creator>Reuther, Albert</creator><creator>Rosa, Antonio</creator><creator>Samsi, Siddharth</creator><creator>Yee, Charles</creator><creator>Michaleas, Peter</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20231208</creationdate><title>Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays</title><author>Jones, Michael ; Kepner, Jeremy ; Prout, Andrew ; Davis, Timothy ; Arcand, William ; Bestor, David ; Bergeron, William ; Byun, Chansup ; Gadepally, Vijay ; Houle, Micheal ; Hubbell, Matthew ; Hayden Jananthan ; Klein, Anna ; Milechin, Lauren ; Morales, Guillermo ; Mullen, Julie ; Patel, Ritesh ; Pisharody, Sandeep ; Reuther, Albert ; Rosa, Antonio ; Samsi, Siddharth ; Yee, Charles ; Michaleas, Peter</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a954-1f725631efb771c8390523e5a384bdbc1d998df0c1942ecc4173668f78d2a553</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Arrays</topic><topic>Bandwidths</topic><topic>Cloud computing</topic><topic>Communications traffic</topic><topic>Computer Science - Networking and Internet Architecture</topic><topic>Computer Science - Social and Information Networks</topic><topic>Mathematical analysis</topic><topic>Network analysis</topic><topic>Real time operation</topic><topic>Traffic analysis</topic><topic>Unstructured data</topic><toplevel>online_resources</toplevel><creatorcontrib>Jones, Michael</creatorcontrib><creatorcontrib>Kepner, Jeremy</creatorcontrib><creatorcontrib>Prout, Andrew</creatorcontrib><creatorcontrib>Davis, Timothy</creatorcontrib><creatorcontrib>Arcand, William</creatorcontrib><creatorcontrib>Bestor, David</creatorcontrib><creatorcontrib>Bergeron, William</creatorcontrib><creatorcontrib>Byun, Chansup</creatorcontrib><creatorcontrib>Gadepally, Vijay</creatorcontrib><creatorcontrib>Houle, Micheal</creatorcontrib><creatorcontrib>Hubbell, Matthew</creatorcontrib><creatorcontrib>Hayden Jananthan</creatorcontrib><creatorcontrib>Klein, Anna</creatorcontrib><creatorcontrib>Milechin, Lauren</creatorcontrib><creatorcontrib>Morales, Guillermo</creatorcontrib><creatorcontrib>Mullen, Julie</creatorcontrib><creatorcontrib>Patel, Ritesh</creatorcontrib><creatorcontrib>Pisharody, Sandeep</creatorcontrib><creatorcontrib>Reuther, Albert</creatorcontrib><creatorcontrib>Rosa, Antonio</creatorcontrib><creatorcontrib>Samsi, Siddharth</creatorcontrib><creatorcontrib>Yee, Charles</creatorcontrib><creatorcontrib>Michaleas, Peter</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Database (Proquest)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jones, Michael</au><au>Kepner, Jeremy</au><au>Prout, Andrew</au><au>Davis, Timothy</au><au>Arcand, William</au><au>Bestor, David</au><au>Bergeron, William</au><au>Byun, Chansup</au><au>Gadepally, Vijay</au><au>Houle, Micheal</au><au>Hubbell, Matthew</au><au>Hayden Jananthan</au><au>Klein, Anna</au><au>Milechin, Lauren</au><au>Morales, Guillermo</au><au>Mullen, Julie</au><au>Patel, Ritesh</au><au>Pisharody, Sandeep</au><au>Reuther, Albert</au><au>Rosa, Antonio</au><au>Samsi, Siddharth</au><au>Yee, Charles</au><au>Michaleas, Peter</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays</atitle><jtitle>arXiv.org</jtitle><date>2023-12-08</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Matrix/array analysis of networks can provide significant insight into their behavior and aid in their operation and protection. Prior work has demonstrated the analytic, performance, and compression capabilities of GraphBLAS (graphblas.org) hypersparse matrices and D4M (d4m.mit.edu) associative arrays (a mathematical superset of matrices). Obtaining the benefits of these capabilities requires integrating them into operational systems, which comes with its own unique challenges. This paper describes two examples of real-time operational implementations. First, is an operational GraphBLAS implementation that constructs anonymized hypersparse matrices on a high-bandwidth network tap. Second, is an operational D4M implementation that analyzes daily cloud gateway logs. The architectures of these implementations are presented. Detailed measurements of the resources and the performance are collected and analyzed. The implementations are capable of meeting their operational requirements using modest computational resources (a couple of processing cores). GraphBLAS is well-suited for low-level analysis of high-bandwidth connections with relatively structured network data. D4M is well-suited for higher-level analysis of more unstructured data. This work demonstrates that these technologies can be implemented in operational settings.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2309.02464</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-12
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2309_02464
source World Web Journals; arXiv.org
subjects Arrays
Bandwidths
Cloud computing
Communications traffic
Computer Science - Networking and Internet Architecture
Computer Science - Social and Information Networks
Mathematical analysis
Network analysis
Real time operation
Traffic analysis
Unstructured data
title Deployment of Real-Time Network Traffic Analysis using GraphBLAS Hypersparse Matrices and D4M Associative Arrays
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T15%3A30%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Deployment%20of%20Real-Time%20Network%20Traffic%20Analysis%20using%20GraphBLAS%20Hypersparse%20Matrices%20and%20D4M%20Associative%20Arrays&rft.jtitle=arXiv.org&rft.au=Jones,%20Michael&rft.date=2023-12-08&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2309.02464&rft_dat=%3Cproquest_arxiv%3E2861990050%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2861990050&rft_id=info:pmid/&rfr_iscdi=true