AIOps Solutions for Incident Management: Technical Guidelines and A Comprehensive Literature Review
The management of modern IT systems poses unique challenges, necessitating scalability, reliability, and efficiency in handling extensive data streams. Traditional methods, reliant on manual tasks and rule-based approaches, prove inefficient for the substantial data volumes and alerts generated by I...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2024-04 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Remil, Youcef Bendimerad, Anes Mathonat, Romain Kaytoue, Mehdi |
description | The management of modern IT systems poses unique challenges, necessitating scalability, reliability, and efficiency in handling extensive data streams. Traditional methods, reliant on manual tasks and rule-based approaches, prove inefficient for the substantial data volumes and alerts generated by IT systems. Artificial Intelligence for Operating Systems (AIOps) has emerged as a solution, leveraging advanced analytics like machine learning and big data to enhance incident management. AIOps detects and predicts incidents, identifies root causes, and automates healing actions, improving quality and reducing operational costs. However, despite its potential, the AIOps domain is still in its early stages, decentralized across multiple sectors, and lacking standardized conventions. Research and industrial contributions are distributed without consistent frameworks for data management, target problems, implementation details, requirements, and capabilities. This study proposes an AIOps terminology and taxonomy, establishing a structured incident management procedure and providing guidelines for constructing an AIOps framework. The research also categorizes contributions based on criteria such as incident management tasks, application areas, data sources, and technical approaches. The goal is to provide a comprehensive review of technical and research aspects in AIOps for incident management, aiming to structure knowledge, identify gaps, and establish a foundation for future developments in the field. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3031409143</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3031409143</sourcerecordid><originalsourceid>FETCH-proquest_journals_30314091433</originalsourceid><addsrcrecordid>eNqNiksKwjAUAIMgWNQ7PHBdSJPW306KP1AEdV9C-9SU-lLz0evbhQdwNQMzPRYJKZN4ngoxYGPnas65mM5ElsmIlav9qXVwMU3w2pCDm7Gwp1JXSB6OitQdn50u4Yrlg3SpGtiGrjaa0IGiClaQm2dr8YHk9BvhoD1a5YNFOONb42fE-jfVOBz_OGSTzfqa7-LWmldA54vaBEtdKiSXScoXSSrlf9cXmt9Fqw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3031409143</pqid></control><display><type>article</type><title>AIOps Solutions for Incident Management: Technical Guidelines and A Comprehensive Literature Review</title><source>Free E- Journals</source><creator>Remil, Youcef ; Bendimerad, Anes ; Mathonat, Romain ; Kaytoue, Mehdi</creator><creatorcontrib>Remil, Youcef ; Bendimerad, Anes ; Mathonat, Romain ; Kaytoue, Mehdi</creatorcontrib><description>The management of modern IT systems poses unique challenges, necessitating scalability, reliability, and efficiency in handling extensive data streams. Traditional methods, reliant on manual tasks and rule-based approaches, prove inefficient for the substantial data volumes and alerts generated by IT systems. Artificial Intelligence for Operating Systems (AIOps) has emerged as a solution, leveraging advanced analytics like machine learning and big data to enhance incident management. AIOps detects and predicts incidents, identifies root causes, and automates healing actions, improving quality and reducing operational costs. However, despite its potential, the AIOps domain is still in its early stages, decentralized across multiple sectors, and lacking standardized conventions. Research and industrial contributions are distributed without consistent frameworks for data management, target problems, implementation details, requirements, and capabilities. This study proposes an AIOps terminology and taxonomy, establishing a structured incident management procedure and providing guidelines for constructing an AIOps framework. The research also categorizes contributions based on criteria such as incident management tasks, application areas, data sources, and technical approaches. The goal is to provide a comprehensive review of technical and research aspects in AIOps for incident management, aiming to structure knowledge, identify gaps, and establish a foundation for future developments in the field.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial intelligence ; Big Data ; Data management ; Data transmission ; Guidelines ; Literature reviews ; Machine learning ; Taxonomy</subject><ispartof>arXiv.org, 2024-04</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Remil, Youcef</creatorcontrib><creatorcontrib>Bendimerad, Anes</creatorcontrib><creatorcontrib>Mathonat, Romain</creatorcontrib><creatorcontrib>Kaytoue, Mehdi</creatorcontrib><title>AIOps Solutions for Incident Management: Technical Guidelines and A Comprehensive Literature Review</title><title>arXiv.org</title><description>The management of modern IT systems poses unique challenges, necessitating scalability, reliability, and efficiency in handling extensive data streams. Traditional methods, reliant on manual tasks and rule-based approaches, prove inefficient for the substantial data volumes and alerts generated by IT systems. Artificial Intelligence for Operating Systems (AIOps) has emerged as a solution, leveraging advanced analytics like machine learning and big data to enhance incident management. AIOps detects and predicts incidents, identifies root causes, and automates healing actions, improving quality and reducing operational costs. However, despite its potential, the AIOps domain is still in its early stages, decentralized across multiple sectors, and lacking standardized conventions. Research and industrial contributions are distributed without consistent frameworks for data management, target problems, implementation details, requirements, and capabilities. This study proposes an AIOps terminology and taxonomy, establishing a structured incident management procedure and providing guidelines for constructing an AIOps framework. The research also categorizes contributions based on criteria such as incident management tasks, application areas, data sources, and technical approaches. The goal is to provide a comprehensive review of technical and research aspects in AIOps for incident management, aiming to structure knowledge, identify gaps, and establish a foundation for future developments in the field.</description><subject>Artificial intelligence</subject><subject>Big Data</subject><subject>Data management</subject><subject>Data transmission</subject><subject>Guidelines</subject><subject>Literature reviews</subject><subject>Machine learning</subject><subject>Taxonomy</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNiksKwjAUAIMgWNQ7PHBdSJPW306KP1AEdV9C-9SU-lLz0evbhQdwNQMzPRYJKZN4ngoxYGPnas65mM5ElsmIlav9qXVwMU3w2pCDm7Gwp1JXSB6OitQdn50u4Yrlg3SpGtiGrjaa0IGiClaQm2dr8YHk9BvhoD1a5YNFOONb42fE-jfVOBz_OGSTzfqa7-LWmldA54vaBEtdKiSXScoXSSrlf9cXmt9Fqw</recordid><startdate>20240401</startdate><enddate>20240401</enddate><creator>Remil, Youcef</creator><creator>Bendimerad, Anes</creator><creator>Mathonat, Romain</creator><creator>Kaytoue, Mehdi</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240401</creationdate><title>AIOps Solutions for Incident Management: Technical Guidelines and A Comprehensive Literature Review</title><author>Remil, Youcef ; Bendimerad, Anes ; Mathonat, Romain ; Kaytoue, Mehdi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_30314091433</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Artificial intelligence</topic><topic>Big Data</topic><topic>Data management</topic><topic>Data transmission</topic><topic>Guidelines</topic><topic>Literature reviews</topic><topic>Machine learning</topic><topic>Taxonomy</topic><toplevel>online_resources</toplevel><creatorcontrib>Remil, Youcef</creatorcontrib><creatorcontrib>Bendimerad, Anes</creatorcontrib><creatorcontrib>Mathonat, Romain</creatorcontrib><creatorcontrib>Kaytoue, Mehdi</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Database (Proquest)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Remil, Youcef</au><au>Bendimerad, Anes</au><au>Mathonat, Romain</au><au>Kaytoue, Mehdi</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>AIOps Solutions for Incident Management: Technical Guidelines and A Comprehensive Literature Review</atitle><jtitle>arXiv.org</jtitle><date>2024-04-01</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>The management of modern IT systems poses unique challenges, necessitating scalability, reliability, and efficiency in handling extensive data streams. Traditional methods, reliant on manual tasks and rule-based approaches, prove inefficient for the substantial data volumes and alerts generated by IT systems. Artificial Intelligence for Operating Systems (AIOps) has emerged as a solution, leveraging advanced analytics like machine learning and big data to enhance incident management. AIOps detects and predicts incidents, identifies root causes, and automates healing actions, improving quality and reducing operational costs. However, despite its potential, the AIOps domain is still in its early stages, decentralized across multiple sectors, and lacking standardized conventions. Research and industrial contributions are distributed without consistent frameworks for data management, target problems, implementation details, requirements, and capabilities. This study proposes an AIOps terminology and taxonomy, establishing a structured incident management procedure and providing guidelines for constructing an AIOps framework. The research also categorizes contributions based on criteria such as incident management tasks, application areas, data sources, and technical approaches. The goal is to provide a comprehensive review of technical and research aspects in AIOps for incident management, aiming to structure knowledge, identify gaps, and establish a foundation for future developments in the field.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2024-04 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_3031409143 |
source | Free E- Journals |
subjects | Artificial intelligence Big Data Data management Data transmission Guidelines Literature reviews Machine learning Taxonomy |
title | AIOps Solutions for Incident Management: Technical Guidelines and A Comprehensive Literature Review |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T19%3A40%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=AIOps%20Solutions%20for%20Incident%20Management:%20Technical%20Guidelines%20and%20A%20Comprehensive%20Literature%20Review&rft.jtitle=arXiv.org&rft.au=Remil,%20Youcef&rft.date=2024-04-01&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3031409143%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3031409143&rft_id=info:pmid/&rfr_iscdi=true |