Insights and Current Gaps in Open-Source LLM Vulnerability Scanners: A Comparative Analysis
This report presents a comparative analysis of open-source vulnerability scanners for conversational large language models (LLMs). As LLMs become integral to various applications, they also present potential attack surfaces, exposed to security risks such as information leakage and jailbreak attacks...
Gespeichert in:
Hauptverfasser: | , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This report presents a comparative analysis of open-source vulnerability
scanners for conversational large language models (LLMs). As LLMs become
integral to various applications, they also present potential attack surfaces,
exposed to security risks such as information leakage and jailbreak attacks.
Our study evaluates prominent scanners - Garak, Giskard, PyRIT, and
CyberSecEval - that adapt red-teaming practices to expose these
vulnerabilities. We detail the distinctive features and practical use of these
scanners, outline unifying principles of their design and perform quantitative
evaluations to compare them. These evaluations uncover significant reliability
issues in detecting successful attacks, highlighting a fundamental gap for
future development. Additionally, we contribute a preliminary labelled dataset,
which serves as an initial step to bridge this gap. Based on the above, we
provide strategic recommendations to assist organizations choose the most
suitable scanner for their red-teaming needs, accounting for customizability,
test suite comprehensiveness, and industry-specific use cases. |
---|---|
DOI: | 10.48550/arxiv.2410.16527 |