Multi-Modal Conformal Prediction Regions with Simple Structures by Optimizing Convex Shape Templates
Conformal prediction is a statistical tool for producing prediction regions for machine learning models that are valid with high probability. A key component of conformal prediction algorithms is a \emph{non-conformity score function} that quantifies how different a model's prediction is from t...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Conformal prediction is a statistical tool for producing prediction regions
for machine learning models that are valid with high probability. A key
component of conformal prediction algorithms is a \emph{non-conformity score
function} that quantifies how different a model's prediction is from the
unknown ground truth value. Essentially, these functions determine the shape
and the size of the conformal prediction regions. While prior work has gone
into creating score functions that produce multi-model prediction regions, such
regions are generally too complex for use in downstream planning and control
problems. We propose a method that optimizes parameterized \emph{shape template
functions} over calibration data, which results in non-conformity score
functions that produce prediction regions with minimum volume. Our approach
results in prediction regions that are \emph{multi-modal}, so they can properly
capture residuals of distributions that have multiple modes, and
\emph{practical}, so each region is convex and can be easily incorporated into
downstream tasks, such as a motion planner using conformal prediction regions.
Our method applies to general supervised learning tasks, while we illustrate
its use in time-series prediction. We provide a toolbox and present
illustrative case studies of F16 fighter jets and autonomous vehicles, showing
an up to $68\%$ reduction in prediction region area compared to a circular
baseline region. |
---|---|
DOI: | 10.48550/arxiv.2312.07434 |