Elixir: A system to enhance data quality for multiple analytics on a video stream
IoT sensors, especially video cameras, are ubiquitously deployed around the world to perform a variety of computer vision tasks in several verticals including retail, healthcare, safety and security, transportation, manufacturing, etc. To amortize their high deployment effort and cost, it is desirab...
Gespeichert in:
Hauptverfasser: | , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | IoT sensors, especially video cameras, are ubiquitously deployed around the
world to perform a variety of computer vision tasks in several verticals
including retail, healthcare, safety and security, transportation,
manufacturing, etc. To amortize their high deployment effort and cost, it is
desirable to perform multiple video analytics tasks, which we refer to as
Analytical Units (AUs), off the video feed coming out of every camera. In this
paper, we first show that in a multi-AU setting, changing the camera setting
has disproportionate impact on different AUs performance. In particular, the
optimal setting for one AU may severely degrade the performance for another AU,
and further the impact on different AUs varies as the environmental condition
changes. We then present Elixir, a system to enhance the video stream quality
for multiple analytics on a video stream. Elixir leverages Multi-Objective
Reinforcement Learning (MORL), where the RL agent caters to the objectives from
different AUs and adjusts the camera setting to simultaneously enhance the
performance of all AUs. To define the multiple objectives in MORL, we develop
new AU-specific quality estimator values for each individual AU. We evaluate
Elixir through real-world experiments on a testbed with three cameras deployed
next to each other (overlooking a large enterprise parking lot) running Elixir
and two baseline approaches, respectively. Elixir correctly detects 7.1%
(22,068) and 5.0% (15,731) more cars, 94% (551) and 72% (478) more faces, and
670.4% (4975) and 158.6% (3507) more persons than the default-setting and
time-sharing approaches, respectively. It also detects 115 license plates, far
more than the time-sharing approach (7) and the default setting (0). |
---|---|
DOI: | 10.48550/arxiv.2212.04061 |