SPIN: Simplifying Polar Invariance for Neural networks Application to vision-based irradiance forecasting
Translational invariance induced by pooling operations is an inherent property of convolutional neural networks, which facilitates numerous computer vision tasks such as classification. Yet to leverage rotational invariant tasks, convolutional architectures require specific rotational invariant laye...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Translational invariance induced by pooling operations is an inherent
property of convolutional neural networks, which facilitates numerous computer
vision tasks such as classification. Yet to leverage rotational invariant
tasks, convolutional architectures require specific rotational invariant layers
or extensive data augmentation to learn from diverse rotated versions of a
given spatial configuration. Unwrapping the image into its polar coordinates
provides a more explicit representation to train a convolutional architecture
as the rotational invariance becomes translational, hence the visually distinct
but otherwise equivalent rotated versions of a given scene can be learnt from a
single image. We show with two common vision-based solar irradiance forecasting
challenges (i.e. using ground-taken sky images or satellite images), that this
preprocessing step significantly improves prediction results by standardising
the scene representation, while decreasing training time by a factor of 4
compared to augmenting data with rotations. In addition, this transformation
magnifies the area surrounding the centre of the rotation, leading to more
accurate short-term irradiance predictions. |
---|---|
DOI: | 10.48550/arxiv.2111.14507 |