KernelWarehouse: Rethinking the Design of Dynamic Convolution
Dynamic convolution learns a linear mixture of n static kernels weighted with their input-dependent attentions, demonstrating superior performance than normal convolution. However, it increases the number of convolutional parameters by n times, and thus is not parameter efficient. This leads to no r...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Dynamic convolution learns a linear mixture of n static kernels weighted with
their input-dependent attentions, demonstrating superior performance than
normal convolution. However, it increases the number of convolutional
parameters by n times, and thus is not parameter efficient. This leads to no
research progress that can allow researchers to explore the setting n>100 (an
order of magnitude larger than the typical setting n |
---|---|
DOI: | 10.48550/arxiv.2406.07879 |