NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models

Structured pruning methods have proven effective in reducing the model size and accelerating inference speed in various network architectures such as Transformers. Despite the versatility of encoder-decoder models in numerous NLP tasks, the structured pruning methods on such models are relatively le...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Ko, Jongwoo, Park, Seungjoon, Kim, Yujin, Ahn, Sumyeong, Chang, Du-Seong, Ahn, Euijai, Yun, Se-Young
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!