Sparse aware data storage for inference processing in deep neural network architecture
Systems, apparatuses, and methods may provide techniques to prefetch compressed data and a sparsity bitmap from a memory to store the compressed data in a decode buffer, where the compressed data is associated with a plurality of tensors, where the compressed data is in a compressed format. The tech...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Systems, apparatuses, and methods may provide techniques to prefetch compressed data and a sparsity bitmap from a memory to store the compressed data in a decode buffer, where the compressed data is associated with a plurality of tensors, where the compressed data is in a compressed format. The technique aligns the compressed data with the sparsity bitmap to generate decoded data, and provides the decoded data to a plurality of processing elements.
系统、装置和方法可提供从存储器预取压缩数据和稀疏性位图以将压缩数据存储在解码缓冲器中的技术,其中压缩数据与多个张量相关联,其中压缩数据采用压缩格式。该技术将压缩数据与稀疏性位图对齐,以生成解码后的数据,并且将解码后的数据提供给多个处理元件。 |
---|