Split and shift methodology on cellular processor arrays: area saving versus time penalty

ABSTRACT This paper addresses the so‐called split and shift methodology. This methodology deals with the implementation of kernels of sizes that go above the physically implemented resources (local connections and weighting circuits) on synchronous cellular processor arrays (CPA), including the real...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of circuit theory and applications 2014-03, Vol.42 (3), p.258-295
Hauptverfasser: Fernández, N.A., Brea, V.M., Cabello, D.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:ABSTRACT This paper addresses the so‐called split and shift methodology. This methodology deals with the implementation of kernels of sizes that go above the physically implemented resources (local connections and weighting circuits) on synchronous cellular processor arrays (CPA), including the realization of large neighborhood operations and/or the reduction of the available hardware in order to drop the area consumption. Two main goals are pursued in the development of the methodology, namely: (1) minimum penalty at processing time and (2) absolutely no penalty at functional level. The paper presents different techniques and guidelines for the methodology application and introduces a Figure of Merit to evaluate them by relating area gains with time penalty. This, along with a kernel shape analysis, led us to propose more adequate configurations of weighting circuits and to justify the classical choice of North‐East‐West‐South connectivity. To validate the methodology, we realize several estimates over actual physical implementations, and we propose the realization over CPAs of the spin filters, scale invariant feature transform and speeded‐up robust features algorithms. A more in‐depth trade‐off analysis is realized over the implementation of the pixel level snakes algorithm. Copyright © 2012 John Wiley & Sons, Ltd. This paper addresses the split and shift methodology, which is intended to implement kernels of sizes that go above the physically implemented local connections and weighting circuits on synchronous cellular processor arrays (CPA), either because requiring large neighborhood communications or because of the reduction of the connectivity to save area. To validate the methodology, we realize estimates over actual physical implementations, and we propose the realization over CPAs of the spin filters, scale invariant feature transform, speeded‐up robust features and pixel level snakes algorithms.
ISSN:0098-9886
1097-007X
DOI:10.1002/cta.1850