A Cloud-Scale Acceleration Architecture

Hyperscale datacenter providers have struggled to balance the growing need for specialized hardware (efficiency) with the economic benefits of homogeneity (manageability). In this paper we propose a new cloud architecture that uses reconfigurable logic to accelerate both network plane functions and...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE MICRO 2017-06, p.1-1
Hauptverfasser:	Caulfield, Adrian, Chung, Eric, Putnam, Andrew, Angepat, Hari, Fowers, Jeremy, Heil, Stephen, Kim, Joo-Young, Lo, Daniel, Papamichael, Michael, Massengill, Todd, Chiou, Derek, Burger, Doug
Format:	Artikel
Sprache:	eng
Schlagworte:	Acceleration C Computer Systems Organization C.1 Processor Architectures C.1.3 Other Architecture Styles C.1.3.a Adaptable architectures C.3 Special-Purpose and Application-Based Systems C.3.e Reconfigurable hardware C.5 Computer System Implementation C.5.5 Servers Cloud computing Computer architecture Cryptography Field programmable gate arrays Hardware Servers
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Hyperscale datacenter providers have struggled to balance the growing need for specialized hardware (efficiency) with the economic benefits of homogeneity (manageability). In this paper we propose a new cloud architecture that uses reconfigurable logic to accelerate both network plane functions and applications.This Configurable Cloud architecture places a layer of reconfigurable logic (FPGAs) between the network switches and the servers, enabling network flows to be programmably transformed at line rate, enabling acceleration of local applications running on the server, and enabling the FPGAs to communicate directly, at datacenter scale, to harvest remote FPGAs unused by their local servers.We deployed this design over a production server bed, and show how it can be used for both service acceleration and network. This architecture is much more scalable than prior work which used secondary rack-scale networks for inter-FPGA communication. By coupling to the network plane, direct FPGA-to-FPGA messages can be achieved at comparable latency to previous work, without the secondary network. Additionally, the scale of direct inter-FPGA messaging is much larger. The average round-trip latencies observed in our measurements among 24, 1000, and 250,000 machines are under 3, 9, and 20 microseconds, respectively. The Configurable Cloud architecture has been deployed at hyperscale in Microsoft's production datacenters worldwide.
ISSN:	0272-1732 1937-4143
DOI:	10.1109/MM.2017.265085811