Mesa: geo-replicated, near real-time, scalable data warehousing

Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google's Internet advertising business. Mesa is designed to satisfy a complex and challenging set of user and systems requirements, including near real-time data ingestion and queryabilit...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the VLDB Endowment 2014-08, Vol.7 (12), p.1259-1270
Hauptverfasser: Gupta, Ashish, Yang, Fan, Govig, Jason, Kirsch, Adam, Chan, Kelvin, Lai, Kevin, Wu, Shuo, Dhoot, Sandeep Govind, Kumar, Abhilash Rajesh, Agiwal, Ankur, Bhansali, Sanjay, Hong, Mingsheng, Cameron, Jamie, Siddiqi, Masood, Jones, David, Shute, Jeff, Gubarev, Andrey, Venkataraman, Shivakumar, Agrawal, Divyakant
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google's Internet advertising business. Mesa is designed to satisfy a complex and challenging set of user and systems requirements, including near real-time data ingestion and queryability, as well as high availability, reliability, fault tolerance, and scalability for large data and query volumes. Specifically, Mesa handles petabytes of data, processes millions of row updates per second, and serves billions of queries that fetch trillions of rows per day. Mesa is geo-replicated across multiple datacenters and provides consistent and repeatable query answers at low latency, even when an entire datacenter fails. This paper presents the Mesa system and reports the performance and scale that it achieves.
ISSN:2150-8097
2150-8097
DOI:10.14778/2732977.2732999