acm-header
Sign In

Communications of the ACM

ACM TechNews

Google's Big-Data Tool, Mesa, Holds Petabytes of Data Across Multiple Servers


View as: Print Mobile App Share:
distributed data storage, illustration

Credit: iStockPhoto.com

Google says its big-data architecture, Mesa, can store petabytes of data, update millions of rows of data per second, and field trillions of queries daily across multiple servers, enabling continuous operation of the data warehouse even if a data center fails. "Mesa ingests data generated by upstream services, aggregates and persists the data internally, and serves the data via user queries," note Google researchers. They say Mesa was originally constructed to house and analyze critical measurement data for Google's Internet advertising business, but the technology could be applicable to other, similar data warehouse tasks.

Mesa is dependent on other Google technologies, such as the Colossus distributed file system, the BigTable distributed data storage system, and the MapReduce data analysis framework. Google engineers implemented Paxos, a distributed synchronization protocol, to help address query consistency issues. Mesa also can operate on generic servers, making costly specialized hardware unnecessary and enabling Mesa to be run as a cloud service with the advantage of scalability.

Consultant Curt Monash suggests Mesa may face limited future commercial prospects, and he recommends enterprises seek commercial or open source options to maintain data warehouse consistency across data centers before adopting Google's offerings. He says the majority of new data stores under development boast some form of multi-version currency control.

From IDG News Service
View Full Article

 

Abstracts Copyright © 2014 Information Inc., Bethesda, Maryland, USA


 

No entries found

Sign In for Full Access
» Forgot Password? » Create an ACM Web Account