Are data synopses — such as the hash-based sketches discussed by Li and König — still needed for querying massive...Peter J. Haas From Communications of the ACM | August 2011
Efficient (approximate) computation of set similarity in very large datasets is a common task with many applications inminwise hashing...Ping Li, Arnd Christian König From Communications of the ACM | August 2011
The emergence of wimpy processors and FLASH met a promising deployment scenario in the field of large-scale data centers. The energy efficiency potential of these...Luiz André Barroso From Communications of the ACM | July 2011
This paper presents a fast array of wimpy nodes — FAWN — an approach for achieving low-power data-intensive data-center computing.
David G. Andersen, Jason Franklin, Michael Kaminsky, Amar Phanishayee, Lawrence Tan, Vijay Vasudevan From Communications of the ACM | July 2011
Dremel is a scalable, interactive ad hoc query system for analysis of read-only nested data. By combining multilevel execution trees and columnar data layout, it...Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geoffrey Romer, Shiva Shivakumar, Matt Tolton, Theo Vassilakis From Communications of the ACM | June 2011
The importance of data analysis has never been clearer. Globe-spanning scientific collaborations are exploring...Michael J. Franklin From Communications of the ACM | June 2011
The interaction between computation and logic goes back to the beginnings of computer science with the development of computability theory...Phokion G. Kolaitis From Communications of the ACM | June 2011
We give a logical characterization of the polynomial-time properties of graphs with excluded minors.Martin Grohe From Communications of the ACM | June 2011
CDOs are examples of financial derivatives, with a value that depends on the underlying assets with which they are linked. These kinds of complex financial products...David C. Parkes From Communications of the ACM | May 2011
Securitization of cash flows using financial derivatives transformed the financial industry over the last three decades. Derivatives...Sanjeev Arora, Boaz Barak, Markus Brunnermeier, Rong Ge From Communications of the ACM | May 2011
A system for musical accompaniment is presented in which a computer-driven orchestra follows and learns from a soloist in a concerto-like setting. The system's...Christopher Raphael From Communications of the ACM | March 2011
In the opening of Sibelius' Violin Concerto, a soloist plays delicately. The orchestra responds in kind. As...Juan Bello, Yann LeCun, Robert Rowe From Communications of the ACM | March 2011
While a large body of work exists on DRAM in lab conditions, little has been reported on real DRAM failures in large production clusters. In this paper, we analyze...Bianca Schroeder, Eduardo Pinheiro, Wolf-Dietrich Weber From Communications of the ACM | February 2011
In order to advance the field, knowledge of the types of memory errors at the system level, their frequencies, and conditions that exacerbate or are unrelated to...Norman P. Jouppi From Communications of the ACM | February 2011
Privacy Integrated Queries (PINQ) is an extensible data analysis platform designed to provide unconditional privacy guarantees for the records of the underlying...Frank McSherry From Communications of the ACM | September 2010
Government agencies worldwide release statistical information about population, education, and health, crime...Johannes Gehrke From Communications of the ACM | September 2010
Recent challenges organized by DARPA have induced a significant advance in technology for autopilots for cars; similar to those already used in aircraft and marine...Sebastian Thrun From Communications of the ACM | April 2010
Sebastian Thrun gives us a glimpse into the design and implementation of two winning DARPA grand challenge entries...Leslie Pack Kaelbling From Communications of the ACM | April 2010
Customer preferences for products are drifting over time. Product perception and popularity are constantly changing as new selection emerges. Similarly, customer...Yehuda Koren From Communications of the ACM | April 2010
The past decade has seen an explosion of interest in machine learning and data mining, with significant advances in terms of...Padhraic Smyth, Charles Elkan From Communications of the ACM | April 2010