Privacy Integrated Queries (PINQ) is an extensible data analysis platform designed to provide unconditional privacy guarantees for the records of the underlying...Frank McSherry From Communications of the ACM | September 2010
Government agencies worldwide release statistical information about population, education, and health, crime...Johannes Gehrke From Communications of the ACM | September 2010
Customer preferences for products are drifting over time. Product perception and popularity are constantly changing as new selection emerges. Similarly, customer...Yehuda Koren From Communications of the ACM | April 2010
The past decade has seen an explosion of interest in machine learning and data mining, with significant advances in terms of...Padhraic Smyth, Charles Elkan From Communications of the ACM | April 2010
Information integration is a key challenge faced by all major organizations, business and governmental ones alike. Two research...Balder ten Cate, Phokion G. Kolaitis From Communications of the ACM | January 2010
When you search for products on Amazon.com, you are seeing results from thousands of vendor databases that were developed before Amazon existed. Did you ever wonder...Alon Halevy From Communications of the ACM | January 2010
ThinSight is a thin form-factor interactive surface technology based on optical sensors embedded inside a regular LCD. These augment the display with the ability...Shahram Izadi, Steve Hodges, Alex Butler, Darren West, Alban Rrustemi, Mike Molloy, William Buxton From Communications of the ACM | December 2009
One of the major innovations in computing was the invention of the graphical user interface at MIT, SRI, and Xerox PARC. The...Pat Hanrahan From Communications of the ACM | December 2009
Machine Learning today offers a broad repertoire of methods for classification and regression. But what if we need to predict complex objects like trees, orderings...Thorsten Joachims, Thomas Hofmann, Yisong Yue, Chun-Nam Yu From Communications of the ACM | November 2009
Interest in machine learning can be traced back to the early days of computer science. Alan Turing himself conjectured that...John Shawe-Taylor From Communications of the ACM | November 2009
Declarative Networking is a programming methodology that enables developers to concisely specify network protocols and services, which are directly compiled to...Boon Thau Loo, Tyson Condie, Minos Garofalakis, David E. Gay, Joseph M. Hellerstein, Petros Maniatis, Raghu Ramakrishnan, Timothy Roscoe, Ion Stoica From Communications of the ACM | November 2009
In science, significant advances are often made when researchers from different communities join forces.Peter Druschel From Communications of the ACM | November 2009
Many data generation processes can be modeled as data streams. While this data may be archived and indexed within a data warehouse, it is also important to process...Graham Cormode, Marios Hadjieleftheriou From Communications of the ACM | October 2009
The database and systems communities have made great progress in developing database systems that allow us to store and query huge amounts of data. Real-time analysis...Johannes Gehrke From Communications of the ACM | October 2009
The task of estimating the number of distinct values (DVs) in a large dataset arises in a wide variety of settings in computer science and elsewhere. We provide...Kevin Beyer, Rainer Gemulla, Peter J. Haas, Berthold Reinwald, Yannis Sismanis From Communications of the ACM | October 2009
Relational systems have made it possible to query large collections of data in a declarative style through languages such as SQL. There is a key component that...Surajit Chaudhuri From Communications of the ACM | October 2009
Existing literature on distributed development in software engineering and other fields discusses various challenges,...Christian Bird, Nachiappan Nagappan, Premkumar Devanbu, Harald Gall, Brendan Murphy From Communications of the ACM | August 2009
This article describes mechanisms for asynchronous collaboration in the context of information visualization, recasting visualizations as not just analytic tools...Jeffrey Heer, Fernanda B. Viégas, Martin Wattenberg From Communications of the ACM | January 2009
Visual analysis, a powerful method for finding and telling stories with data, is moving from research into widespread use.Jock D. Mackinlay From Communications of the ACM | January 2009
Beginning in the early to mid-1980s the relational model of data has dominated the DBMS landscape. Moreover, descendents of...Michael Stonebraker From Communications of the ACM | December 2008