The Research archive provides access to all Research articles published in past issues of Communications of the ACM.
As GPUs have become mainstream parallel processing engines, many applications targeting GPUs now have data locality more amenable to traditional caching. The architecture described in "Learning Your Limits" has a number of…
This paper studies the effect of accelerating highly parallel workloads with significant locality on a massively multithreaded GPU.