Utilizing the Parallel Processing Power of GPUs in Database and Information Retrieval

A collection of related papers and resources
(collected by Pedram Ghodsnia)


Videos
Links
Papers





Videos

nvidia super computer GPU #1nvidia super computer GPU #1 NVIDIA Tesla C1060 Computing ProcessorNVIDIA Tesla C1060 Computing Processor Introducing the World's Fastest 1U ServerIntroducing the World's Fastest 1U Server Tesla Personal SupercomputerTesla Personal Supercomputer Tesla SupercomputingTesla Supercomputing The history of CUDAThe history of CUDA Computing on the GPUComputing on the GPU Scalable Parallel Programming with CUDA on Manycore GPUsScalable Parallel Programming with CUDA on Manycore GPUs The Role of Accelerated Computing in the Multi-Core EraThe Role of Accelerated Computing in the Multi-Core Era Core i7 X58 SuperComputer demoCore i7 X58 SuperComputer demo The PeakStream Platform for Many-Core ComputingThe PeakStream Platform for Many-Core ComputingEmbed YouTube Video by VideoLightBox.com v1.8 Parallel Programming and Problem Solving with CUDAParallel Programming and Problem Solving with CUDAEmbed YouTube Video by VideoLightBox.com v1.8

Super Computing 2010 Analyst Crossfire - Supercomputing Wrap-up Show

Links

Intro to CUDA
NVIDIA Website
CUDA Zone
CUDA SDK and other Resources
CUDA Programming Guide (version 3.0)
CUDA Programming Guide (version 4.0)
CUDA Reference Manual (version 3.0)
CUDA Reference Manual (version 4.0)
CUDA Memory Optimization
Introduction to OpenCL
Tesla Personal Supercomputer
GeForce GTX 480
NVIDIA Parallel Nsight (GPU Computing into Visual Studio 2008)
Using CUDA in Data Mining, Analytics, and Databases
GPU Technology Conference 2009 - Recorded Sessions
Introduction to cuda
CUDA GPU Computing Forum
IAP09 CUDA @ MIT / 6.963
Sorting Networks Tutorial
Sort Benchmark Home Page (New)
Sort Benchmark Home Page (Old)
High performance GPU radix sorting in CUDA
Opening keynote with Jen-Hsun Huang, NVIDIA's CEO
GTC 2010 - Recorded Sessions
Parallel Nsight™ 1.5 User Guide
Syllabus for the CUDA Certification Exam
High performance GPU radix sorting in CUDA
GPU Cloud Computing - Amazon EC2
Tesla 20-Series Products
Dirac Testbed Reveals How Applications are Written
http://sigttou.com/cuda-dev-windows
Install Nvidia CUDA in Ubuntu 10.04
Asymmetric Distributed Shared Memory
Large-scale multi-dimensional document clustering on GPU clusters
Efficient Relational Database Management using Graphics Processors
Parallel Data Compression with BZIP2
Bandwidth Test (by Pedram)
Maximizing GPU efficiency by using strerams
Hello World using CUDA
TCC Driver and problem of detecting Tesla C2050
NVIDIA SMI Computing Modes
MapReduce bibTeX
CUDPP bibTeX
CUDA 4.0 QA
CUDA 4.0 key new features
Maximizing GPU Efficiency in Extreme Throughput Applications
Multi-GPU optimization
Modeling GPU-CPU Workloads and Systems
Compiling CUDA projects in Visal Studio 2008 - Common problems
Compiling CUDA projects in Visal Studio 2010- Common problems
Compiling CUDA projects in Visal Studio 2010
CUDA Compute Visual Profiler
NVIDIA CUDA SDK 4.0 RC Distribution Does Not Contain Cutil 64 & 32bit Lib Files
GTC On-Demand
NVIDIA GPUDirect™
CUDA 5
NVIDIA GeForce GTX 680
Nvidia 'Big Kepler' GK-110 Announced for SuperComputers
Inside Kepler
Fermi Whitepaper
NVIDIA @ SC12 - GPU Technology Theater - Inside the Kepler Architecture
Tesla Kepler Family Product Overview
NVIDIA Tesla K20 benchmark: facts, figures and some conclusions

Papers (Ordered by Year)

Title Authors Venue Year bib
Fast Computation of Database Operations using Graphics Processors Naga K. Govindaraju, Brandon Lloyd, Wei Wang, Ming C. Lin, Dinesh Manocha SIGMOD Conference 2004 bib
GPUTeraSort: high performance graphics co-processor sorting for large database management Naga K. Govindaraju, Jim Gray, Ritesh Kumar, Dinesh Manocha SIGMOD Conference 2006 bib
GPUQP: query co-processing using graphics processors Rui Fang, Bingsheng He, Mian Lu, Ke Yang, Naga K. Govindaraju, Qiong Luo, Pedro V. Sander SIGMOD Conference 2007 bib
Database Optimizations for Modern Hardware Cieslewicz, J., Ross, K.A.; Proceedings of the IEEE 2008 bib
High-Speed Private Information Retrieval Computation on GPU Carlos Aguilar Melchor, Benot Crespin, Philippe Gaborit, Vincent Jolivet, Pierre Rousseau SECURWARE 2008 bib
Mars: a MapReduce framework on graphics processors Bingsheng He, Wenbin Fang, Qiong Luo, Naga K. Govindaraju, Tuyong Wang PACT 2008 bib
Relational joins on graphics processors Bingsheng He, Ke Yang, Rui Fang, Mian Lu, Naga K. Govindaraju, Qiong Luo, Pedro V. Sander SIGMOD Conference 2008 bib
Using graphics processors for high-performance IR query processing Shuai Ding, Jinru He, Hao Yan, Torsten Suel WWW 2008 bib
GPU-Quicksort: A practical Quicksort algorithm for graphics processors Daniel Cederman, Philippas Tsigas ACM Journal of Experimental Algorithmics 2009 bib
Data Parallel Bin-Based Indexing for Answering Queries on Multi-core Architectures Luke J. Gosink, Kesheng Wu, E. Wes Bethel, John D. Owens, Kenneth I. Joy SSDBM 2009 bib
An Empirically Optimized Radix Sort for GPU Bonan Huang, Jinlan Gao, Xiaoming Li International Symposium on Parallel and Distributed Processing with Applications 2009 bib
Designing efficient sorting algorithms for manycore GPUs Nadathur Satish, Mark Harris, Michael Garland IPDPS 2009 bib
Sorting on architecturally diverse computer systems Roger D. Chamberlain, Narayan Ganesan SC-HPRCTA 2009 bib
Relational query coprocessing on graphics processors Bingsheng He, Mian Lu, Ke Yang, Rui Fang, Naga K. Govindaraju, Qiong Luo, Pedro V. Sander ACM Trans. Database Syst. 2009 bib
Fast Four-Way Parallel Radix Sorting on GPUs Linh K. Ha, Jens Kruger, Claudio T. Silva Comput. Graph. Forum 2009 bib
A Batched GPU Algorithm for Set Intersection Di Wu, Fan Zhang, Naiyong Ao, Fang Wang, Xiaoguang Liu, Gang Wang ISPAN 2009 bib
On efficient posting list intersection with multicore processors Shirish Tatikonda, Flavio Junqueira, Berkant Barla Cambazoglu, Vassilis Plachouras SIGIR 2009 bib
Revisiting sorting for GPGPU stream architectures Duane Merrill, Andrew S. Grimshaw PACT 2010 bib
Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort Nadathur Satish, Changkyu Kim, Jatin Chhugani, Anthony D. Nguyen, Victor W. Lee, Daehyun Kim, Pradeep Dubey SIGMOD Conference 2010 bib
FAST: fast architecture sensitive tree search on modern CPUs and GPUs Changkyu Kim, Jatin Chhugani, Nadathur Satish, Eric Sedlar, Anthony D. Nguyen, Tim Kaldewey, Victor W. Lee, Scott A. Brandt, Pradeep Dubey SIGMOD Conference 2010 bib
Exploring utilisation of GPU for database applications Slawomir Walkowiak, Konrad Wawruch, Marita Nowotka, Lukasz Ligowski, Witold Rudnicki Procedia CS 2010 bib
Accelerating SQL database operations on a GPU with CUDA Peter Bakkum, Kevin Skadron GPGPU 2010 bib
GPU support for batch oriented workloads Lauro B. Costa, Samer Al-Kiswany, Matei Ripeanu IPCCC 2009 bib
Ubiquitous Parallel Computing from Berkeley, Illinois, and Stanford Bryan C. Catanzaro, Armando Fox, Kurt Keutzer, David A. Patterson, Bor-Yiing Su, Marc Snir, Kunle Olukotun, Pat Hanrahan, Hassan Chafi IEEE Micro 2010 bib
CUDA Compression Final Report Leiwen Wu, Mark Storus and David Cross Technical report 2009 bib
MITHRA: Multiple data independent tasks ona heterogeneous resource architecture Reza Farivar, Abhishek Verma, Ellick Chan, Roy H. Campbell CLUSTER 2009 bib
DisMaRC: A Distributed Map Reduce framework on CUDA Alok Mooley, Karthik Murthy, Harshdeep Singh Technical report 2009 bib
A map reduce framework for programming graphics processors Bryan Catanzaro, Narayanan Sundaram, Kurt Keutzer In Workshop on Software Tools for MultiCore Systems 2008 bib
Multi-GPU MapReduce on GPU Clusters Jeffery Stuart, John D. Owens IPDPS 2011 2011
Mars: Accelerating MapReduce with Graphics Processors Wenbin Fang and Bingsheng He and Qiong Luo and Naga K. Govindaraju IEEE Transactions on Parallel and Distributed Systems 2011 bib
Database Compression on Graphics Processors Wenbin Fang, Bingsheng He, Qiong Luo PVLDB 2010 bib
LEEN: Locality/Fairness-Aware Key Partitioning for MapReduce in the Cloud Shadi Ibrahim, Hai Jin, Lu Lu, Song Wu, Bingsheng He, Li Qi CloudCom 2010 bib
High-Throughput Transaction Executions on Graphics Processors Bingsheng He, Jeffrey Xu Yu VLDB 2011






InfiniBand* Architecture
InfiniBand