
GPU-STREAM

Measure memory transfer rates to/from global device memory on GPUs. This benchmark is similar in spirit to, and based on, the STREAM benchmark [1] for CPUs.

Unlike other GPU memory bandwidth benchmarks, this one does not include the PCIe transfer time.

Usage

Build the OpenCL and CUDA binaries with make

Run the OpenCL version with ./gpu-stream-ocl and the CUDA version with ./gpu-stream-cuda

Results

Sample results can be found in the results subdirectory. If you would like to submit updated results, please open a pull request.

[1]: McCalpin, John D., 1995: "Memory Bandwidth and Machine Balance in Current High Performance Computers", IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, December 1995.