|
|
||
|---|---|---|
| results | ||
| .gitignore | ||
| cl.hpp | ||
| cuda-stream.cu | ||
| LICENSE | ||
| Makefile | ||
| ocl-stream-kernels.cl | ||
| ocl-stream.cpp | ||
| README.md | ||
GPU-STREAM
Measure memory transfer rates to/from global device memory on GPUs. This benchmark is similar in spirit, and based on, the STREAM benchmark [1] for CPUs.
Unlike other GPU memory bandwidth benchmarks this does not include the PCIe transfer time.
Usage
Build the OpenCL and CUDA binaries with make
Run the OpenCL version with ./gpu-stream-ocl and the CUDA version with ./gpu-stream-cuda
Results
Sample results can be found in the results subdirectory. If you would like to submit updated results, please submit a Pull Request.
[1]: McCalpin, John D., 1995: "Memory Bandwidth and Machine Balance in Current High Performance Computers", IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, December 1995.