GPU-STREAM ========== Measure memory transfer rates to/from global device memory on GPUs. This benchmark is similar in spirit, and based on, the STREAM benchmark [1] for CPUs. Unlike other GPU memory bandwidth benchmarks this does *not* include the PCIe transfer time. Usage ----- Build the OpenCL and CUDA binaries with `make` Run the OpenCL version with `./gpu-stream-ocl` and the CUDA version with `./gpu-stream-cuda` Results ------- Sample results can be found in the `results` subdirectory. If you would like to submit updated results, please submit a Pull Request. [1]: McCalpin, John D., 1995: "Memory Bandwidth and Machine Balance in Current High Performance Computers", IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, December 1995.