GPU-STREAM
==========

Measure memory transfer rates to/from global device memory on GPUs.
This benchmark is similar in spirit to, and based on, the STREAM benchmark [1] for CPUs.

Unlike other GPU memory bandwidth benchmarks, this does *not* include the PCIe transfer time.
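As a rough illustration of what "not including the PCIe transfer time" can mean in practice, the sketch below times a single STREAM-style triad kernel with CUDA events. It is not taken from the GPU-STREAM source: the kernel names (`init`, `triad`), the array length, the launch configuration and the scalar value are all assumptions chosen for the example. The arrays are allocated and initialised on the device, and only the kernel itself sits between the two event records, so no host-to-device copy contributes to the reported figure.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical initialisation kernel: fills b and c directly on the device,
// so no host-to-device copy is needed at all.
__global__ void init(double *b, double *c, size_t n)
{
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n) { b[i] = 1.0; c[i] = 2.0; }
}

// STREAM-style triad: a[i] = b[i] + scalar * c[i].
__global__ void triad(double *a, const double *b, const double *c,
                      double scalar, size_t n)
{
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n) a[i] = b[i] + scalar * c[i];
}

int main()
{
    const size_t n = (size_t)1 << 24;   // assumed length: 16M doubles per array
    const int block = 256;              // assumed threads per block
    const unsigned int grid = (unsigned int)((n + block - 1) / block);

    double *a, *b, *c;
    if (cudaMalloc(&a, n * sizeof(double)) != cudaSuccess ||
        cudaMalloc(&b, n * sizeof(double)) != cudaSuccess ||
        cudaMalloc(&c, n * sizeof(double)) != cudaSuccess) {
        fprintf(stderr, "device allocation failed\n");
        return 1;
    }

    init<<<grid, block>>>(b, c, n);
    triad<<<grid, block>>>(a, b, c, 3.0, n);   // untimed warm-up run
    cudaDeviceSynchronize();

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    // Only the kernel is bracketed by the events: device memory traffic only.
    cudaEventRecord(start);
    triad<<<grid, block>>>(a, b, c, 3.0, n);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);

    if (cudaGetLastError() != cudaSuccess) {
        fprintf(stderr, "kernel launch failed\n");
        return 1;
    }

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);    // elapsed time in milliseconds

    // Triad reads b and c and writes a: three arrays of n doubles are moved.
    const double bytes = 3.0 * (double)n * sizeof(double);
    printf("Triad: %.2f GB/s\n", bytes / (ms * 1.0e-3) / 1.0e9);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(a);
    cudaFree(b);
    cudaFree(c);
    return 0;
}
```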

Usage
-----

Build the OpenCL and CUDA binaries with `make`

Run the OpenCL version with `./gpu-stream-ocl` and the CUDA version with `./gpu-stream-cuda`

Results
-------

Sample results can be found in the `results` subdirectory. If you would like to submit updated results, please submit a Pull Request.

[1]: McCalpin, John D., 1995: "Memory Bandwidth and Machine Balance in Current High Performance Computers", IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, December 1995.