James Price
856a520687
Merge pull request #10 from sunway513/pull-request-HIP
...
Pull request hip
2016-10-21 12:53:50 +01:00
Tom Deakin
823e12708f
Add dot kernel to Kokkos
2016-10-21 10:58:26 +01:00
Tom Deakin
7203f96f6b
Restructure results directory
2016-10-20 17:35:22 +01:00
Ruyman Reyes
d562283cde
Minor performance tuning for SYCL benchmark
...
* Pre-compiling kernel binaries when setting up the benchmark,
like OpenCL equivalent
* Using the linear access syntax for buffers
2016-10-18 13:09:19 +01:00
Tom Deakin
d3b497a9ca
Add a CUDA dot kernel
2016-10-14 17:51:40 +01:00
Tom Deakin
2085cacea0
Add an OpenCL dot kernel
...
We have to name the kernel stream_dot (for example) because the
"dot" kernel already exists.
2016-10-14 17:07:55 +01:00
Tom Deakin
8a100f07b4
Add dot kernel to OpenMP 4.5 - tested with clang-ykt
2016-10-14 15:19:25 +01:00
Tom Deakin
abe423ac6b
Implement dot kernel in OpenMP 3
2016-10-14 15:05:06 +01:00
Tom Deakin
08fe695d51
Fix typo in main file
2016-10-14 15:04:04 +01:00
Tom Deakin
275bfb2066
Check result of the final reduction
2016-10-14 14:45:28 +01:00
Tom Deakin
04ca357159
Call the Dot kernel and print out results
2016-10-14 14:40:28 +01:00
Tom Deakin
0ef9b6691b
Implement the reduction in OpenACC
2016-10-14 14:40:08 +01:00
Tom Deakin
614613e7d4
Add the dot routine to the abstract class
2016-10-14 14:39:48 +01:00
Ruyman Reyes
23a43bfa6d
Adding support for ComputeCpp CE
...
This patch updates the CMake building options to support
the ComputeCpp Community Edition 0.1, including the FindComputeCpp
CMake module provided with the ComputeCpp SDK.
In order to build with ComputeCpp, only the standard CMake flags from
the SDK are required:
cmake ../
-DHAS_SYCL=ON \
-DCOMPUTECPP_PACKAGE_ROOT_DIR=/path/to/computecpp/package \
-DCMAKE_MODULE_PATH=GPU-STREAM/cmake/Modules/
2016-10-12 17:51:41 +01:00
pensun
300ccd01d9
move hip_runtime.h after copyright info
2016-10-12 10:41:50 -05:00
pensun
f9fea7f8b5
Add performance results for HIP version.
2016-09-06 11:19:33 -05:00
pensun
a1f9d9ece7
Add support of HIP version of GPU-STREAM.
...
This commit was tested with HIP developer preview branch.
2016-09-05 23:41:01 -05:00
Tom Deakin
21f0bcfde4
Add v2.0 results
2016-08-26 15:54:22 +01:00
Tom Deakin
2352b6cc42
Move v1 results into subdirectory
2016-08-26 15:53:21 +01:00
Tom Deakin
4e3ba0639e
Merge branch 'master' of github.com:UoB-HPC/GPU-STREAM
2016-07-19 15:46:17 +01:00
Tom Deakin
39e51246b0
Update citation
2016-07-19 15:46:08 +01:00
James Price
74a4a3b0bd
[SYCL] Set WGSIZE to more sensible value for AMD Fiji
2016-07-07 09:40:16 +01:00
James Price
f94e36f320
[CUDA] Fix device name output (OpenCL->CUDA)
2016-07-06 17:16:35 +01:00
James Price
a2778f523e
[OCL] Print build log if kernel compilation fails
2016-07-06 14:11:29 +01:00
James Price
1ec98b4496
Use 'Release' if CMAKE_BUILD_TYPE not specified
2016-07-04 15:46:00 +01:00
James Price
525fb4d5e2
Fix out-of-tree builds
2016-06-30 17:04:23 +01:00
Tom Deakin
eda9d56ed9
Merge branch 'refactor'
2016-06-30 16:51:40 +01:00
Tom Deakin
04f321db85
Replace cudaDeviceSyncronise with Kokkos::fence() function
2016-06-30 16:32:52 +01:00
Tom Deakin
8be4b26bb1
Add restrict to extra pointers in OpenACC
2016-06-30 16:24:32 +01:00
Tom Deakin
2033f3f0c6
Break RAJA build but fix all the others..
2016-05-12 16:37:06 +01:00
Tom Deakin
2001ab5fb1
Build against a RAJA installation in the CMake build system
2016-05-12 15:40:22 +01:00
Tom Deakin
942188d836
Add copyright header to source with it missing
2016-05-12 12:53:26 +01:00
Tom Deakin
d75084b753
Fix Kokkos CMake so it works..
2016-05-12 12:35:47 +01:00
Tom Deakin
2381f059ed
Set KOKKOS_PATH to build Kokkos version
2016-05-12 12:31:16 +01:00
Tom Deakin
88d194b75c
Use a variable to get Kokkos Path
2016-05-12 12:30:35 +01:00
Tom Deakin
f6fca3ac06
Add Kokkos building to CMake config
2016-05-12 12:30:06 +01:00
James Price
3ebad06bd4
[SYCL] Fix detection of CL/sycl.hpp for C++14 versions
2016-05-11 22:22:20 +01:00
James Price
7cd14f480d
[SYCL] Auto-detect presence of CL/sycl.hpp and ComputeCpp
2016-05-11 22:00:04 +01:00
Tom Deakin
d4e74a88e9
Add binary names to gitignore
2016-05-11 17:53:33 +01:00
Tom Deakin
5638cbb283
Check for OpenMP support and build OMP3 version
2016-05-11 17:49:48 +01:00
Tom Deakin
bf9c6fb6cd
Add -fopenacc flag on linking with GCC compiler
2016-05-11 17:21:52 +01:00
Tom Deakin
d420032c66
Remove warning about iteration count when using floats as new data values work for 100 iterations
2016-05-11 17:15:43 +01:00
Tom Deakin
9449e08886
update readme
2016-05-11 16:23:14 +01:00
Tom Deakin
494e89d16b
Add placeholder banners for CMake build systems to fix
2016-05-11 16:02:34 +01:00
Tom Deakin
9b2a586e08
Add rule to build OMP4.5 on Cray
2016-05-11 15:57:39 +01:00
Tom Deakin
eae8da57ac
Delete commented out C++ flag for OpenACC as no longer needed
2016-05-11 15:57:20 +01:00
Tom Deakin
31cb567e21
Switch data from 1.0, 2.0 and 3.0 to 0.1, 0.2, and 0.3 resp.
...
Using integers for maths gets unstable past 38 interations even
in double precision. Using the original values/10 is safe up to
the default 100 iterations.
2016-05-11 15:51:19 +01:00
Tom Deakin
55a858e0c0
Use 2^25 as default size because 2^26 gives too many thread blocks for CUDA
2016-05-11 15:43:52 +01:00
Tom Deakin
4954ef7cf0
Add map clauses to OpenMP 4.5 kernels
2016-05-11 15:17:06 +01:00
Tom Deakin
eb10c716f2
First attempt at OpenMP 4.5
2016-05-11 15:08:08 +01:00