Commit Graph

282 Commits

Author SHA1 Message Date
James Price
cfc1aba2c0 Use WGSIZE=256 for dot for compatability with AMD 2016-10-24 12:51:01 +01:00
James Price
c9b3d07b84 Fix OpenCL host code for dot kernel
Wrong number of blocks was being copied and summed, and the host sums
vector didn't have the correct size.
2016-10-24 12:49:58 +01:00
James Price
8a8f44b4ce Fix CUDA host code for dot kernel
Wrong number of blocks was being copied and summed.
2016-10-24 12:47:25 +01:00
James Price
1e94870859 Fix verification of dot kernel 2016-10-24 12:47:01 +01:00
Tom Deakin
28c2660b52 Merge branch 'master' into kernel-dot 2016-10-24 12:21:16 +01:00
Tom Deakin
7408ab0366 Add RAJA dot kernel 2016-10-24 11:34:40 +01:00
James Price
fe41771bd4 Move HIP results into new directory structure 2016-10-21 12:57:31 +01:00
James Price
856a520687 Merge pull request #10 from sunway513/pull-request-HIP
Pull request hip
2016-10-21 12:53:50 +01:00
Tom Deakin
823e12708f Add dot kernel to Kokkos 2016-10-21 10:58:26 +01:00
Tom Deakin
7203f96f6b Restructure results directory 2016-10-20 17:35:22 +01:00
Tom Deakin
d3b497a9ca Add a CUDA dot kernel 2016-10-14 17:51:40 +01:00
Tom Deakin
2085cacea0 Add an OpenCL dot kernel
We have to name the kernel stream_dot (for example) because the
"dot" kernel already exists.
2016-10-14 17:07:55 +01:00
Tom Deakin
8a100f07b4 Add dot kernel to OpenMP 4.5 - tested with clang-ykt 2016-10-14 15:19:25 +01:00
Tom Deakin
abe423ac6b Implement dot kernel in OpenMP 3 2016-10-14 15:05:06 +01:00
Tom Deakin
08fe695d51 Fix typo in main file 2016-10-14 15:04:04 +01:00
Tom Deakin
275bfb2066 Check result of the final reduction 2016-10-14 14:45:28 +01:00
Tom Deakin
04ca357159 Call the Dot kernel and print out results 2016-10-14 14:40:28 +01:00
Tom Deakin
0ef9b6691b Implement the reduction in OpenACC 2016-10-14 14:40:08 +01:00
Tom Deakin
614613e7d4 Add the dot routine to the abstract class 2016-10-14 14:39:48 +01:00
pensun
300ccd01d9 move hip_runtime.h after copyright info 2016-10-12 10:41:50 -05:00
pensun
f9fea7f8b5 Add performance results for HIP version. 2016-09-06 11:19:33 -05:00
pensun
a1f9d9ece7 Add support of HIP version of GPU-STREAM.
This commit was tested with HIP developer preview branch.
2016-09-05 23:41:01 -05:00
Tom Deakin
21f0bcfde4 Add v2.0 results 2016-08-26 15:54:22 +01:00
Tom Deakin
2352b6cc42 Move v1 results into subdirectory 2016-08-26 15:53:21 +01:00
Tom Deakin
4e3ba0639e Merge branch 'master' of github.com:UoB-HPC/GPU-STREAM 2016-07-19 15:46:17 +01:00
Tom Deakin
39e51246b0 Update citation 2016-07-19 15:46:08 +01:00
James Price
74a4a3b0bd [SYCL] Set WGSIZE to more sensible value for AMD Fiji 2016-07-07 09:40:16 +01:00
James Price
f94e36f320 [CUDA] Fix device name output (OpenCL->CUDA) 2016-07-06 17:16:35 +01:00
James Price
a2778f523e [OCL] Print build log if kernel compilation fails 2016-07-06 14:11:29 +01:00
James Price
1ec98b4496 Use 'Release' if CMAKE_BUILD_TYPE not specified 2016-07-04 15:46:00 +01:00
James Price
525fb4d5e2 Fix out-of-tree builds 2016-06-30 17:04:23 +01:00
Tom Deakin
eda9d56ed9 Merge branch 'refactor' 2016-06-30 16:51:40 +01:00
Tom Deakin
04f321db85 Replace cudaDeviceSyncronise with Kokkos::fence() function 2016-06-30 16:32:52 +01:00
Tom Deakin
8be4b26bb1 Add restrict to extra pointers in OpenACC 2016-06-30 16:24:32 +01:00
Tom Deakin
2033f3f0c6 Break RAJA build but fix all the others.. 2016-05-12 16:37:06 +01:00
Tom Deakin
2001ab5fb1 Build against a RAJA installation in the CMake build system 2016-05-12 15:40:22 +01:00
Tom Deakin
942188d836 Add copyright header to source with it missing 2016-05-12 12:53:26 +01:00
Tom Deakin
d75084b753 Fix Kokkos CMake so it works.. 2016-05-12 12:35:47 +01:00
Tom Deakin
2381f059ed Set KOKKOS_PATH to build Kokkos version 2016-05-12 12:31:16 +01:00
Tom Deakin
88d194b75c Use a variable to get Kokkos Path 2016-05-12 12:30:35 +01:00
Tom Deakin
f6fca3ac06 Add Kokkos building to CMake config 2016-05-12 12:30:06 +01:00
James Price
3ebad06bd4 [SYCL] Fix detection of CL/sycl.hpp for C++14 versions 2016-05-11 22:22:20 +01:00
James Price
7cd14f480d [SYCL] Auto-detect presence of CL/sycl.hpp and ComputeCpp 2016-05-11 22:00:04 +01:00
Tom Deakin
d4e74a88e9 Add binary names to gitignore 2016-05-11 17:53:33 +01:00
Tom Deakin
5638cbb283 Check for OpenMP support and build OMP3 version 2016-05-11 17:49:48 +01:00
Tom Deakin
bf9c6fb6cd Add -fopenacc flag on linking with GCC compiler 2016-05-11 17:21:52 +01:00
Tom Deakin
d420032c66 Remove warning about iteration count when using floats as new data values work for 100 iterations 2016-05-11 17:15:43 +01:00
Tom Deakin
9449e08886 update readme 2016-05-11 16:23:14 +01:00
Tom Deakin
494e89d16b Add placeholder banners for CMake build systems to fix 2016-05-11 16:02:34 +01:00
Tom Deakin
9b2a586e08 Add rule to build OMP4.5 on Cray 2016-05-11 15:57:39 +01:00