Commit Graph

521 Commits

Author SHA1 Message Date
Tom Deakin
8a100f07b4 Add dot kernel to OpenMP 4.5 - tested with clang-ykt 2016-10-14 15:19:25 +01:00
Tom Deakin
abe423ac6b Implement dot kernel in OpenMP 3 2016-10-14 15:05:06 +01:00
Tom Deakin
08fe695d51 Fix typo in main file 2016-10-14 15:04:04 +01:00
Tom Deakin
275bfb2066 Check result of the final reduction 2016-10-14 14:45:28 +01:00
Tom Deakin
04ca357159 Call the Dot kernel and print out results 2016-10-14 14:40:28 +01:00
Tom Deakin
0ef9b6691b Implement the reduction in OpenACC 2016-10-14 14:40:08 +01:00
Tom Deakin
614613e7d4 Add the dot routine to the abstract class 2016-10-14 14:39:48 +01:00
Ruyman Reyes
23a43bfa6d Adding support for ComputeCpp CE
This patch updates the CMake building options to support
the ComputeCpp Community Edition 0.1, including the FindComputeCpp
CMake module provided with the ComputeCpp SDK.

In order to build with ComputeCpp, only the standard CMake flags from
the SDK are required:

cmake ../
  -DHAS_SYCL=ON \
  -DCOMPUTECPP_PACKAGE_ROOT_DIR=/path/to/computecpp/package \
  -DCMAKE_MODULE_PATH=GPU-STREAM/cmake/Modules/
2016-10-12 17:51:41 +01:00
pensun
300ccd01d9 move hip_runtime.h after copyright info 2016-10-12 10:41:50 -05:00
pensun
f9fea7f8b5 Add performance results for HIP version. 2016-09-06 11:19:33 -05:00
pensun
a1f9d9ece7 Add support of HIP version of GPU-STREAM.
This commit was tested with HIP developer preview branch.
2016-09-05 23:41:01 -05:00
Tom Deakin
21f0bcfde4 Add v2.0 results 2016-08-26 15:54:22 +01:00
Tom Deakin
2352b6cc42 Move v1 results into subdirectory 2016-08-26 15:53:21 +01:00
Tom Deakin
4e3ba0639e Merge branch 'master' of github.com:UoB-HPC/GPU-STREAM 2016-07-19 15:46:17 +01:00
Tom Deakin
39e51246b0 Update citation 2016-07-19 15:46:08 +01:00
James Price
74a4a3b0bd [SYCL] Set WGSIZE to more sensible value for AMD Fiji 2016-07-07 09:40:16 +01:00
James Price
f94e36f320 [CUDA] Fix device name output (OpenCL->CUDA) 2016-07-06 17:16:35 +01:00
James Price
a2778f523e [OCL] Print build log if kernel compilation fails 2016-07-06 14:11:29 +01:00
James Price
1ec98b4496 Use 'Release' if CMAKE_BUILD_TYPE not specified 2016-07-04 15:46:00 +01:00
James Price
525fb4d5e2 Fix out-of-tree builds 2016-06-30 17:04:23 +01:00
Tom Deakin
eda9d56ed9 Merge branch 'refactor' 2016-06-30 16:51:40 +01:00
Tom Deakin
04f321db85 Replace cudaDeviceSyncronise with Kokkos::fence() function 2016-06-30 16:32:52 +01:00
Tom Deakin
8be4b26bb1 Add restrict to extra pointers in OpenACC 2016-06-30 16:24:32 +01:00
Tom Deakin
2033f3f0c6 Break RAJA build but fix all the others.. 2016-05-12 16:37:06 +01:00
Tom Deakin
2001ab5fb1 Build against a RAJA installation in the CMake build system 2016-05-12 15:40:22 +01:00
Tom Deakin
942188d836 Add copyright header to source with it missing 2016-05-12 12:53:26 +01:00
Tom Deakin
d75084b753 Fix Kokkos CMake so it works.. 2016-05-12 12:35:47 +01:00
Tom Deakin
2381f059ed Set KOKKOS_PATH to build Kokkos version 2016-05-12 12:31:16 +01:00
Tom Deakin
88d194b75c Use a variable to get Kokkos Path 2016-05-12 12:30:35 +01:00
Tom Deakin
f6fca3ac06 Add Kokkos building to CMake config 2016-05-12 12:30:06 +01:00
James Price
3ebad06bd4 [SYCL] Fix detection of CL/sycl.hpp for C++14 versions 2016-05-11 22:22:20 +01:00
James Price
7cd14f480d [SYCL] Auto-detect presence of CL/sycl.hpp and ComputeCpp 2016-05-11 22:00:04 +01:00
Tom Deakin
d4e74a88e9 Add binary names to gitignore 2016-05-11 17:53:33 +01:00
Tom Deakin
5638cbb283 Check for OpenMP support and build OMP3 version 2016-05-11 17:49:48 +01:00
Tom Deakin
bf9c6fb6cd Add -fopenacc flag on linking with GCC compiler 2016-05-11 17:21:52 +01:00
Tom Deakin
d420032c66 Remove warning about iteration count when using floats as new data values work for 100 iterations 2016-05-11 17:15:43 +01:00
Tom Deakin
9449e08886 update readme 2016-05-11 16:23:14 +01:00
Tom Deakin
494e89d16b Add placeholder banners for CMake build systems to fix 2016-05-11 16:02:34 +01:00
Tom Deakin
9b2a586e08 Add rule to build OMP4.5 on Cray 2016-05-11 15:57:39 +01:00
Tom Deakin
eae8da57ac Delete commented out C++ flag for OpenACC as no longer needed 2016-05-11 15:57:20 +01:00
Tom Deakin
31cb567e21 Switch data from 1.0, 2.0 and 3.0 to 0.1, 0.2, and 0.3 resp.
Using integers for maths gets unstable past 38 interations even
in double precision. Using the original values/10 is safe up to
the default 100 iterations.
2016-05-11 15:51:19 +01:00
Tom Deakin
55a858e0c0 Use 2^25 as default size because 2^26 gives too many thread blocks for CUDA 2016-05-11 15:43:52 +01:00
Tom Deakin
4954ef7cf0 Add map clauses to OpenMP 4.5 kernels 2016-05-11 15:17:06 +01:00
Tom Deakin
eb10c716f2 First attempt at OpenMP 4.5 2016-05-11 15:08:08 +01:00
Tom Deakin
e095cb67f8 Remove ugly CMake endif text in parenthesis 2016-05-11 13:37:12 +01:00
Tom Deakin
bf29b02d35 Add banners in CMakeLists file so easy to spot build rules for versions 2016-05-11 13:35:24 +01:00
Tom Deakin
8a195b6416 Remove printout of compiler id in cmake 2016-05-11 13:35:12 +01:00
Tom Deakin
e0ca56bd67 Set the C++11 flag when using the Cray compiler 2016-05-11 13:33:01 +01:00
Tom Deakin
8d45e61f6c Check for OpenACC support by checking the various compiler flags 2016-05-11 13:20:15 +01:00
Tom Deakin
1a9225ca95 If building CUDA on Darwin with Xcode 7.3.1 skip becuase CUDA doesn't work this version 2016-05-11 12:54:12 +01:00