James Price
|
c9b3d07b84
|
Fix OpenCL host code for dot kernel
Wrong number of blocks was being copied and summed, and the host sums
vector didn't have the correct size.
|
2016-10-24 12:49:58 +01:00 |
|
James Price
|
8a8f44b4ce
|
Fix CUDA host code for dot kernel
Wrong number of blocks was being copied and summed.
|
2016-10-24 12:47:25 +01:00 |
|
James Price
|
1e94870859
|
Fix verification of dot kernel
|
2016-10-24 12:47:01 +01:00 |
|
Tom Deakin
|
28c2660b52
|
Merge branch 'master' into kernel-dot
|
2016-10-24 12:21:16 +01:00 |
|
Tom Deakin
|
7408ab0366
|
Add RAJA dot kernel
|
2016-10-24 11:34:40 +01:00 |
|
James Price
|
fe41771bd4
|
Move HIP results into new directory structure
|
2016-10-21 12:57:31 +01:00 |
|
James Price
|
856a520687
|
Merge pull request #10 from sunway513/pull-request-HIP
Pull request hip
|
2016-10-21 12:53:50 +01:00 |
|
Tom Deakin
|
823e12708f
|
Add dot kernel to Kokkos
|
2016-10-21 10:58:26 +01:00 |
|
Tom Deakin
|
7203f96f6b
|
Restructure results directory
|
2016-10-20 17:35:22 +01:00 |
|
Tom Deakin
|
d3b497a9ca
|
Add a CUDA dot kernel
|
2016-10-14 17:51:40 +01:00 |
|
Tom Deakin
|
2085cacea0
|
Add an OpenCL dot kernel
We have to name the kernel stream_dot (for example) because the
"dot" kernel already exists.
|
2016-10-14 17:07:55 +01:00 |
|
Tom Deakin
|
8a100f07b4
|
Add dot kernel to OpenMP 4.5 - tested with clang-ykt
|
2016-10-14 15:19:25 +01:00 |
|
Tom Deakin
|
abe423ac6b
|
Implement dot kernel in OpenMP 3
|
2016-10-14 15:05:06 +01:00 |
|
Tom Deakin
|
08fe695d51
|
Fix typo in main file
|
2016-10-14 15:04:04 +01:00 |
|
Tom Deakin
|
275bfb2066
|
Check result of the final reduction
|
2016-10-14 14:45:28 +01:00 |
|
Tom Deakin
|
04ca357159
|
Call the Dot kernel and print out results
|
2016-10-14 14:40:28 +01:00 |
|
Tom Deakin
|
0ef9b6691b
|
Implement the reduction in OpenACC
|
2016-10-14 14:40:08 +01:00 |
|
Tom Deakin
|
614613e7d4
|
Add the dot routine to the abstract class
|
2016-10-14 14:39:48 +01:00 |
|
pensun
|
300ccd01d9
|
move hip_runtime.h after copyright info
|
2016-10-12 10:41:50 -05:00 |
|
pensun
|
f9fea7f8b5
|
Add performance results for HIP version.
|
2016-09-06 11:19:33 -05:00 |
|
pensun
|
a1f9d9ece7
|
Add support of HIP version of GPU-STREAM.
This commit was tested with HIP developer preview branch.
|
2016-09-05 23:41:01 -05:00 |
|
Tom Deakin
|
21f0bcfde4
|
Add v2.0 results
|
2016-08-26 15:54:22 +01:00 |
|
Tom Deakin
|
2352b6cc42
|
Move v1 results into subdirectory
|
2016-08-26 15:53:21 +01:00 |
|
Tom Deakin
|
4e3ba0639e
|
Merge branch 'master' of github.com:UoB-HPC/GPU-STREAM
|
2016-07-19 15:46:17 +01:00 |
|
Tom Deakin
|
39e51246b0
|
Update citation
|
2016-07-19 15:46:08 +01:00 |
|
James Price
|
74a4a3b0bd
|
[SYCL] Set WGSIZE to more sensible value for AMD Fiji
|
2016-07-07 09:40:16 +01:00 |
|
James Price
|
f94e36f320
|
[CUDA] Fix device name output (OpenCL->CUDA)
|
2016-07-06 17:16:35 +01:00 |
|
James Price
|
a2778f523e
|
[OCL] Print build log if kernel compilation fails
|
2016-07-06 14:11:29 +01:00 |
|
James Price
|
1ec98b4496
|
Use 'Release' if CMAKE_BUILD_TYPE not specified
|
2016-07-04 15:46:00 +01:00 |
|
James Price
|
525fb4d5e2
|
Fix out-of-tree builds
|
2016-06-30 17:04:23 +01:00 |
|
Tom Deakin
|
eda9d56ed9
|
Merge branch 'refactor'
|
2016-06-30 16:51:40 +01:00 |
|
Tom Deakin
|
04f321db85
|
Replace cudaDeviceSyncronise with Kokkos::fence() function
|
2016-06-30 16:32:52 +01:00 |
|
Tom Deakin
|
8be4b26bb1
|
Add restrict to extra pointers in OpenACC
|
2016-06-30 16:24:32 +01:00 |
|
Tom Deakin
|
2033f3f0c6
|
Break RAJA build but fix all the others..
|
2016-05-12 16:37:06 +01:00 |
|
Tom Deakin
|
2001ab5fb1
|
Build against a RAJA installation in the CMake build system
|
2016-05-12 15:40:22 +01:00 |
|
Tom Deakin
|
942188d836
|
Add copyright header to source with it missing
|
2016-05-12 12:53:26 +01:00 |
|
Tom Deakin
|
d75084b753
|
Fix Kokkos CMake so it works..
|
2016-05-12 12:35:47 +01:00 |
|
Tom Deakin
|
2381f059ed
|
Set KOKKOS_PATH to build Kokkos version
|
2016-05-12 12:31:16 +01:00 |
|
Tom Deakin
|
88d194b75c
|
Use a variable to get Kokkos Path
|
2016-05-12 12:30:35 +01:00 |
|
Tom Deakin
|
f6fca3ac06
|
Add Kokkos building to CMake config
|
2016-05-12 12:30:06 +01:00 |
|
James Price
|
3ebad06bd4
|
[SYCL] Fix detection of CL/sycl.hpp for C++14 versions
|
2016-05-11 22:22:20 +01:00 |
|
James Price
|
7cd14f480d
|
[SYCL] Auto-detect presence of CL/sycl.hpp and ComputeCpp
|
2016-05-11 22:00:04 +01:00 |
|
Tom Deakin
|
d4e74a88e9
|
Add binary names to gitignore
|
2016-05-11 17:53:33 +01:00 |
|
Tom Deakin
|
5638cbb283
|
Check for OpenMP support and build OMP3 version
|
2016-05-11 17:49:48 +01:00 |
|
Tom Deakin
|
bf9c6fb6cd
|
Add -fopenacc flag on linking with GCC compiler
|
2016-05-11 17:21:52 +01:00 |
|
Tom Deakin
|
d420032c66
|
Remove warning about iteration count when using floats as new data values work for 100 iterations
|
2016-05-11 17:15:43 +01:00 |
|
Tom Deakin
|
9449e08886
|
update readme
|
2016-05-11 16:23:14 +01:00 |
|
Tom Deakin
|
494e89d16b
|
Add placeholder banners for CMake build systems to fix
|
2016-05-11 16:02:34 +01:00 |
|
Tom Deakin
|
9b2a586e08
|
Add rule to build OMP4.5 on Cray
|
2016-05-11 15:57:39 +01:00 |
|
Tom Deakin
|
eae8da57ac
|
Delete commented out C++ flag for OpenACC as no longer needed
|
2016-05-11 15:57:20 +01:00 |
|