Tom Deakin
|
1bd4adfe7b
|
[RAJA] Align the memory to 2MB pages
|
2017-04-06 10:14:51 +01:00 |
|
Tom Deakin
|
1eb75f034a
|
[RAJA] Use xHost and streaming stores with the Intel compiler
|
2017-04-06 10:02:25 +01:00 |
|
Tom Deakin
|
d7a93be739
|
[Kokkos] Add a COMPILER option to Makefile, which turns on streaming stores for Intel
|
2017-04-05 22:23:27 +01:00 |
|
Tom Deakin
|
d7e38c1ca9
|
Add Kokkos build instructions to README
|
2017-04-05 22:09:58 +01:00 |
|
Tom Deakin
|
d9dfc3f552
|
[Kokkos] Use long for iterator variable
|
2017-04-05 21:57:55 +01:00 |
|
Tom Deakin
|
bf57cf578d
|
[CUDA] Merge pull request #28 from psteinb/extra_cuda_make_variable
Allow specifying compiler and flags for build
|
2017-03-17 14:22:37 +00:00 |
|
Peter Steinbach
|
d8cb7494e0
|
pulled -O3 out into CXXFLAGS, refactored CUDA compiler into CUDA_CXX
make variable to cope with clang as CUDA compiler as well
|
2017-03-17 15:18:13 +01:00 |
|
James Price
|
703eb945a2
|
[OpenMP] Align memory (2MB by default)
|
2017-03-13 17:17:20 +00:00 |
|
James Price
|
4f288ddc3d
|
[OpenMP] Add -qopt-streaming-stores for Intel
|
2017-03-13 17:15:10 +00:00 |
|
Peter Steinbach
|
8c7a801a84
|
put -O3 into CXXFLAGS to comply with OpenMP.make
|
2017-03-13 15:22:26 +01:00 |
|
Peter Steinbach
|
ea12f2a9a1
|
added EXTRA_FLAGS variable to CUDA Makefile to provide the freedom to specify debug flags or gencode flags
|
2017-03-13 14:41:16 +01:00 |
|
James Price
|
94e0900377
|
Use static shared memory in dot for CUDA and HIP
|
2017-02-28 13:24:45 +00:00 |
|
Tom Deakin
|
e7a619c63c
|
Merge pull request #27 from psteinb/fix_minus_for_an_equal
replaced - for = so that assignment takes place
|
2017-02-28 12:46:43 +00:00 |
|
Peter Steinbach
|
e570b458a6
|
replaced - for = so that assignment takes place
|
2017-02-28 13:43:57 +01:00 |
|
James Price
|
8a47b72764
|
Merge pull request #26 from psteinb/fix_sharedmem_hip
Fix sharedmem hip
|
2017-02-28 12:36:37 +00:00 |
|
Peter Steinbach
|
58773a79b7
|
removed extra lines introduced by hipify, removed obsolete commented code
|
2017-02-28 13:33:21 +01:00 |
|
Peter Steinbach
|
ceada6922f
|
proper declaration of tb_sum with HIP_DYNAMIC_SHARED macro
|
2017-02-28 10:07:48 +01:00 |
|
Peter Steinbach
|
350a151c3b
|
removed CUDA_PATH sentinel from HIP.make
|
2017-02-28 10:04:36 +01:00 |
|
Peter Steinbach
|
ee7cd066ac
|
renamed HIPStream implementation
|
2017-02-28 10:03:23 +01:00 |
|
Tom Deakin
|
cc90cefeeb
|
Minor version bump to signal build system update
|
2017-02-25 14:14:59 +00:00 |
|
Tom Deakin
|
4d24e2341f
|
Merge pull request #24 from UoB-HPC/bugfix/build
Simplify build system
|
2017-02-25 14:14:29 +00:00 |
|
James Price
|
2416727239
|
Refactor compiler flag handling in RAJA Makefile
|
2017-02-24 22:28:16 +00:00 |
|
Tom Deakin
|
050a27ca83
|
Add XL compiler support to OpenMP and RAJA makefiles
|
2017-02-24 17:37:30 +00:00 |
|
James Price
|
dfe5503cba
|
Allow user to override CXX in OpenCL.make
|
2017-02-24 09:33:59 -06:00 |
|
James Price
|
569cfa1d31
|
Make Cray OpenMP flag non-empty to fix error
|
2017-02-24 09:02:51 -06:00 |
|
James Price
|
a7d7998326
|
Use -framework OpenCL on Darwin
|
2017-02-24 13:40:54 +00:00 |
|
James Price
|
1aec057e48
|
Add help messages to OpenMP.make and refactor
|
2017-02-24 13:32:59 +00:00 |
|
James Price
|
8fee86a232
|
Add compiler help to OpenCL.make
|
2017-02-24 13:17:12 +00:00 |
|
James Price
|
82de818855
|
Add support for Intel as host compiler for OpenCL
|
2017-02-24 13:14:13 +00:00 |
|
James Price
|
6008f8c536
|
Add intermediate objects to OpenACC clean rule
PGI creates these, even though we don't ask for them.
|
2017-02-24 13:14:13 +00:00 |
|
Tom Deakin
|
c470b88dee
|
Add compiler help text to OpenACC
|
2017-02-24 13:13:08 +00:00 |
|
Tom Deakin
|
3be4ebc1a2
|
Add help messages to RAJA Makefile
|
2017-02-24 13:11:07 +00:00 |
|
Tom Deakin
|
bbdd5b9fcb
|
Add help message to Kokkos TARGET variable
|
2017-02-24 13:07:35 +00:00 |
|
Tom Deakin
|
91131baf3d
|
Remove BDW and KNL targets from OpenACC as not yet supported by PGI
|
2017-02-24 12:52:13 +00:00 |
|
Tom Deakin
|
452804f1e0
|
Merge branch 'bugfix/build' of github.com:uob-hpc/gpu-stream into bugfix/build
|
2017-02-24 12:48:25 +00:00 |
|
James Price
|
282251f026
|
Add options for specific targets in OpenACC.make
|
2017-02-24 12:48:00 +00:00 |
|
Tom Deakin
|
10b2376634
|
Add TARGET option to OpenMP Makefile
|
2017-02-24 12:24:35 +00:00 |
|
Tom Deakin
|
761ba2dce0
|
Add a clean option to all Makefiles
|
2017-02-24 11:57:34 +00:00 |
|
Tom Deakin
|
cbf15e8954
|
Have -std=c++11 as default flags when COMPILER is not set for ACC, CL and MP
|
2017-02-24 11:50:25 +00:00 |
|
James Price
|
e1ea01f0c2
|
Add -xHOST to Intel for OpenMP
|
2017-02-24 11:02:50 +00:00 |
|
James Price
|
b485e77ef3
|
Add SYCL intermediate outputs to .gitignore
|
2017-02-23 19:38:14 +00:00 |
|
James Price
|
b62c93527d
|
Use computecpp_info for SYCL device compiler flags
|
2017-02-23 19:26:55 +00:00 |
|
James Price
|
13f5c837f6
|
Add HIP.make with CUDA support
Still needs additional work for HIP via HCC.
|
2017-02-23 19:19:47 +00:00 |
|
James Price
|
f90afb1940
|
Merge branch 'master' into bugfix/build
|
2017-02-23 19:08:52 +00:00 |
|
James Price
|
ce4f49e08b
|
Add dot kernel to HIP implementation
|
2017-02-23 19:08:25 +00:00 |
|
James Price
|
d78baa4c2b
|
Use CXXFLAGS everywhere for consistency
|
2017-02-23 18:39:40 +00:00 |
|
James Price
|
063446b6a7
|
Add EXTRA_FLAGS to SYCL.make
|
2017-02-23 18:15:27 +00:00 |
|
James Price
|
cdea91abc6
|
Add required ARCH flag for RAJA on GPUs
|
2017-02-23 17:49:46 +00:00 |
|
James Price
|
f3abd66578
|
Make Kokkos build without intermediate objects
|
2017-02-23 17:24:14 +00:00 |
|
Tom Deakin
|
54a2a39ccf
|
Merge branch 'master' into bugfix/build
|
2017-02-23 16:46:25 +00:00 |
|