Tom Deakin
8ae8c70188
Merge pull request #81 from Kerilk/master
...
Ensure OpenCL destructors are called in the "correct" order.
2020-10-26 09:58:05 +00:00
Brice Videau
e92d034f64
Ensure OpenCL destructors are called in the correct order.
2020-10-16 18:05:23 -05:00
Tom Deakin
6f46267e6c
Add AOMP build options
2020-08-13 17:46:45 +01:00
Tom Deakin
66d915fa2e
[OpenMP] Fix ARMCLANG Makefile bug where it didn't set the flags
2020-08-12 15:39:13 +01:00
Tom Deakin
f31181dedb
Add -O3 flat to HIP.make to fix segmentation fault
2020-08-12 14:09:22 +01:00
Tom Deakin
da3946a7d5
Add missing O3 flag for OpenMP ARMCLANG
2020-08-07 17:09:46 +01:00
Tom Deakin
0ff841bbf5
Update CHANGELOG.md
2020-08-07 12:29:28 +01:00
Tom Deakin
17f057c38a
Merge pull request #79 from tom91136/master
...
Update build flags for SYCL, Kokkos, and OpenMP, tracking newest versions of each compiler
2020-08-07 12:27:43 +01:00
Tom Lin
cdaf6cb88e
Fixed a bug where ComputeCpp's flags is omitted
...
Renamed INTEL_GT -> INTEL_GPU
Only use NVCC with Kokkos if not using HIPCC
2020-08-07 11:00:56 +01:00
Tom Lin
59274d6a91
Add NVIDIA as target for dpcpp
2020-08-05 08:54:40 +01:00
Tom Lin
603dc7d136
Add HIP compilers for Kokkos
2020-08-05 08:49:18 +01:00
Tom Lin
98b0939669
Add Intel GT OMP offloading support for icpc
2020-08-04 23:52:08 +01:00
Tom Lin
09458ef866
Fixed hipSYCL flag propagation
2020-08-03 17:22:39 +01:00
Tom Lin
f0403d2b09
Add CXX support for hipSYCL, dpcpp, and ComputeCpp
2020-08-03 16:45:06 +01:00
Tom Deakin
2f9f533890
Merge pull request #76 from tom91136/master
...
Add PPC+GNU combination
2020-07-29 14:10:27 +01:00
Tom Lin
0cb6b3d421
Add PPC+GNU combination
2020-07-16 11:07:01 +01:00
Tom Deakin
8ece4079fd
Update CHANGELOG.md
2020-07-14 14:03:04 +01:00
Tom Deakin
d28df3b71e
Merge pull request #73 from tom91136/master
...
Added support for Cray, PGI, and Armclang for Kokkos and OpenMP
2020-07-14 14:02:18 +01:00
Tom Lin
a22bb92516
Add armclang-cpu flags for OpenMP
2020-07-14 13:57:40 +01:00
Tom Lin
1fcd062d6c
Add Cray, PGI, Armclang support for Kokkos/OpenMP
...
Fixed GCC's missing openmp flag in Kokkos
2020-07-14 13:56:55 +01:00
Tom Deakin
6c57b6305e
Update CHANGELOG.md
...
Summarise move of build system to Kokkos 3.
2020-07-13 09:35:55 +01:00
Tom Deakin
a8b85e71bd
Merge pull request #72 from tom91136/master
...
Update Kokkos to support version 3+
2020-07-13 09:34:46 +01:00
Tom Lin
1ffd069e80
Update Kokkos to support version 3+
2020-07-13 03:02:34 +01:00
Tom Deakin
64617c6dee
Update OpenMP Cray flags
...
Fixes #68
2020-07-10 13:28:23 +01:00
Tom Deakin
5d0ee99de6
Remove Cray flags for OpenACC following removal of support in latest compiler
2020-07-10 13:27:21 +01:00
Tom Deakin
d6520daf11
Update README with differentiation from STREAM
2020-06-02 15:41:00 +01:00
Tom Deakin
272c73a622
Merge pull request #66 from ams-cs/master
...
Add GNU OpenACC support for AMD GCN
2020-05-22 13:00:15 +01:00
Andrew Stubbs
09271eda17
Add GNU OpenACC support for AMD GCN
...
Autodetect the device type, rather than hard-code NVidia.
Add GNU command line options to the makefile, and adjust the "restrict"
extension usage. For now, we assume the toolchain is only configured for one
accelerator.
2020-05-21 20:54:04 +01:00
Tom Deakin
d410c65c97
[OpenMP] Change GNU -mcpu=native to -march=native as former is deprecated
2020-05-12 11:48:26 +01:00
Tom Deakin
b792c422f7
[OpenMP] Add build flags for OpenMP offload to AMD and NVIDIA with GCC 10.1
...
Closes #65
2020-05-12 11:24:29 +01:00
Tom Deakin
87b126f5ea
Merge branch 'local'
...
Conflicts:
SYCLStream.cpp
2020-05-11 17:20:01 +01:00
Tom Deakin
0919d95aa4
[SYCL] Use SYCL runtime device discovery
...
Fixes #63
2020-05-11 17:16:47 +01:00
Tom Deakin
1d6da069b3
[SYCL] Pass explicit async_handler to queue constructor
2020-05-11 17:13:36 +01:00
Tom Deakin
7f1637d679
[SYCL] Remove unused program variable
2020-05-11 17:10:48 +01:00
Tom Deakin
6db2c7a0ec
[SYCL] Remove unused program variable
2020-05-11 17:09:21 +01:00
Tom Deakin
1bc4395f48
Update local copy of OpenCL C++ header file.
...
This closes #62
2020-03-16 16:43:55 +00:00
Tom Deakin
8776901733
[SYCL] Use the cl::sycl::id parameter in the parallel_for kernels
...
The cl::sycl::item provides extra features for extracing global/local
ids which aren't required by the kernels.
This also means the kernels don't need to extract the id from the item.
2019-11-01 15:19:01 +00:00
Tom Deakin
4bcb777100
Add Zen target for OpenACC
2019-08-08 14:36:20 +00:00
Tom Deakin
63cc964847
Update CHANGELOG with updates from #58
2019-06-26 12:06:06 +01:00
Tom Deakin
022793bdd6
Merge pull request #58 from GeorgeWeb/sycl-compliant
...
Making BabelStream's SYCL code compliant
2019-06-26 12:03:47 +01:00
GeorgeWeb
e657bfa897
based on perf comparison, and discussions, the use pre-built kernels is unnecessary in this case
2019-06-20 14:24:46 +01:00
GeorgeWeb
54737d87cb
enclosing computecpp specific code in macros, rather than removing it
2019-06-20 10:13:39 +01:00
GeorgeWeb
a2e53d6728
remove use of pre-built kernel in parallel_for as is not conformant with the SYCL spec. (yet)
2019-06-18 17:31:40 +01:00
Georgi Mirazchiyski
60817e25a1
fix deprecated use of get_global() and get_local()
2019-06-18 17:22:49 +01:00
Tom Deakin
289a2c204c
Version bump
2019-04-10 14:12:00 +01:00
Tom Deakin
decc44f04a
Merge pull request #56 from UoB-HPC/base2
...
Add a --mibibytes flag to output bandwidth and array sizes in base 2
2019-04-10 14:09:24 +01:00
Tom Deakin
dd6f3af98b
Update changelog
2019-04-10 14:06:50 +01:00
Tom Deakin
08348d1f0f
Use ternary operator for simpler base 2 output checks
2019-04-10 14:06:05 +01:00
Patrick Atkinson
c50eba9caf
fix for mibibytes in printing
2019-04-10 11:04:29 +00:00
Tom Deakin
5a1396671e
Add a --mibibytes flag to output bandwidth and array sizes in base 2
...
This sets MiB = 2^20, GiB = 2^30 rather than the default of
MB = 10^6 and GB = 10^9.
2019-04-09 09:50:44 +01:00