Tom Deakin
a8b85e71bd
Merge pull request #72 from tom91136/master
...
Update Kokkos to support version 3+
2020-07-13 09:34:46 +01:00
Tom Lin
1ffd069e80
Update Kokkos to support version 3+
2020-07-13 03:02:34 +01:00
Tom Deakin
64617c6dee
Update OpenMP Cray flags
...
Fixes #68
2020-07-10 13:28:23 +01:00
Tom Deakin
5d0ee99de6
Remove Cray flags for OpenACC following removal of support in latest compiler
2020-07-10 13:27:21 +01:00
Tom Deakin
d6520daf11
Update README with differentiation from STREAM
2020-06-02 15:41:00 +01:00
Tom Deakin
272c73a622
Merge pull request #66 from ams-cs/master
...
Add GNU OpenACC support for AMD GCN
2020-05-22 13:00:15 +01:00
Andrew Stubbs
09271eda17
Add GNU OpenACC support for AMD GCN
...
Autodetect the device type, rather than hard-code NVidia.
Add GNU command line options to the makefile, and adjust the "restrict"
extension usage. For now, we assume the toolchain is only configured for one
accelerator.
2020-05-21 20:54:04 +01:00
Tom Deakin
d410c65c97
[OpenMP] Change GNU -mcpu=native to -march=native as former is deprecated
2020-05-12 11:48:26 +01:00
Tom Deakin
b792c422f7
[OpenMP] Add build flags for OpenMP offload to AMD and NVIDIA with GCC 10.1
...
Closes #65
2020-05-12 11:24:29 +01:00
Tom Deakin
87b126f5ea
Merge branch 'local'
...
Conflicts:
SYCLStream.cpp
2020-05-11 17:20:01 +01:00
Tom Deakin
0919d95aa4
[SYCL] Use SYCL runtime device discovery
...
Fixes #63
2020-05-11 17:16:47 +01:00
Tom Deakin
1d6da069b3
[SYCL] Pass explicit async_handler to queue constructor
2020-05-11 17:13:36 +01:00
Tom Deakin
7f1637d679
[SYCL] Remove unused program variable
2020-05-11 17:10:48 +01:00
Tom Deakin
6db2c7a0ec
[SYCL] Remove unused program variable
2020-05-11 17:09:21 +01:00
Tom Deakin
1bc4395f48
Update local copy of OpenCL C++ header file.
...
This closes #62
2020-03-16 16:43:55 +00:00
Tom Deakin
8776901733
[SYCL] Use the cl::sycl::id parameter in the parallel_for kernels
...
The cl::sycl::item provides extra features for extracting global/local
ids which aren't required by the kernels.
This also means the kernels don't need to extract the id from the item.
2019-11-01 15:19:01 +00:00
Tom Deakin
4bcb777100
Add Zen target for OpenACC
2019-08-08 14:36:20 +00:00
Tom Deakin
63cc964847
Update CHANGELOG with updates from #58
2019-06-26 12:06:06 +01:00
Tom Deakin
022793bdd6
Merge pull request #58 from GeorgeWeb/sycl-compliant
...
Making BabelStream's SYCL code compliant
2019-06-26 12:03:47 +01:00
GeorgeWeb
e657bfa897
based on perf comparison and discussions, the use of pre-built kernels is unnecessary in this case
2019-06-20 14:24:46 +01:00
GeorgeWeb
54737d87cb
enclosing computecpp specific code in macros, rather than removing it
2019-06-20 10:13:39 +01:00
GeorgeWeb
a2e53d6728
remove use of pre-built kernel in parallel_for as it is not conformant with the SYCL spec (yet)
2019-06-18 17:31:40 +01:00
Georgi Mirazchiyski
60817e25a1
fix deprecated use of get_global() and get_local()
2019-06-18 17:22:49 +01:00
Tom Deakin
289a2c204c
Version bump
2019-04-10 14:12:00 +01:00
Tom Deakin
decc44f04a
Merge pull request #56 from UoB-HPC/base2
...
Add a --mibibytes flag to output bandwidth and array sizes in base 2
2019-04-10 14:09:24 +01:00
Tom Deakin
dd6f3af98b
Update changelog
2019-04-10 14:06:50 +01:00
Tom Deakin
08348d1f0f
Use ternary operator for simpler base 2 output checks
2019-04-10 14:06:05 +01:00
Patrick Atkinson
c50eba9caf
fix for mibibytes in printing
2019-04-10 11:04:29 +00:00
Tom Deakin
5a1396671e
Add a --mibibytes flag to output bandwidth and array sizes in base 2
...
This sets MiB = 2^20, GiB = 2^30 rather than the default of
MB = 10^6 and GB = 10^9.
2019-04-09 09:50:44 +01:00
Tom Deakin
db2a4c40d8
[OpenACC] Add PGI support for Power 9
2019-03-14 15:56:51 +00:00
Tom Deakin
7ec2108896
[OpenMP] Use -qarch=auto with XL compiler
2019-03-14 15:39:45 +00:00
Tom Deakin
c8098a5cc0
[OpenACC] Add KNL support
2019-03-14 09:11:16 -05:00
Tom Deakin
f1f31d2a9b
[OpenACC] Add PGI compiler support for Skylake
2019-03-13 04:14:11 -05:00
Tom Deakin
db9bf78530
[OpenMP] Add PGI compiler support
2019-03-13 04:13:38 -05:00
Tom Deakin
6229b83e62
update changelog
2019-03-11 17:40:33 +00:00
Tom Deakin
8275c97050
[OpenMP] Add NEC compiler flags
2019-03-11 11:11:18 +00:00
Tom Deakin
da085f3f59
Add logo to README
2018-10-17 09:50:20 +01:00
Tom Deakin
02bcd9b762
Fix trailing comma in CSV output
2018-10-04 14:37:27 +01:00
Tom Deakin
a1f7b94820
Support CSV output for triad only running mode
...
Fixes #54
2018-10-04 14:36:59 +01:00
Tom Deakin
96216628bf
Update CHANGELOG.md
2018-09-14 12:57:14 +01:00
Tom Deakin
a88ebd7b38
Merge pull request #51 from Ruyk/master
...
Minor SYCL and OpenCL fixes
2018-09-14 12:56:07 +01:00
Tom Deakin
e5d54dd521
Use parallel loop for OpenACC instead of kernels
...
Closes #53.
2018-07-25 15:53:50 +00:00
Tom Deakin
d9b089a0f9
Merge pull request #52 from plavin/master
...
Add Titan Xp numbers
2018-05-08 10:43:45 +01:00
Patrick Robert Lavin
b92c01c650
Add Titan Xp numbers
2018-05-07 11:42:11 -04:00
Ruyman Reyes
8920cfd633
Split compilation lines for SYCL Stream
...
The Makefile for SYCL Stream compilation is now split with
individual lines for each file. This facilitates identifying
compile time errors and avoids warnings on unused symbols.
2018-05-02 15:22:20 +01:00
Ruyman Reyes
63f32fcb51
Manually clearing the global device vector
...
The vector of devices is a global object, whose destruction order is
undefined. On some platforms, the OpenCL library has been unloaded
before this destructor is hit, which causes a segmentation fault after
the program ends. By clearing the global vector of devices on
destruction of the OpenCL and SYCL Stream benchmarks we avoid the
problem.
2018-05-02 15:21:41 +01:00
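The pattern described in the commit above might look like the sketch below. It assumes a stand-in `Device` type and a `Stream` class invented for illustration; the real benchmarks hold OpenCL/SYCL device handles, which are not reproduced here.

```cpp
#include <vector>

// Hypothetical stand-in for a device handle whose destructor would call
// into the OpenCL library. The counter only makes the lifetime observable.
struct Device {
  static int live;
  Device() { ++live; }
  Device(const Device&) { ++live; }
  ~Device() { --live; }
};
int Device::live = 0;

// Global cache of devices. Left alone, its elements are destroyed during
// static destruction, possibly after the OpenCL library has been unloaded.
std::vector<Device> cached_devices;

// Hypothetical benchmark class: it fills the cache on construction and
// clears it in its own destructor, while the library is still loaded.
struct Stream {
  Stream() {
    if (cached_devices.empty()) cached_devices.resize(2);
  }
  ~Stream() { cached_devices.clear(); }
};
```

With this arrangement the device handles are released when the benchmark object goes out of scope, so nothing device-related is left for static destruction to tear down.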
Tom Deakin
54fc326097
Add mcpu=native flag to GNU OpenMP builds
2018-04-27 13:21:30 +01:00
Tom Deakin
54b8a549c1
Update CHANGELOG.md
...
Add notice to changelog about #49.
2018-03-19 11:07:32 +00:00
Tom Deakin
c5b64b9cad
Merge pull request #49 from mangupta/fix_hip_build
...
Make hip-stream buildable with latest version of HIP
2018-03-19 11:06:25 +00:00
Maneesh Gupta
1dc35f0d99
[HIP] Search for hipcc in the preferred way
2018-03-19 15:34:07 +05:30