Commit Graph

593 Commits

Author SHA1 Message Date
Tom Deakin
a8b85e71bd Merge pull request #72 from tom91136/master
Update Kokkos to support version 3+
2020-07-13 09:34:46 +01:00
Tom Lin
1ffd069e80 Update Kokkos to support version 3+ 2020-07-13 03:02:34 +01:00
Tom Deakin
64617c6dee Update OpenMP Cray flags
Fixes #68
2020-07-10 13:28:23 +01:00
Tom Deakin
5d0ee99de6 Remove Cray flags for OpenACC following removal of support in latest compiler 2020-07-10 13:27:21 +01:00
Tom Deakin
d6520daf11 Update README with differentiation from STREAM 2020-06-02 15:41:00 +01:00
Tom Deakin
272c73a622 Merge pull request #66 from ams-cs/master
Add GNU OpenACC support for AMD GCN
2020-05-22 13:00:15 +01:00
Andrew Stubbs
09271eda17 Add GNU OpenACC support for AMD GCN
Autodetect the device type, rather than hard-code NVidia.

Add GNU command line options to the makefile, and adjust the "restrict"
extension usage. For now, we assume the toolchain is only configured for one
accelerator.
2020-05-21 20:54:04 +01:00
Tom Deakin
d410c65c97 [OpenMP] Change GNU -mcpu=native to -march=native as former is deprecated 2020-05-12 11:48:26 +01:00
Tom Deakin
b792c422f7 [OpenMP] Add build flags for OpenMP offload to AMD and NVIDIA with GCC 10.1
Closes #65
2020-05-12 11:24:29 +01:00
Tom Deakin
87b126f5ea Merge branch 'local'
Conflicts:
	SYCLStream.cpp
2020-05-11 17:20:01 +01:00
Tom Deakin
0919d95aa4 [SYCL] Use SYCL runtime device discovery
Fixes #63
2020-05-11 17:16:47 +01:00
Tom Deakin
1d6da069b3 [SYCL] Pass explicit async_handler to queue constructor 2020-05-11 17:13:36 +01:00
Tom Deakin
7f1637d679 [SYCL] Remove unused program variable 2020-05-11 17:10:48 +01:00
Tom Deakin
6db2c7a0ec [SYCL] Remove unused program variable 2020-05-11 17:09:21 +01:00
Tom Deakin
1bc4395f48 Update local copy of OpenCL C++ header file.
This closes #62
2020-03-16 16:43:55 +00:00
Tom Deakin
8776901733 [SYCL] Use the cl::sycl::id parameter in the parallel_for kernels
The cl::sycl::item provides extra features for extracting global/local
ids which aren't required by the kernels.
This also means the kernels don't need to extract the id from the item.
2019-11-01 15:19:01 +00:00
Tom Deakin
4bcb777100 Add Zen target for OpenACC 2019-08-08 14:36:20 +00:00
Tom Deakin
63cc964847 Update CHANGELOG with updates from #58 2019-06-26 12:06:06 +01:00
Tom Deakin
022793bdd6 Merge pull request #58 from GeorgeWeb/sycl-compliant
Making BabelStream's SYCL code compliant
2019-06-26 12:03:47 +01:00
GeorgeWeb
e657bfa897 based on perf comparison, and discussions, the use of pre-built kernels is unnecessary in this case 2019-06-20 14:24:46 +01:00
GeorgeWeb
54737d87cb enclosing computecpp specific code in macros, rather than removing it 2019-06-20 10:13:39 +01:00
GeorgeWeb
a2e53d6728 remove use of pre-built kernel in parallel_for as it is not conformant with the SYCL spec. (yet) 2019-06-18 17:31:40 +01:00
Georgi Mirazchiyski
60817e25a1 fix deprecated use of get_global() and get_local() 2019-06-18 17:22:49 +01:00
Tom Deakin
289a2c204c Version bump 2019-04-10 14:12:00 +01:00
Tom Deakin
decc44f04a Merge pull request #56 from UoB-HPC/base2
Add a --mibibytes flag to output bandwidth and array sizes in base 2
2019-04-10 14:09:24 +01:00
Tom Deakin
dd6f3af98b Update changelog 2019-04-10 14:06:50 +01:00
Tom Deakin
08348d1f0f Use ternary operator for simpler base 2 output checks 2019-04-10 14:06:05 +01:00
Patrick Atkinson
c50eba9caf fix for mibibytes in printing 2019-04-10 11:04:29 +00:00
Tom Deakin
5a1396671e Add a --mibibytes flag to output bandwidth and array sizes in base 2
This sets MiB = 2^20, GiB = 2^30 rather than the default of
MB = 10^6 and GB = 10^9.
2019-04-09 09:50:44 +01:00
Tom Deakin
db2a4c40d8 [OpenACC] Add PGI support for Power 9 2019-03-14 15:56:51 +00:00
Tom Deakin
7ec2108896 [OpenMP] Use -qarch=auto with XL compiler 2019-03-14 15:39:45 +00:00
Tom Deakin
c8098a5cc0 [OpenACC] Add KNL support 2019-03-14 09:11:16 -05:00
Tom Deakin
f1f31d2a9b [OpenACC] Add PGI compiler support for Skylake 2019-03-13 04:14:11 -05:00
Tom Deakin
db9bf78530 [OpenMP] Add PGI compiler support 2019-03-13 04:13:38 -05:00
Tom Deakin
6229b83e62 update changelog 2019-03-11 17:40:33 +00:00
Tom Deakin
8275c97050 [OpenMP] Add NEC compiler flags 2019-03-11 11:11:18 +00:00
Tom Deakin
da085f3f59 Add logo to README 2018-10-17 09:50:20 +01:00
Tom Deakin
02bcd9b762 Fix trailing comma in CSV output 2018-10-04 14:37:27 +01:00
Tom Deakin
a1f7b94820 Support CSV output for triad only running mode
Fixes #54
2018-10-04 14:36:59 +01:00
Tom Deakin
96216628bf Update CHANGELOG.md 2018-09-14 12:57:14 +01:00
Tom Deakin
a88ebd7b38 Merge pull request #51 from Ruyk/master
Minor SYCL and OpenCL fixes
2018-09-14 12:56:07 +01:00
Tom Deakin
e5d54dd521 Use parallel loop for OpenACC instead of kernels
Closes #53.
2018-07-25 15:53:50 +00:00
Tom Deakin
d9b089a0f9 Merge pull request #52 from plavin/master
Add Titan Xp numbers
2018-05-08 10:43:45 +01:00
Patrick Robert Lavin
b92c01c650 Add Titan Xp numbers 2018-05-07 11:42:11 -04:00
Ruyman Reyes
8920cfd633 Split compilation lines for SYCL Stream
The Makefile for SYCL Stream compilation is now split with
individual lines for each file. This facilitates identifying
compile time errors and avoids warnings on unused symbols.
2018-05-02 15:22:20 +01:00
Ruyman Reyes
63f32fcb51 Manually clearing the global device vector
The vector of devices is a global object, whose destruction order is
undefined. In some platforms, the OpenCL library has been unloaded
before this destructor is hit, which causes a segmentation fault after
the program ends.  By clearing the global vector of devices on
destruction of the OpenCL and SYCL Stream benchmarks we avoid the
problem.
2018-05-02 15:21:41 +01:00
Tom Deakin
54fc326097 Add mcpu=native flag to GNU OpenMP builds 2018-04-27 13:21:30 +01:00
Tom Deakin
54b8a549c1 Update CHANGELOG.md
Add notice to changelog about #49.
2018-03-19 11:07:32 +00:00
Tom Deakin
c5b64b9cad Merge pull request #49 from mangupta/fix_hip_build
Make hip-stream buildable with latest version of HIP
2018-03-19 11:06:25 +00:00
Maneesh Gupta
1dc35f0d99 [HIP] Search for hipcc in the preferred way 2018-03-19 15:34:07 +05:30