Commit Graph

68 Commits

Author SHA1 Message Date
Tom Deakin
6185d3aca6 Use long double for check solution in case of very large problem sizes 2022-04-01 10:51:56 +01:00
Tom Deakin
5645b0290d update changelog 2022-02-16 14:36:20 +00:00
Tom Deakin
b0f9c179ea version bump changelog 2021-12-22 13:03:40 +00:00
Tom Deakin
06e46908e7 Merge branch 'main' into sycl-2020 2021-12-09 11:22:49 +00:00
Tom Deakin
ed960d88a3
Update CHANGELOG.md 2021-12-02 10:41:55 +00:00
Tom Deakin
fa6433bab1 update changelog 2021-06-25 09:45:38 -05:00
Tom Deakin
25e021caa3
Update CHANGELOG.md 2021-06-03 16:08:14 +01:00
Tom Deakin
2ab68ab39e Update changelog 2021-05-18 16:47:56 +01:00
Tom Deakin
b772d00fe4
Revert "Add check for FP64 support" 2021-05-18 16:44:06 +01:00
Tom Deakin
6581ee63b8 Update changelog 2021-05-17 15:33:54 +01:00
Tom Lin
2eca3974e6 Disable CI for RAJA on gcc-10+CUDA due to ICE
Update changelog to include RAJA 0.13.x
2021-04-21 16:28:12 +01:00
Tom Deakin
6ecd10bf98 Deprecate HC 2021-04-20 16:16:56 +01:00
Tom Deakin
a94d678b34
Update CHANGELOG.md 2021-04-08 17:06:43 +01:00
Tom Deakin
f7dc20b875 Merge branch 'sycl-2020' of github.com:uob-hpc/babelstream into sycl-2020
Conflicts:
	CHANGELOG.md
2021-02-22 15:20:38 +00:00
Tom Deakin
13c9e0c1c7 update changelog 2021-02-18 13:48:45 +00:00
Tom Deakin
9715ffff27 Merge branch 'main' into nstream
Conflicts:
	CHANGELOG.md
	main.cpp
2021-02-18 13:20:36 +00:00
Tom Deakin
ba47571ab2 update changelog 2021-02-18 12:37:58 +00:00
Tom Deakin
46bbab6ebb Revert "Update initial starting values"
This reverts commit 44e74b574b.
2021-02-18 11:06:14 +00:00
Tom Deakin
707bc5d0bf Update SYCL version to SYCL 2020 2021-02-09 10:31:18 +00:00
Tom Deakin
44e74b574b Update initial starting values 2021-02-03 11:21:56 +00:00
Tom Deakin
579247dc06 Normalise sum result to mitigate errors with large iteration counts 2021-02-03 10:16:13 +00:00
Tom Deakin
210cfb7520 Revert "Update initial array values to ensure dot product works with the nstream kernel"
This reverts commit 5346e1226d.

Conflicts:
	CHANGELOG.md
2021-02-03 10:14:58 +00:00
Tom Deakin
fdac285110 Merge branch 'main' into nstream
Conflicts:
	CHANGELOG.md
2021-02-02 15:46:39 +00:00
Tom Deakin
018d8a4510 [OpenCL] Remove dot kernel object in deconstructor 2021-02-02 15:45:54 +00:00
Tom Deakin
84406024cf Update CHANGELOG 2021-02-02 11:28:33 +00:00
Tom Deakin
5346e1226d Update initial array values to ensure dot product works with the nstream kernel 2021-02-02 11:27:54 +00:00
Tom Deakin
20c3284629 Update CHANGELOG with signed int change 2021-01-12 10:23:21 +00:00
Tom Deakin
9c211bca96 Update changelog for CUDA memory mode 2020-12-07 15:13:06 +00:00
Tom Deakin
e8fb3a6be4 Add C++20 version using for_each_n and range factories
Closes #85
2020-12-07 14:55:54 +00:00
Tom Deakin
ffa221fd35 Fix OpenMP Clang NVIDIA Target flags (missing sm architecture) with new NVARCH option
Example usage:
make -f OpenMP.make COMPILER=CLANG TARGET=NVIDIA NVARCH=sm_61

Fixes #61
2020-12-07 12:23:11 +00:00
Tom Deakin
829aa15da0 Allocate driver solution check vectors *after* the main computation
Each Stream implementation owns its own data, so the driver code
shouldn't allocate a large array just before. On processors with
strong NUMA effects and smaller memory capacities per NUMA domain,
these checking vectors can result in the main arrays being
allocated in the wrong NUMA domain.

The fix is to simply move the driver allocation until after the
computation has finished and we want to check the answers.

This commit only changes the driver; each model will be updated
in subsequent commits.

Fixes #80.
2020-12-07 10:39:37 +00:00
Tom Deakin
f373927ce8 Rename branch name 2020-12-07 10:23:27 +00:00
Gonzalo Brito Gadeschi
0855805ce2 Add NVIDIA HPC SDK C++ parallel STL implementation
This commits adds an implementation using the C++ parallel STL.
The Makefile uses the NVIDIA HPC SDK `nvc++` compiler with the `-stdpar` flag.
Tested using the NVIDIA HPC SDK 20.9.
2020-11-23 03:08:44 -08:00
Tom Deakin
5182342403
Update CHANGELOG.md 2020-10-26 09:58:57 +00:00
Tom Deakin
0ff841bbf5
Update CHANGELOG.md 2020-08-07 12:29:28 +01:00
Tom Deakin
8ece4079fd
Update CHANGELOG.md 2020-07-14 14:03:04 +01:00
Tom Deakin
6c57b6305e
Update CHANGELOG.md
Summarise move of build system to Kokkos 3.
2020-07-13 09:35:55 +01:00
Tom Deakin
64617c6dee Update OpenMP Cray flags
Fixes #68
2020-07-10 13:28:23 +01:00
Tom Deakin
5d0ee99de6 Remove Cray flags for OpenACC following removal of support in latest compiler 2020-07-10 13:27:21 +01:00
Andrew Stubbs
09271eda17 Add GNU OpenACC support for AMD GCN
Autodetect the device type, rather than hard-code NVidia.

Add GNU command line options to the makefile, and adjust the "restrict"
extension usage. For now, we assume the toolchain is only configured for one
accelerator.
2020-05-21 20:54:04 +01:00
Tom Deakin
b792c422f7 [OpenMP] Add build flags for OpenMP offload to AMD and NVIDIA with GCC 10.1
Closes #65
2020-05-12 11:24:29 +01:00
Tom Deakin
0919d95aa4 [SYCL] Use SYCL runtime device discovery
Fixes #63
2020-05-11 17:16:47 +01:00
Tom Deakin
1d6da069b3 [SYCL] Pass explicit async_handler to queue constructor 2020-05-11 17:13:36 +01:00
Tom Deakin
1bc4395f48 Update local copy of OpenCL C++ header file.
This closes #62
2020-03-16 16:43:55 +00:00
Tom Deakin
8776901733 [SYCL] Use the cl::sycl::id parameter in the parallel_for kernels
The cl::sycl::item provides extra features for extracing global/local
ids which aren't required by the kernels.
This also means the kernels don't need to extract the id from the item.
2019-11-01 15:19:01 +00:00
Tom Deakin
63cc964847 Update CHANGELOG with updates from #58 2019-06-26 12:06:06 +01:00
Tom Deakin
289a2c204c Version bump 2019-04-10 14:12:00 +01:00
Tom Deakin
dd6f3af98b Update changelog 2019-04-10 14:06:50 +01:00
Tom Deakin
db2a4c40d8 [OpenACC] Add PGI support for Power 9 2019-03-14 15:56:51 +00:00
Tom Deakin
7ec2108896 [OpenMP] Use -qarch=auto with XL compiler 2019-03-14 15:39:45 +00:00