776 Commits

Author SHA1 Message Date
Tom Deakin
4d3f1ff0b2 Add new reference to README 2018-02-07 15:17:23 +00:00
Tom Deakin
a11cb21bac Add new reference to README 2018-02-07 15:15:30 +00:00
Tom Deakin
73f122089d [OpenMP 4.5] Remove superfluous map clauses
Fixes #41.
OpenMP 4.5 does not require the map clauses on the target region
if the data has been previously defined using unstructured data
enter/exit clauses. Removing these clauses works fine with the
Clang compiler, however we noticed issues with the Cray compiler.
The issue is that the Cray compiler does not block the target
region "kernel calls" and so the timing is incorrect. This was
not noticed before due to the presence of the map clauses.
For now, we have had to add an update from clause of a scalar
value to ensure that the kernel blocks. It is hoped that we can
remove this in due course. But in the vein of showing how the
models work we want to keep the minimum required correct code
(which is without the map clause) but need the code to also
work correctly.
2018-02-07 15:05:06 +00:00
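The pattern described in commit 73f122089d can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the function name `triad`, its parameters, and the choice of a one-element array section for the blocking update are assumptions standing in for the "scalar value" the message mentions. The arrays are mapped once with unstructured enter/exit data directives, so the target region carries no map clauses of its own.

```cpp
// Hypothetical triad-style kernel: a[i] = b[i] + scalar * c[i].
void triad(double* a, const double* b, const double* c, int n, double scalar)
{
  // Unstructured data mapping: the arrays stay on the device until
  // the matching exit data directive below.
  #pragma omp target enter data map(alloc: a[0:n]) map(to: b[0:n], c[0:n])

  // No map clauses here: the pointers refer to data that is already present.
  #pragma omp target teams distribute parallel for
  for (int i = 0; i < n; i++)
    a[i] = b[i] + scalar * c[i];

  // Assumed form of the workaround: a tiny device-to-host update that
  // forces the host to wait for the kernel before any timer is stopped.
  #pragma omp target update from(a[0:1])

  // Copy the result back and release the device copies.
  #pragma omp target exit data map(from: a[0:n]) map(release: b[0:n], c[0:n])
}
```

Built without OpenMP support the pragmas are ignored and the loop runs serially on the host, which is enough to check the arithmetic but not the blocking behaviour.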
Tom Deakin
cfb89018f6 Update README with new Kokkos building instructions 2018-02-07 14:34:22 +00:00
Tom Deakin
8b83802fb1 [OpenMP 4.5] Swap map and reduction clause on dot kernel
This fixes #42. On the Clang compiler, the map clause should appear
before the reduction clause to ensure that the evaluation order
of the clauses copies the reduction value back correctly.
This makes no difference with the Cray compiler.
2018-02-07 14:16:39 +00:00
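The clause ordering from commit 8b83802fb1 might look like the sketch below. The function name `dot` and its parameters are hypothetical, and the `map(to: ...)` clauses for the arrays are added only so the example is self-contained; the point being illustrated is that `map(tofrom: sum)` is written before `reduction(+:sum)`.

```cpp
// Hypothetical dot-product kernel returning the sum of a[i] * b[i].
double dot(const double* a, const double* b, int n)
{
  double sum = 0.0;

  // The map clause for sum precedes the reduction clause, as the commit
  // message says Clang needs for the reduced value to be copied back.
  #pragma omp target teams distribute parallel for \
      map(to: a[0:n], b[0:n]) map(tofrom: sum) reduction(+:sum)
  for (int i = 0; i < n; i++)
    sum += a[i] * b[i];

  return sum;
}
```

For example, calling `dot` on {1, 2, 3} and {4, 5, 6} with n = 3 computes 1*4 + 2*5 + 3*6.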
Tom Deakin
41621ac136
Merge pull request #39 from antonrv/sycl-121
SYCL implementation adapted to 1.2.1 interface
2018-01-10 14:43:10 +00:00
Anton Rey
b6d9795476 SYCL implementation adapted to 1.2.1 interface 2017-12-08 12:49:21 +00:00
Simon McIntosh-Smith
f8044a7679
Adding BabelStream cache-aware poster from SC17
Might need to update this when it's available from the SC17 website itself (not at the time of writing).
2017-11-26 11:51:05 +00:00
Tom Deakin
81b80b93f4 Add OpenACC Volta flags 2017-11-10 15:33:22 +00:00
Tom Deakin
87eb4361b4 Version bump 2017-08-02 16:35:40 +01:00
Tom Deakin
8a39185331 Merge pull request #37 from UoB-HPC/triad-only
Implement --triad-only switch
2017-08-02 16:35:00 +01:00
James Price
6a2da4c862 Implement --triad-only switch 2017-08-02 15:43:56 +01:00
James Price
18c20efbfe Merge pull request #36 from wyaneva/master
Updated the Makefile for SYCL
2017-08-01 11:22:54 +01:00
Vanya Yaneva
b8f7a5427e Added exception after printing the SYCL exceptions 2017-07-31 17:44:58 +01:00
Tom Deakin
5ad8341b39 Merge pull request #35 from psteinb/adding_csv_output
Adding csv output
2017-07-31 15:03:00 +01:00
Peter Steinbach
01d4eea7b7 removed obsolete spaces 2017-07-31 14:52:18 +02:00
Tom Deakin
b49c8dab34 Merge pull request #32 from psteinb/rocm_hc_support
Rocm hc support
2017-07-31 13:50:08 +01:00
Peter Steinbach
f9ffa712cf removed duplicate spaces 2017-07-31 14:46:50 +02:00
Peter Steinbach
df6fff1d2e added missing space for consistency 2017-07-31 14:30:08 +02:00
Peter Steinbach
2dbb693761 renamed nreps to be more consistent with the naming scheme 2017-07-31 14:23:39 +02:00
Peter Steinbach
3b9052664f enable propagation of preprocessor macros from CLI to compiler command 2017-07-31 14:21:16 +02:00
Peter Steinbach
8509917dff refactored n_tiles into preprocessor macro 2017-07-31 14:20:59 +02:00
Peter Steinbach
7ed0308cb7 code formatting fixed 2017-07-31 14:14:52 +02:00
Peter Steinbach
2415bdc7c0 fixed if-clause formatting 2017-07-31 14:00:44 +02:00
Vanya Yaneva
6d7e4b5f3f Changed name of sycl make var to match the sdk 2017-07-27 17:55:56 +01:00
Vanya Yaneva
9916a81bc5 Small formatting change 2017-07-27 17:39:13 +01:00
Vanya Yaneva
8c4af581d1 Reverted changes in kernel build 2017-07-27 17:36:12 +01:00
Peter Steinbach
7911e6a0ae fixed compilation error due to unpropagated typo fix 2017-07-26 17:28:41 +02:00
Peter Steinbach
add9973b67 fixed typo 2017-07-26 17:21:17 +02:00
Peter Steinbach
99fad100c6 added csv-output-sentinels and output 2017-07-26 14:22:24 +02:00
Peter Steinbach
ee8ab08eaf added csv flag 2017-07-26 14:02:32 +02:00
Peter Steinbach
26279688d1 Merge branch 'master' of https://github.com/UoB-HPC/BabelStream into rocm_hc_support 2017-07-25 17:05:31 +02:00
Peter Steinbach
6712e26035 added ifndef for virtual tile size (default value found empirically on R9 Nano card) 2017-07-25 17:05:22 +02:00
Peter Steinbach
78a4f0cb73 upon call of HC.make, the size of the virtual size can be set 2017-07-25 17:04:51 +02:00
Vanya Yaneva
05fc803858 Updated SYCL makefile and kernel build 2017-07-25 13:49:08 +01:00
Tom Deakin
3bd65a0716 Merge branch 'master' into cuda-memory 2017-05-11 11:28:33 +01:00
Peter Steinbach
bfe8996775 switched to using hcc-config directly 2017-05-04 09:30:51 +02:00
Peter Steinbach
0a8176e273 bumped to rocm 1.5 2017-05-04 09:22:10 +02:00
Peter Steinbach
c0559c2d86 fixed copy and paste error in error messages 2017-04-18 15:03:24 +02:00
Peter Steinbach
a971591cc9 replaced operator[] by .at method to trigger an exception in case of out-of-range errors 2017-04-18 15:00:29 +02:00
Peter Steinbach
f6d6874c04 dropped obsolete comments 2017-04-18 14:58:34 +02:00
Peter Steinbach
b1fc309f3a removed obsolete comment 2017-04-18 14:57:35 +02:00
Tom Deakin
dafc63030f Rename to BabelStream 2017-04-08 12:16:29 +01:00
Tom Deakin
9c08fdd184 Minor version bump 2017-04-06 10:38:48 +01:00
Tom Deakin
50e3a1970f Add RAJA CUDA build instructions 2017-04-06 10:38:03 +01:00
Tom Deakin
c534600d04 [RAJA] Use Index_type for iterator index type instead of hardcoding int 2017-04-06 10:36:01 +01:00
Tom Deakin
3331f62f42 Add RAJA build instructions to README 2017-04-06 10:16:34 +01:00
Tom Deakin
5f9b288570 [RAJA] Declare pointers using RAJA_RESTRICT 2017-04-06 10:15:11 +01:00
Tom Deakin
1bd4adfe7b [RAJA] Align the memory to 2MB pages 2017-04-06 10:14:51 +01:00
Tom Deakin
1eb75f034a [RAJA] Use xHost and streaming stores with the Intel compiler 2017-04-06 10:02:25 +01:00