Tom Deakin
8b83802fb1
[OpenMP 4.5] Swap map and reduction clause on dot kernel
...
This fixes #42. With the Clang compiler, the map clause should appear
before the reduction clause to ensure that the evaluation order
of the clauses copies the reduction value back correctly.
This makes no difference with the Cray compiler.
2018-02-07 14:16:39 +00:00
Tom Deakin
41621ac136
Merge pull request #39 from antonrv/sycl-121
...
SYCL implementation adapted to 1.2.1 interface
2018-01-10 14:43:10 +00:00
Anton Rey
b6d9795476
SYCL implementation adapted to 1.2.1 interface
2017-12-08 12:49:21 +00:00
Simon McIntosh-Smith
f8044a7679
Adding BabelStream cache-aware poster from SC17
...
Might need to update this when it's available from the SC17 website itself (it was not available there at the time of writing).
2017-11-26 11:51:05 +00:00
Tom Deakin
81b80b93f4
Add OpenACC Volta flags
2017-11-10 15:33:22 +00:00
Tom Deakin
87eb4361b4
Version bump
2017-08-02 16:35:40 +01:00
Tom Deakin
8a39185331
Merge pull request #37 from UoB-HPC/triad-only
...
Implement --triad-only switch
2017-08-02 16:35:00 +01:00
James Price
6a2da4c862
Implement --triad-only switch
2017-08-02 15:43:56 +01:00
James Price
18c20efbfe
Merge pull request #36 from wyaneva/master
...
Updated the Makefile for SYCL
2017-08-01 11:22:54 +01:00
Vanya Yaneva
b8f7a5427e
Added exception after printing the SYCL exceptions
2017-07-31 17:44:58 +01:00
Tom Deakin
5ad8341b39
Merge pull request #35 from psteinb/adding_csv_output
...
Adding csv output
2017-07-31 15:03:00 +01:00
Peter Steinbach
01d4eea7b7
removed obsolete spaces
2017-07-31 14:52:18 +02:00
Tom Deakin
b49c8dab34
Merge pull request #32 from psteinb/rocm_hc_support
...
Rocm hc support
2017-07-31 13:50:08 +01:00
Peter Steinbach
f9ffa712cf
removed duplicate spaces
2017-07-31 14:46:50 +02:00
Peter Steinbach
df6fff1d2e
added missing space for consistency
2017-07-31 14:30:08 +02:00
Peter Steinbach
2dbb693761
renamed nreps to be more consistent with the naming scheme
2017-07-31 14:23:39 +02:00
Peter Steinbach
3b9052664f
enable propagation of preprocessor macros from CLI to compiler command
2017-07-31 14:21:16 +02:00
Peter Steinbach
8509917dff
refactored n_tiles into preprocessor macro
2017-07-31 14:20:59 +02:00
Peter Steinbach
7ed0308cb7
code formatting fixed
2017-07-31 14:14:52 +02:00
Peter Steinbach
2415bdc7c0
fixed if-clause formatting
2017-07-31 14:00:44 +02:00
Vanya Yaneva
6d7e4b5f3f
Changed name of sycl make var to match the sdk
2017-07-27 17:55:56 +01:00
Vanya Yaneva
9916a81bc5
Small formatting change
2017-07-27 17:39:13 +01:00
Vanya Yaneva
8c4af581d1
Reverted changes in kernel build
2017-07-27 17:36:12 +01:00
Peter Steinbach
7911e6a0ae
fixed compilation error due to unpropagated typo fix
2017-07-26 17:28:41 +02:00
Peter Steinbach
add9973b67
fixed typo
2017-07-26 17:21:17 +02:00
Peter Steinbach
99fad100c6
added csv-output-sentinels and output
2017-07-26 14:22:24 +02:00
Peter Steinbach
ee8ab08eaf
added csv flag
2017-07-26 14:02:32 +02:00
Peter Steinbach
26279688d1
Merge branch 'master' of https://github.com/UoB-HPC/BabelStream into rocm_hc_support
2017-07-25 17:05:31 +02:00
Peter Steinbach
6712e26035
added ifndef for virtual tile size (default value found empirically on R9 Nano card)
2017-07-25 17:05:22 +02:00
Peter Steinbach
78a4f0cb73
the virtual tile size can be set when calling HC.make
2017-07-25 17:04:51 +02:00
Vanya Yaneva
05fc803858
Updated SYCL makefile and kernel build
2017-07-25 13:49:08 +01:00
Tom Deakin
3bd65a0716
Merge branch 'master' into cuda-memory
2017-05-11 11:28:33 +01:00
Peter Steinbach
bfe8996775
switched to using hcc-config directly
2017-05-04 09:30:51 +02:00
Peter Steinbach
0a8176e273
bumped to rocm 1.5
2017-05-04 09:22:10 +02:00
Peter Steinbach
c0559c2d86
fixed copy and paste error in error messages
2017-04-18 15:03:24 +02:00
Peter Steinbach
a971591cc9
replaced operator[] by .at method to trigger an exception in case of out-of-range errors
2017-04-18 15:00:29 +02:00
Peter Steinbach
f6d6874c04
dropped obsolete comments
2017-04-18 14:58:34 +02:00
Peter Steinbach
b1fc309f3a
removed obsolete comment
2017-04-18 14:57:35 +02:00
Tom Deakin
dafc63030f
Rename to BabelStream
2017-04-08 12:16:29 +01:00
Tom Deakin
9c08fdd184
Minor version bump
2017-04-06 10:38:48 +01:00
Tom Deakin
50e3a1970f
Add RAJA CUDA build instructions
2017-04-06 10:38:03 +01:00
Tom Deakin
c534600d04
[RAJA] Use Index_type for iterator index type instead of hardcoding int
2017-04-06 10:36:01 +01:00
Tom Deakin
3331f62f42
Add RAJA build instructions to README
2017-04-06 10:16:34 +01:00
Tom Deakin
5f9b288570
[RAJA] Declare pointers using RAJA_RESTRICT
2017-04-06 10:15:11 +01:00
Tom Deakin
1bd4adfe7b
[RAJA] Align the memory to 2MB pages
2017-04-06 10:14:51 +01:00
Tom Deakin
1eb75f034a
[RAJA] Use xHost and streaming stores with the Intel compiler
2017-04-06 10:02:25 +01:00
Tom Deakin
d7a93be739
[Kokkos] Add a COMPILER option to Makefile, which turns on streaming stores for Intel
2017-04-05 22:23:27 +01:00
Tom Deakin
d7e38c1ca9
Add Kokkos build instructions to README
2017-04-05 22:09:58 +01:00
Tom Deakin
d9dfc3f552
[Kokkos] Use long for iterator variable
2017-04-05 21:57:55 +01:00
Peter Steinbach
04589d4d4f
fixed bug in dot product
2017-04-03 14:16:25 +02:00