Commit Graph

637 Commits

Author SHA1 Message Date
Tom Deakin
45df09010c [Kokkos] Fix minor formatting 2018-02-14 22:17:10 +00:00
Tom Deakin
0092d23461 [Kokkos] Remove defining View layout as Kokkos does it correctly by default.
This fixes #43.
2018-02-14 22:14:47 +00:00
Tom Deakin
b93ac5d7cf [Kokkos] Rename files to match Kokkos case conventions 2018-02-14 22:05:50 +00:00
Tom Deakin
4e2450e6ac [Kokkos] Change case of implementation string define 2018-02-14 22:01:57 +00:00
Tom Deakin
3925c71851 [Kokkos] Remove global use of gobal namespace 2018-02-14 22:00:21 +00:00
Tom Deakin
1d84002cb6
Fix GitHub formatting in CHANGELOG 2018-02-07 16:54:18 +00:00
Tom Deakin
53e8f408ad
Fix GitHub formatting in CHANGELOG 2018-02-07 16:53:00 +00:00
Tom Deakin
88c8854a54 Add unreleased changes to CHANGELOG 2018-02-07 16:51:57 +00:00
Tom Deakin
710a18916c Add a Changelog file to document project changes 2018-02-07 16:46:18 +00:00
Tom Deakin
288fabc0d1 [Kokkos] Use seperate build and link in Kokkos Makefile
Fixes #40.
The Kokkos programming model examples now recommend using seperate
build and link stages.
2018-02-07 15:34:19 +00:00
Tom Deakin
72ca26ac94 Add better link to SC17 publication in README 2018-02-07 15:22:04 +00:00
Tom Deakin
4d3f1ff0b2 Add new reference to README 2018-02-07 15:17:23 +00:00
Tom Deakin
a11cb21bac Add new reference to README 2018-02-07 15:15:30 +00:00
Tom Deakin
73f122089d [OpenMP 4.5] Remove superfluous map clauses
Fixes #41.
OpenMP 4.5 does not require the map clauses on the target region
if the data has been previously defined using unstructured data
enter/exit clauses. Removing this clauses works fine with the
Clang compiler, however we noticed issues with the Cray compiler.
The issue is that the Cray compiler does not block the target
region "kernel calls" and so the timing is incorrect. This was
not noticed before due to the presence of the map clauses.
For now, we have had to add an update from clause of a scalar
value to ensure that the kenel blocks. It is hoped that we can
remove this in due course. But in the vein of showing how the
models work we want to keep the minimum required correct code
(which is without the map clause) but need the code to also
work correctly.
2018-02-07 15:05:06 +00:00
Tom Deakin
cfb89018f6 Update README with new Kokkos building instructions 2018-02-07 14:34:22 +00:00
Tom Deakin
8b83802fb1 [OpenMP 4.5] Swap map and reduction clause on dot kernel
This fixes #42. On the Clang compiler, the map clause should appear
before the reduction clause to ensure that the evaluation order
of the clauses copies the reduction value back correctly.
This makes no difference with the Cray compiler.
2018-02-07 14:16:39 +00:00
Tom Deakin
41621ac136
Merge pull request #39 from antonrv/sycl-121
SYCL implementation adapted to 1.2.1 interface
2018-01-10 14:43:10 +00:00
Anton Rey
b6d9795476 SYCL implementation adapted to 1.2.1 interface 2017-12-08 12:49:21 +00:00
Simon McIntosh-Smith
f8044a7679
Adding BabelStream cache-aware poster from SC17
Might need to update this when it's available from the SC17 website itself (not at the time of writing).
2017-11-26 11:51:05 +00:00
Tom Deakin
81b80b93f4 Add OpenACC Volta flags 2017-11-10 15:33:22 +00:00
Tom Deakin
87eb4361b4 Version bump 2017-08-02 16:35:40 +01:00
Tom Deakin
8a39185331 Merge pull request #37 from UoB-HPC/triad-only
Implement --triad-only switch
2017-08-02 16:35:00 +01:00
James Price
6a2da4c862 Implement --triad-only switch 2017-08-02 15:43:56 +01:00
James Price
18c20efbfe Merge pull request #36 from wyaneva/master
Updated the Makefile for SYCL
2017-08-01 11:22:54 +01:00
Vanya Yaneva
b8f7a5427e Added exception after printing the SYCL exceptions 2017-07-31 17:44:58 +01:00
Tom Deakin
5ad8341b39 Merge pull request #35 from psteinb/adding_csv_output
Adding csv output
2017-07-31 15:03:00 +01:00
Peter Steinbach
01d4eea7b7 removed obsolete spaces 2017-07-31 14:52:18 +02:00
Tom Deakin
b49c8dab34 Merge pull request #32 from psteinb/rocm_hc_support
Rocm hc support
2017-07-31 13:50:08 +01:00
Peter Steinbach
f9ffa712cf removed doublicate spaces 2017-07-31 14:46:50 +02:00
Peter Steinbach
df6fff1d2e added missing space for consistency 2017-07-31 14:30:08 +02:00
Peter Steinbach
2dbb693761 renamed nreps to be more consistent with the naming scheme 2017-07-31 14:23:39 +02:00
Peter Steinbach
3b9052664f enable propagation of preprocessor macros from CLI to compiler command 2017-07-31 14:21:16 +02:00
Peter Steinbach
8509917dff refactored n_tiles into preprocessor macro 2017-07-31 14:20:59 +02:00
Peter Steinbach
7ed0308cb7 code formatting fixed 2017-07-31 14:14:52 +02:00
Peter Steinbach
2415bdc7c0 fixed if-clause formatting 2017-07-31 14:00:44 +02:00
Vanya Yaneva
6d7e4b5f3f Changed name of sycl make var to match the sdk 2017-07-27 17:55:56 +01:00
Vanya Yaneva
9916a81bc5 Small formatting change 2017-07-27 17:39:13 +01:00
Vanya Yaneva
8c4af581d1 Reverted changes in kernel build 2017-07-27 17:36:12 +01:00
Peter Steinbach
7911e6a0ae fixed compilation error due to unpropagated typo fix 2017-07-26 17:28:41 +02:00
Peter Steinbach
add9973b67 fixed typo 2017-07-26 17:21:17 +02:00
Peter Steinbach
99fad100c6 added csv-output-sentinals and output 2017-07-26 14:22:24 +02:00
Peter Steinbach
ee8ab08eaf added csv flag 2017-07-26 14:02:32 +02:00
Peter Steinbach
26279688d1 Merge branch 'master' of https://github.com/UoB-HPC/BabelStream into rocm_hc_support 2017-07-25 17:05:31 +02:00
Peter Steinbach
6712e26035 added ifndef for virtual tile size (default value found empirically on R9 Nano card) 2017-07-25 17:05:22 +02:00
Peter Steinbach
78a4f0cb73 upon call of HC.make, the size of the virtual size can be set 2017-07-25 17:04:51 +02:00
Vanya Yaneva
05fc803858 Updated SYCL makefile and kernel build 2017-07-25 13:49:08 +01:00
Tom Deakin
3bd65a0716 Merge branch 'master' into cuda-memory 2017-05-11 11:28:33 +01:00
Peter Steinbach
bfe8996775 switched to using hcc-config directly 2017-05-04 09:30:51 +02:00
Peter Steinbach
0a8176e273 bumped to rocm 1.5 2017-05-04 09:22:10 +02:00
Peter Steinbach
c0559c2d86 fixed copy and paste error in error messages 2017-04-18 15:03:24 +02:00