Commit Graph

862 Commits

Author SHA1 Message Date
Tom Deakin
5ad8341b39 Merge pull request #35 from psteinb/adding_csv_output
Adding csv output
2017-07-31 15:03:00 +01:00
Peter Steinbach
01d4eea7b7 removed obsolete spaces 2017-07-31 14:52:18 +02:00
Tom Deakin
b49c8dab34 Merge pull request #32 from psteinb/rocm_hc_support
Rocm hc support
2017-07-31 13:50:08 +01:00
Peter Steinbach
f9ffa712cf removed doublicate spaces 2017-07-31 14:46:50 +02:00
Peter Steinbach
df6fff1d2e added missing space for consistency 2017-07-31 14:30:08 +02:00
Peter Steinbach
2dbb693761 renamed nreps to be more consistent with the naming scheme 2017-07-31 14:23:39 +02:00
Peter Steinbach
3b9052664f enable propagation of preprocessor macros from CLI to compiler command 2017-07-31 14:21:16 +02:00
Peter Steinbach
8509917dff refactored n_tiles into preprocessor macro 2017-07-31 14:20:59 +02:00
Peter Steinbach
7ed0308cb7 code formatting fixed 2017-07-31 14:14:52 +02:00
Peter Steinbach
2415bdc7c0 fixed if-clause formatting 2017-07-31 14:00:44 +02:00
Vanya Yaneva
6d7e4b5f3f Changed name of sycl make var to match the sdk 2017-07-27 17:55:56 +01:00
Vanya Yaneva
9916a81bc5 Small formatting change 2017-07-27 17:39:13 +01:00
Vanya Yaneva
8c4af581d1 Reverted changes in kernel build 2017-07-27 17:36:12 +01:00
Peter Steinbach
7911e6a0ae fixed compilation error due to unpropagated typo fix 2017-07-26 17:28:41 +02:00
Peter Steinbach
add9973b67 fixed typo 2017-07-26 17:21:17 +02:00
Peter Steinbach
99fad100c6 added csv-output-sentinals and output 2017-07-26 14:22:24 +02:00
Peter Steinbach
ee8ab08eaf added csv flag 2017-07-26 14:02:32 +02:00
Peter Steinbach
26279688d1 Merge branch 'master' of https://github.com/UoB-HPC/BabelStream into rocm_hc_support 2017-07-25 17:05:31 +02:00
Peter Steinbach
6712e26035 added ifndef for virtual tile size (default value found empirically on R9 Nano card) 2017-07-25 17:05:22 +02:00
Peter Steinbach
78a4f0cb73 upon call of HC.make, the size of the virtual size can be set 2017-07-25 17:04:51 +02:00
Vanya Yaneva
05fc803858 Updated SYCL makefile and kernel build 2017-07-25 13:49:08 +01:00
Tom Deakin
3bd65a0716 Merge branch 'master' into cuda-memory 2017-05-11 11:28:33 +01:00
Peter Steinbach
bfe8996775 switched to using hcc-config directly 2017-05-04 09:30:51 +02:00
Peter Steinbach
0a8176e273 bumped to rocm 1.5 2017-05-04 09:22:10 +02:00
Peter Steinbach
c0559c2d86 fixed copy and paste error in error messages 2017-04-18 15:03:24 +02:00
Peter Steinbach
a971591cc9 replaced operator[] by .at method to trigger an exception in case of out-of-range errors 2017-04-18 15:00:29 +02:00
Peter Steinbach
f6d6874c04 dropped obsolete comments 2017-04-18 14:58:34 +02:00
Peter Steinbach
b1fc309f3a removed obsolete comment 2017-04-18 14:57:35 +02:00
Tom Deakin
dafc63030f Rename to BabelStream 2017-04-08 12:16:29 +01:00
Tom Deakin
9c08fdd184 Minor version bump 2017-04-06 10:38:48 +01:00
Tom Deakin
50e3a1970f Add RAJA CUDA build instructions 2017-04-06 10:38:03 +01:00
Tom Deakin
c534600d04 [RAJA] Use Index_type for iterator index type instead of hardcoding int 2017-04-06 10:36:01 +01:00
Tom Deakin
3331f62f42 Add RAJA build instructions to README 2017-04-06 10:16:34 +01:00
Tom Deakin
5f9b288570 [RAJA] Declare pointers using RAJA_RESTRICT 2017-04-06 10:15:11 +01:00
Tom Deakin
1bd4adfe7b [RAJA] Align the memory to 2MB pages 2017-04-06 10:14:51 +01:00
Tom Deakin
1eb75f034a [RAJA] Use xHost and streaming stores with the Intel compiler 2017-04-06 10:02:25 +01:00
Tom Deakin
d7a93be739 [Kokkos] Add a COMPILER option to Makefile, which turns on streaming stores for Intel 2017-04-05 22:23:27 +01:00
Tom Deakin
d7e38c1ca9 Add Kokkos build instructions to README 2017-04-05 22:09:58 +01:00
Tom Deakin
d9dfc3f552 [Kokkos] Use long for iterator variable 2017-04-05 21:57:55 +01:00
Peter Steinbach
04589d4d4f added fixed bug in dot product 2017-04-03 14:16:25 +02:00
Peter Steinbach
fd35d895d9 added optimized flags to CXXFLAGS 2017-04-03 14:16:06 +02:00
Peter Steinbach
55f467e24d moved experimental dot product implementation of dot_impl which is build only if -DHC_DEVELOP is given 2017-03-27 14:22:56 +02:00
Peter Steinbach
2882383324 Merge remote-tracking branch 'upstream/master' into rocm_hc_support 2017-03-24 15:46:41 +01:00
Peter Steinbach
0e45f86588 added cascaded reduction based on C++AMP book 2017-03-24 15:19:48 +01:00
Peter Steinbach
96bc566ce1 added debug flag 2017-03-24 15:19:22 +01:00
Peter Steinbach
0535cbcd5b renamed variables and introduced views 2017-03-23 15:55:23 +01:00
Tom Deakin
bf57cf578d [CUDA] Merge pull request #28 from psteinb/extra_cuda_make_variable
Allow specifying compiler and flags for build
2017-03-17 14:22:37 +00:00
Peter Steinbach
d8cb7494e0 pulled -O3 out into CXXFLAGS, refactored CUDA compiler into CUDA_CXX
make variable to cope with clang as CUDA compiler as well
2017-03-17 15:18:13 +01:00
James Price
703eb945a2 [OpenMP] Align memory (2MB by default) 2017-03-13 17:17:20 +00:00
James Price
4f288ddc3d [OpenMP] Add -qopt-streaming-stores for Intel 2017-03-13 17:15:10 +00:00