Peter Steinbach
|
f9ffa712cf
|
removed duplicate spaces
|
2017-07-31 14:46:50 +02:00 |
|
Peter Steinbach
|
df6fff1d2e
|
added missing space for consistency
|
2017-07-31 14:30:08 +02:00 |
|
Peter Steinbach
|
2dbb693761
|
renamed nreps to be more consistent with the naming scheme
|
2017-07-31 14:23:39 +02:00 |
|
Peter Steinbach
|
3b9052664f
|
enable propagation of preprocessor macros from CLI to compiler command
|
2017-07-31 14:21:16 +02:00 |
|
Peter Steinbach
|
8509917dff
|
refactored n_tiles into preprocessor macro
|
2017-07-31 14:20:59 +02:00 |
|
Peter Steinbach
|
7ed0308cb7
|
code formatting fixed
|
2017-07-31 14:14:52 +02:00 |
|
Peter Steinbach
|
2415bdc7c0
|
fixed if-clause formatting
|
2017-07-31 14:00:44 +02:00 |
|
Vanya Yaneva
|
6d7e4b5f3f
|
Changed name of sycl make var to match the sdk
|
2017-07-27 17:55:56 +01:00 |
|
Vanya Yaneva
|
9916a81bc5
|
Small formatting change
|
2017-07-27 17:39:13 +01:00 |
|
Vanya Yaneva
|
8c4af581d1
|
Reverted changes in kernel build
|
2017-07-27 17:36:12 +01:00 |
|
Peter Steinbach
|
7911e6a0ae
|
fixed compilation error due to unpropagated typo fix
|
2017-07-26 17:28:41 +02:00 |
|
Peter Steinbach
|
add9973b67
|
fixed typo
|
2017-07-26 17:21:17 +02:00 |
|
Peter Steinbach
|
99fad100c6
|
added csv-output-sentinels and output
|
2017-07-26 14:22:24 +02:00 |
|
Peter Steinbach
|
ee8ab08eaf
|
added csv flag
|
2017-07-26 14:02:32 +02:00 |
|
Peter Steinbach
|
26279688d1
|
Merge branch 'master' of https://github.com/UoB-HPC/BabelStream into rocm_hc_support
|
2017-07-25 17:05:31 +02:00 |
|
Peter Steinbach
|
6712e26035
|
added ifndef for virtual tile size (default value found empirically on R9 Nano card)
|
2017-07-25 17:05:22 +02:00 |
|
Peter Steinbach
|
78a4f0cb73
|
upon call of HC.make, the virtual tile size can be set
|
2017-07-25 17:04:51 +02:00 |
|
Vanya Yaneva
|
05fc803858
|
Updated SYCL makefile and kernel build
|
2017-07-25 13:49:08 +01:00 |
|
Peter Steinbach
|
bfe8996775
|
switched to using hcc-config directly
|
2017-05-04 09:30:51 +02:00 |
|
Peter Steinbach
|
0a8176e273
|
bumped to rocm 1.5
|
2017-05-04 09:22:10 +02:00 |
|
Peter Steinbach
|
c0559c2d86
|
fixed copy and paste error in error messages
|
2017-04-18 15:03:24 +02:00 |
|
Peter Steinbach
|
a971591cc9
|
replaced operator[] by .at method to trigger an exception in case of out-of-range errors
|
2017-04-18 15:00:29 +02:00 |
|
Peter Steinbach
|
f6d6874c04
|
dropped obsolete comments
|
2017-04-18 14:58:34 +02:00 |
|
Peter Steinbach
|
b1fc309f3a
|
removed obsolete comment
|
2017-04-18 14:57:35 +02:00 |
|
Tom Deakin
|
dafc63030f
|
Rename to BabelStream
|
2017-04-08 12:16:29 +01:00 |
|
Tom Deakin
|
9c08fdd184
|
Minor version bump
|
2017-04-06 10:38:48 +01:00 |
|
Tom Deakin
|
50e3a1970f
|
Add RAJA CUDA build instructions
|
2017-04-06 10:38:03 +01:00 |
|
Tom Deakin
|
c534600d04
|
[RAJA] Use Index_type for iterator index type instead of hardcoding int
|
2017-04-06 10:36:01 +01:00 |
|
Tom Deakin
|
3331f62f42
|
Add RAJA build instructions to README
|
2017-04-06 10:16:34 +01:00 |
|
Tom Deakin
|
5f9b288570
|
[RAJA] Declare pointers using RAJA_RESTRICT
|
2017-04-06 10:15:11 +01:00 |
|
Tom Deakin
|
1bd4adfe7b
|
[RAJA] Align the memory to 2MB pages
|
2017-04-06 10:14:51 +01:00 |
|
Tom Deakin
|
1eb75f034a
|
[RAJA] Use xHost and streaming stores with the Intel compiler
|
2017-04-06 10:02:25 +01:00 |
|
Tom Deakin
|
d7a93be739
|
[Kokkos] Add a COMPILER option to Makefile, which turns on streaming stores for Intel
|
2017-04-05 22:23:27 +01:00 |
|
Tom Deakin
|
d7e38c1ca9
|
Add Kokkos build instructions to README
|
2017-04-05 22:09:58 +01:00 |
|
Tom Deakin
|
d9dfc3f552
|
[Kokkos] Use long for iterator variable
|
2017-04-05 21:57:55 +01:00 |
|
Peter Steinbach
|
04589d4d4f
|
fixed bug in dot product
|
2017-04-03 14:16:25 +02:00 |
|
Peter Steinbach
|
fd35d895d9
|
added optimized flags to CXXFLAGS
|
2017-04-03 14:16:06 +02:00 |
|
Peter Steinbach
|
55f467e24d
|
moved experimental dot product implementation of dot_impl, which is built only if -DHC_DEVELOP is given
|
2017-03-27 14:22:56 +02:00 |
|
Peter Steinbach
|
2882383324
|
Merge remote-tracking branch 'upstream/master' into rocm_hc_support
|
2017-03-24 15:46:41 +01:00 |
|
Peter Steinbach
|
0e45f86588
|
added cascaded reduction based on C++AMP book
|
2017-03-24 15:19:48 +01:00 |
|
Peter Steinbach
|
96bc566ce1
|
added debug flag
|
2017-03-24 15:19:22 +01:00 |
|
Peter Steinbach
|
0535cbcd5b
|
renamed variables and introduced views
|
2017-03-23 15:55:23 +01:00 |
|
Tom Deakin
|
bf57cf578d
|
[CUDA] Merge pull request #28 from psteinb/extra_cuda_make_variable
Allow specifying compiler and flags for build
|
2017-03-17 14:22:37 +00:00 |
|
Peter Steinbach
|
d8cb7494e0
|
pulled -O3 out into CXXFLAGS, refactored CUDA compiler into CUDA_CXX
make variable to cope with clang as CUDA compiler as well
|
2017-03-17 15:18:13 +01:00 |
|
James Price
|
703eb945a2
|
[OpenMP] Align memory (2MB by default)
|
2017-03-13 17:17:20 +00:00 |
|
James Price
|
4f288ddc3d
|
[OpenMP] Add -qopt-streaming-stores for Intel
|
2017-03-13 17:15:10 +00:00 |
|
Peter Steinbach
|
8c7a801a84
|
put -O3 into CXXFLAGS to comply with OpenMP.make
|
2017-03-13 15:22:26 +01:00 |
|
Peter Steinbach
|
ea12f2a9a1
|
added EXTRA_FLAGS variable to CUDA Makefile to provide the freedom to specify debug flags or gencode flags
|
2017-03-13 14:41:16 +01:00 |
|
James Price
|
94e0900377
|
Use static shared memory in dot for CUDA and HIP
|
2017-02-28 13:24:45 +00:00 |
|
Tom Deakin
|
e7a619c63c
|
Merge pull request #27 from psteinb/fix_minus_for_an_equal
replaced - for = so that assignment takes place
|
2017-02-28 12:46:43 +00:00 |
|