Vanya Yaneva
9916a81bc5
Small formatting change
2017-07-27 17:39:13 +01:00
Vanya Yaneva
8c4af581d1
Reverted changes in kernel build
2017-07-27 17:36:12 +01:00
Vanya Yaneva
05fc803858
Updated SYCL makefile and kernel build
2017-07-25 13:49:08 +01:00
James Price
db01715806
[SYCL] Explictly use first dimension of ranges
2016-11-18 00:35:36 +00:00
James Price
1e976ff150
[SYCL] Fix multiple template specializations
2016-11-18 00:14:46 +00:00
James Price
66776d5839
[SYCL] Use consistent syntax for indexing
2016-11-17 23:52:13 +00:00
James Price
02bff60870
[SYCL] Fix start index in reduction loop
2016-11-17 21:01:30 +00:00
Tom Deakin
d42bcd4675
Merge remote-tracking branch 'origin/init-arrays' into devel
2016-11-04 09:17:54 +00:00
James Price
7f4761ae52
Replace write_arrays with init_arrays
...
This allows each model to initialise their arrays with a parallel
approach, which yields the first touch required for good performance
on NUMA architectures.
2016-11-02 11:22:01 +00:00
James Price
dd296d2231
[SYCL] Prebuild dot kernel like the others
2016-10-28 21:15:12 +01:00
James Price
b09b90f6fc
Merge remote-tracking branch 'origin' into kernel-dot
2016-10-28 21:07:57 +01:00
James Price
cce8e78cae
Merge pull request #12 from Ruyk/master
...
Updated the SYCL Stream benchmark with latest ComputeCpp CE 0.1.1 Edition
2016-10-28 21:06:34 +01:00
James Price
d7c48c5063
Slight tweak to dot config output to fix parsing scripts
2016-10-26 15:47:10 +01:00
James Price
cbf97dc7d9
[SYCL] Automatically determine dot NDRange config
2016-10-26 15:19:14 +01:00
James Price
ed630e7dbc
[SYCL] Implement dot kernel
2016-10-25 16:39:23 +01:00
Tom Deakin
47128d47c0
[SYCL] Use global defined scalar value
2016-10-24 13:37:43 +01:00
Ruyman Reyes
d562283cde
Minor performance tuning for SYCL benchmark
...
* Pre-compiling kernel binaries when setting up the benchmark,
like OpenCL equivalent
* Using the linear access syntax for buffers
2016-10-18 13:09:19 +01:00
James Price
74a4a3b0bd
[SYCL] Set WGSIZE to more sensible value for AMD Fiji
2016-07-07 09:40:16 +01:00
Tom Deakin
31cb567e21
Switch data from 1.0, 2.0 and 3.0 to 0.1, 0.2, and 0.3 resp.
...
Using integers for maths gets unstable past 38 interations even
in double precision. Using the original values/10 is safe up to
the default 100 iterations.
2016-05-11 15:51:19 +01:00
Tom Deakin
81fa9e1922
Require SYCL array size to be multiple of WGSIZE
2016-05-11 12:23:21 +01:00
James Price
084d7417b9
[SYCL] Remove unneeded cl_device_info line
2016-05-09 15:20:11 +01:00
James Price
6d913bab4b
[SYCL] Actually use device_index to select device
2016-05-08 21:35:24 +01:00
James Price
3b3f6dfc26
[SYCL] Implement device list/selection functionality
2016-05-08 19:22:09 +01:00
James Price
54834e05f4
[SYCL] Use nd_range instead of range to specify work-group size
2016-05-06 22:41:10 +01:00
James Price
d4b3b3533c
Update SYCL version to work with ComputeCpp
...
Still needs proper CMake rules and kernel names need to be fixed for
multiple template instantiations.
2016-05-06 00:38:30 +01:00
James Price
b45f311e0d
Add missing SYCL source files
2016-05-03 14:48:35 +01:00