Tom Deakin
4203ccb017
Add nstream kernel to OpenCL
2021-02-02 15:46:53 +00:00
Tom Deakin
94c7c3dbd8
Use signed array size for OpenCL
2021-01-12 10:21:48 +00:00
Brice Videau
e92d034f64
Ensure OpenCL destructors are called in the correct order.
2020-10-16 18:05:23 -05:00
Tom Deakin
d42bcd4675
Merge remote-tracking branch 'origin/init-arrays' into devel
2016-11-04 09:17:54 +00:00
James Price
7f4761ae52
Replace write_arrays with init_arrays
...
This allows each model to initialise its arrays in parallel, which
provides the first touch required for good performance on NUMA
architectures.
2016-11-02 11:22:01 +00:00
James Price
21556af500
[OCL] Automatically determine dot NDRange config
2016-10-26 15:19:14 +01:00
James Price
d5482b74f4
Improve performance of OpenCL dot implementation
2016-10-24 21:26:09 +01:00
Tom Deakin
f32cf3bad3
Merge branch 'master' into kernel-dot
...
Conflicts:
main.cpp
2016-10-24 13:53:58 +01:00
Tom Deakin
0bed614734
[OpenCL] Use global defined scalar value
2016-10-24 13:51:47 +01:00
James Price
cfc1aba2c0
Use WGSIZE=256 for dot for compatibility with AMD
2016-10-24 12:51:01 +01:00
Tom Deakin
2085cacea0
Add an OpenCL dot kernel
...
We have to name the kernel stream_dot (for example) because "dot"
already exists as a built-in OpenCL function.
2016-10-14 17:07:55 +01:00
Tom Deakin
530b2adda2
Add License text to all files
2016-05-03 12:32:03 +01:00
Tom Deakin
a355acf2ee
Move source files to top level directory
2016-05-03 11:43:25 +01:00