Tom Deakin
f32cf3bad3
Merge branch 'master' into kernel-dot
Conflicts:
main.cpp
2016-10-24 13:53:58 +01:00
Tom Deakin
5b1e67f666
[CUDA] Use new value of scalar
2016-10-24 13:19:54 +01:00
James Price
8a8f44b4ce
Fix CUDA host code for dot kernel
Wrong number of blocks was being copied and summed.
2016-10-24 12:47:25 +01:00
Tom Deakin
d3b497a9ca
Add a CUDA dot kernel
2016-10-14 17:51:40 +01:00
James Price
f94e36f320
[CUDA] Fix device name output (OpenCL->CUDA)
2016-07-06 17:16:35 +01:00
Tom Deakin
31cb567e21
Switch data from 1.0, 2.0 and 3.0 to 0.1, 0.2, and 0.3 resp.
Using integers for maths gets unstable past 38 iterations even
in double precision. Using the original values/10 is safe up to
the default 100 iterations.
2016-05-11 15:51:19 +01:00
Tom Deakin
2462023ed9
Set thread block size in CUDA with a #define, and check that array size is multiple of it
2016-05-11 12:21:29 +01:00
Tom Deakin
530b2adda2
Add License text to all files
2016-05-03 12:32:03 +01:00
Tom Deakin
a355acf2ee
Move source files to top level directory
2016-05-03 11:43:25 +01:00