Tom Deakin
62860284b2
[CUDA] Add Managed memory and Page fault options
...
To use managed memory, compile the code defining MANAGED
To use CUDA 8 page-fault memory, compile the code defining PAGEFAULT
2016-12-19 05:00:15 -07:00
Tom Deakin
d42bcd4675
Merge remote-tracking branch 'origin/init-arrays' into devel
2016-11-04 09:17:54 +00:00
James Price
7f4761ae52
Replace write_arrays with init_arrays
...
This allows each model to initialise their arrays with a parallel
approach, which yields the first touch required for good performance
on NUMA architectures.
2016-11-02 11:22:01 +00:00
James Price
dfc79eeb4d
Improve performance of CUDA dot implementation
2016-10-24 21:42:39 +01:00
Tom Deakin
d3b497a9ca
Add a CUDA dot kernel
2016-10-14 17:51:40 +01:00
Tom Deakin
2462023ed9
Set thread block size in CUDA with a #define, and check that array size is multiple of it
2016-05-11 12:21:29 +01:00
Tom Deakin
530b2adda2
Add License text to all files
2016-05-03 12:32:03 +01:00
Tom Deakin
a355acf2ee
Move source files to top level directory
2016-05-03 11:43:25 +01:00