Tom Deakin
|
b514969193
|
Create OCL device buffers
|
2016-04-28 15:08:12 +01:00 |
|
Tom Deakin
|
77f6df856c
|
Call kernels in OCL
|
2016-04-28 15:05:01 +01:00 |
|
Tom Deakin
|
eeaf9358ab
|
Create OCL kernel functors
|
2016-04-28 15:01:43 +01:00 |
|
Tom Deakin
|
38e1e3b704
|
Add starts of OpenCL implementation
|
2016-04-28 12:59:14 +01:00 |
|
Tom Deakin
|
a745ffc724
|
Add more keywords to CUDA header
|
2016-04-28 12:07:09 +01:00 |
|
Tom Deakin
|
59fe9738b6
|
Add a templated run function to make double/float switch easy
|
2016-04-28 12:03:50 +01:00 |
|
Tom Deakin
|
8d88afdedb
|
Tidy up timing printing to reduce code duplication
|
2016-04-28 11:57:09 +01:00 |
|
Tom Deakin
|
377b348748
|
Move implementation string to the common header file
|
2016-04-28 11:15:25 +01:00 |
|
Tom Deakin
|
daa7f643b9
|
Print out timing results
|
2016-04-27 13:18:06 +01:00 |
|
Tom Deakin
|
3d5a49317e
|
Free CUDA buffers in destructor
|
2016-04-27 12:11:19 +01:00 |
|
Tom Deakin
|
c28e70ae70
|
Add timers and run multiple times
|
2016-04-27 12:08:49 +01:00 |
|
Tom Deakin
|
40c787d040
|
Check bufers fit on CUDA device
|
2016-04-27 11:52:15 +01:00 |
|
Tom Deakin
|
9aa27cd91d
|
Print out average error on check if there is an error
|
2016-04-27 11:42:23 +01:00 |
|
Tom Deakin
|
6225ae90a7
|
Add start of check results function
|
2016-04-27 11:35:12 +01:00 |
|
Tom Deakin
|
6522d9114a
|
Add new line at end of file
|
2016-04-27 11:35:04 +01:00 |
|
Tom Deakin
|
9730cd071e
|
Overridden functions should have more keywords
|
2016-04-27 11:34:42 +01:00 |
|
Tom Deakin
|
9c673317a7
|
Store array size in class so can use it for kernel launches
|
2016-04-26 16:09:51 +01:00 |
|
Tom Deakin
|
319e11011c
|
Add triad kernel
|
2016-04-26 16:07:32 +01:00 |
|
Tom Deakin
|
7a3a546a6e
|
Add mul CUDA kernel
|
2016-04-26 16:06:17 +01:00 |
|
Tom Deakin
|
dec0237353
|
Add mul kernel
|
2016-04-26 16:03:28 +01:00 |
|
Tom Deakin
|
c22b74ba47
|
Add read_arrays definition for CUDA
|
2016-04-26 15:30:37 +01:00 |
|
Tom Deakin
|
8e534daf8b
|
Add methods to copy data between host and device
|
2016-04-26 15:02:41 +01:00 |
|
Tom Deakin
|
ae679a5775
|
Fix indentation in Stream.h
|
2016-04-26 14:50:58 +01:00 |
|
Tom Deakin
|
ee4820b5e4
|
Create CUDA device buffers
|
2016-04-26 14:50:22 +01:00 |
|
Tom Deakin
|
03b01e190f
|
Add cuda constructor declaration and error checking function
|
2016-04-26 14:49:04 +01:00 |
|
Tom Deakin
|
6169bdb7b5
|
Add some global variables
|
2016-04-26 14:40:49 +01:00 |
|
Tom Deakin
|
0bf68f9909
|
Make a copy kernel using the private variables
|
2016-04-26 14:34:25 +01:00 |
|
Tom Deakin
|
1a259d4fc8
|
Add a copy kernel
|
2016-04-26 14:24:04 +01:00 |
|
Tom Deakin
|
2234841b16
|
Initial commit of new design with classes
|
2016-04-26 14:08:59 +01:00 |
|