Tom Deakin
|
ae7a352809
|
Add a float option to the kernel
|
2015-07-23 12:12:02 +01:00 |
|
Tom Deakin
|
99d2fe8526
|
Add S9150 result
|
2015-07-22 12:44:07 +01:00 |
|
Tom Deakin
|
cd786dbe25
|
CUDA fix geting device name with the device index
|
2015-07-20 17:09:26 +01:00 |
|
Tom Deakin
|
91c804edcd
|
Add NVIDIA GTX 980 Ti and 780 Ti results
|
2015-07-16 18:47:12 +01:00 |
|
Tom Deakin
|
7e0821278a
|
Add NVIDIA GTX 980 and 580 results
|
2015-07-16 18:43:41 +01:00 |
|
Tom Deakin
|
5fc14bbba1
|
Change binary name in cuda help
|
2015-07-16 18:32:23 +01:00 |
|
Tom Deakin
|
30305d83e0
|
CUDA: select number to times from CLI
|
2015-07-16 18:31:54 +01:00 |
|
Tom Deakin
|
0edd0646a6
|
OCL: select number to times from CLI
|
2015-07-16 18:29:42 +01:00 |
|
Tom Deakin
|
c52c7f290c
|
Update README
|
2015-07-16 18:23:28 +01:00 |
|
Tom Deakin
|
0aaceef49d
|
Update Makefile with all and cuda stuff
|
2015-07-16 18:16:27 +01:00 |
|
Tom Deakin
|
39a63b7d32
|
CUDA fix bug with listing device names
|
2015-07-16 18:14:11 +01:00 |
|
Tom Deakin
|
153b6f2105
|
Add results directory with some sample results
|
2015-07-16 18:12:26 +01:00 |
|
Tom Deakin
|
f2536f8764
|
Enqueue CUDA kernels
|
2015-07-16 18:06:35 +01:00 |
|
Tom Deakin
|
2c2dbf2c3f
|
You have to specify block size in CUDA, so make sure ARRAY_SIZE divides 1024
|
2015-07-16 18:06:07 +01:00 |
|
Tom Deakin
|
9da2cd6d09
|
CUDA: make device buffers arrays not vectors
|
2015-07-16 17:53:09 +01:00 |
|
Tom Deakin
|
e76cc9fb62
|
Uncomment kernels in CUDA - have to remove restrict keyword
|
2015-07-16 17:46:50 +01:00 |
|
Tom Deakin
|
fa865dca4f
|
Enumerate and select CUDA devices
|
2015-07-16 17:46:03 +01:00 |
|
Tom Deakin
|
99a63245be
|
Add .gitignore file
|
2015-07-16 17:35:39 +01:00 |
|
Tom Deakin
|
cf6809ba37
|
Change output to say CUDA if CUDA version
|
2015-07-16 17:34:59 +01:00 |
|
Tom Deakin
|
3ff57ba9d6
|
Add parseInt to cuda version for device index
|
2015-07-16 17:33:58 +01:00 |
|
Tom Deakin
|
8c7224ab42
|
Remove cl_uint reference in CUDA version
|
2015-07-16 17:31:59 +01:00 |
|
Tom Deakin
|
06c24b2867
|
Remove OpenCL header and use CUDA header in CUDA version
|
2015-07-16 17:30:54 +01:00 |
|
Tom Deakin
|
435cb3b971
|
Remove catching cl::Error in CUDA version
|
2015-07-16 17:30:13 +01:00 |
|
Tom Deakin
|
cfe2f862ae
|
Add a half broken CUDA port
|
2015-07-16 17:28:28 +01:00 |
|
Tom Deakin
|
c31644d001
|
Revert "Revert "Remove option for float - only use doubles for now""
This reverts commit 903a66b2e9.
|
2015-07-16 17:01:48 +01:00 |
|
Tom Deakin
|
903a66b2e9
|
Revert "Remove option for float - only use doubles for now"
This reverts commit 4225ba0a2b.
|
2015-07-16 16:59:43 +01:00 |
|
Tom Deakin
|
4225ba0a2b
|
Remove option for float - only use doubles for now
|
2015-07-16 16:58:12 +01:00 |
|
Tom Deakin
|
e5bd4fcb19
|
Be consistent with indenting
|
2015-07-16 16:36:30 +01:00 |
|
Tom Deakin
|
97c179feb3
|
Add parseArguments etc to get the device from the command line
|
2015-07-16 16:32:49 +01:00 |
|
Tom Deakin
|
9e6e269c3b
|
Add STREAM reference to README
|
2015-07-16 12:52:09 +01:00 |
|
Tom Deakin
|
f37ee97e35
|
Print out OpenCL device name
|
2015-07-16 12:39:32 +01:00 |
|
Tom Deakin
|
3a7a95b00d
|
Print out short version banner
|
2015-07-16 12:39:03 +01:00 |
|
Tom Deakin
|
c8e35d1307
|
Update Makefile with better rules and a clean option
|
2015-07-16 12:30:30 +01:00 |
|
Tom Deakin
|
9791b7e3d1
|
Use floating point abs value
|
2015-07-16 12:26:16 +01:00 |
|
Tom Deakin
|
0596d01913
|
Validate the results
|
2015-07-16 12:23:21 +01:00 |
|
Tom Deakin
|
4d93a5e0b4
|
Copy back buffers after execution
|
2015-07-16 12:19:32 +01:00 |
|
Tom Deakin
|
c901069cf8
|
Average time should be over NTIMES-1 not NTIMES
|
2015-07-16 12:19:05 +01:00 |
|
Tom Deakin
|
847cdc9b5e
|
Time the runs and print out results
|
2015-07-16 12:04:07 +01:00 |
|
Tom Deakin
|
097aa38fb7
|
Add 3-clause BSD license file
|
2015-07-15 23:36:08 +01:00 |
|
Tom Deakin
|
d4b45250ce
|
Add empty README.md file
|
2015-07-15 23:29:40 +01:00 |
|
Tom Deakin
|
0d64d58c50
|
Create device buffers and copy host vectors to device
|
2015-07-15 23:27:19 +01:00 |
|
Tom Deakin
|
dd3ddf3aa3
|
Add cl.hpp from Khronos website
|
2015-07-15 23:20:19 +01:00 |
|
Tom Deakin
|
8b1a2e2f5a
|
Add Makefile
|
2015-07-15 23:20:13 +01:00 |
|
Tom Deakin
|
11a661790f
|
Add main method - setup OpenCL for default GPU device
|
2015-07-15 23:19:40 +01:00 |
|
Tom Deakin
|
8ad233c12e
|
Add STREAM OpenCL kernels
|
2015-07-15 23:16:23 +01:00 |
|