pensun
|
207701219a
|
Add looper optimization for cuda-stream.cu, remove result files
|
2016-04-03 06:49:56 -05:00 |
|
Tom Deakin
|
70330c7b9b
|
Display CUDA driver version in output header
This mimics the OpenCL change in issue #4.
|
2015-09-24 12:03:44 +01:00 |
|
Tom Deakin
|
6dcc0ffc5c
|
Throw std::runtime_error(string) rather than define own exception classes just for strings!
|
2015-07-31 17:02:48 +01:00 |
|
Tom Deakin
|
5eec4dddbc
|
Remove trailing whitespaces
|
2015-07-31 15:35:40 +01:00 |
|
Tom Deakin
|
4cda93eff5
|
Remove dead line from cuda code
|
2015-07-31 14:09:48 +01:00 |
|
Tom Deakin
|
ec8799c4db
|
Check the device can support the array sizes before trying to allocate memory
|
2015-07-31 14:00:48 +01:00 |
|
Tom Deakin
|
70ccd22919
|
Free CUDA buffers
|
2015-07-31 13:59:34 +01:00 |
|
Tom Deakin
|
d6116f7fe8
|
Update License
|
2015-07-30 13:57:00 +01:00 |
|
Tom Deakin
|
b891f0bf29
|
Print out NTIMES
|
2015-07-30 13:23:31 +01:00 |
|
Tom Deakin
|
d02bb84b9f
|
Check array size is not zero after resize
|
2015-07-29 16:06:18 +01:00 |
|
Tom Deakin
|
0bd4bbe82d
|
Make init arrays 1.0f, etc if float
|
2015-07-29 14:47:37 +01:00 |
|
Tom Deakin
|
dcf734387e
|
Free host arrays at end of program
|
2015-07-28 13:59:31 +01:00 |
|
Tom Deakin
|
dea95fe68e
|
Move common exceptions to common.h
|
2015-07-28 12:06:21 +01:00 |
|
Michael Boulton
|
e605b056a6
|
Fixed overflow in total array size
|
2015-07-28 11:42:40 +01:00 |
|
Michael Boulton
|
b43eb9cf16
|
Merge changes
|
2015-07-28 11:37:20 +01:00 |
|
Tom Deakin
|
ad93754c3b
|
Print out memory sizes
|
2015-07-27 17:32:47 +01:00 |
|
Tom Deakin
|
7ba0b8fc18
|
Display precision in the header output
|
2015-07-27 17:19:19 +01:00 |
|
Tom Deakin
|
80955ff2dc
|
Add CUDA error checking
|
2015-07-27 17:07:07 +01:00 |
|
Tom Deakin
|
ec9d73b8d5
|
CUDA bug: ARRAY_SIZE on CLI not being resized
The ARRAY_SIZE was being resized before the CLI args were read
|
2015-07-27 16:49:36 +01:00 |
|
Tom Deakin
|
a74e7cee4c
|
Bug fix: sizes array wasn't updating with ARRAY_SIZE argument
sizes array was fixed at compile time not run time.
Moved the declaration to the correct scope.
|
2015-07-27 13:44:07 +01:00 |
|
Tom Deakin
|
c3ad5edcb3
|
Port float code to CUDA version
|
2015-07-23 12:49:25 +01:00 |
|
Tom Deakin
|
cd786dbe25
|
CUDA fix getting device name with the device index
|
2015-07-20 17:09:26 +01:00 |
|
Tom Deakin
|
5fc14bbba1
|
Change binary name in cuda help
|
2015-07-16 18:32:23 +01:00 |
|
Tom Deakin
|
30305d83e0
|
CUDA: select number of times from CLI
|
2015-07-16 18:31:54 +01:00 |
|
Tom Deakin
|
39a63b7d32
|
CUDA fix bug with listing device names
|
2015-07-16 18:14:11 +01:00 |
|
Tom Deakin
|
f2536f8764
|
Enqueue CUDA kernels
|
2015-07-16 18:06:35 +01:00 |
|
Tom Deakin
|
2c2dbf2c3f
|
You have to specify block size in CUDA, so make sure ARRAY_SIZE is divisible by 1024
|
2015-07-16 18:06:07 +01:00 |
|
Tom Deakin
|
9da2cd6d09
|
CUDA: make device buffers arrays not vectors
|
2015-07-16 17:53:09 +01:00 |
|
Tom Deakin
|
e76cc9fb62
|
Uncomment kernels in CUDA - have to remove restrict keyword
|
2015-07-16 17:46:50 +01:00 |
|
Tom Deakin
|
fa865dca4f
|
Enumerate and select CUDA devices
|
2015-07-16 17:46:03 +01:00 |
|
Tom Deakin
|
cf6809ba37
|
Change output to say CUDA if CUDA version
|
2015-07-16 17:34:59 +01:00 |
|
Tom Deakin
|
3ff57ba9d6
|
Add parseInt to cuda version for device index
|
2015-07-16 17:33:58 +01:00 |
|
Tom Deakin
|
8c7224ab42
|
Remove cl_uint reference in CUDA version
|
2015-07-16 17:31:59 +01:00 |
|
Tom Deakin
|
06c24b2867
|
Remove OpenCL header and use CUDA header in CUDA version
|
2015-07-16 17:30:54 +01:00 |
|
Tom Deakin
|
435cb3b971
|
Remove catching cl::Error in CUDA version
|
2015-07-16 17:30:13 +01:00 |
|
Tom Deakin
|
cfe2f862ae
|
Add a half broken CUDA port
|
2015-07-16 17:28:28 +01:00 |
|