Tom Deakin
|
33cb76bfc3
|
Use binary power of two for default array size
|
2015-07-31 11:20:39 +01:00 |
|
Tom Deakin
|
78dd832a5c
|
Rename AMD result
|
2015-07-31 11:16:52 +01:00 |
|
Tom Deakin
|
79f2aa4a17
|
Add K80 ECC on results
|
2015-07-30 15:20:33 +01:00 |
|
Tom Deakin
|
5587957a59
|
Add Xeon Phi SE10P result
|
2015-07-30 15:11:36 +01:00 |
|
Tom Deakin
|
f1f3927273
|
Add K20 and K40 results with ECC off
|
2015-07-30 15:10:47 +01:00 |
|
Tom Deakin
|
50d701a8d8
|
Update results to specify ECC
|
2015-07-30 15:09:03 +01:00 |
|
Tom Deakin
|
a2b4ccdb2f
|
Bump version to 0.9
|
2015-07-30 13:57:29 +01:00 |
|
Tom Deakin
|
d6116f7fe8
|
Update License
|
2015-07-30 13:57:00 +01:00 |
|
Tom Deakin
|
b891f0bf29
|
Print out NTIMES
|
2015-07-30 13:23:31 +01:00 |
|
Tom Deakin
|
b339e374eb
|
Merge branch 'master' of github.com:tomdeakin/GPU-STREAM
|
2015-07-29 19:01:41 +01:00 |
|
Tom
|
24859c4353
|
Add K80 results (using one half)
|
2015-07-29 18:18:08 +01:00 |
|
Tom Deakin
|
d02bb84b9f
|
Check array size is not zero after resize
|
2015-07-29 16:06:18 +01:00 |
|
Tom Deakin
|
d463c036b1
|
Move version string to top of common header
|
2015-07-29 16:02:26 +01:00 |
|
Tom Deakin
|
0bd4bbe82d
|
Make init arrays 1.0f, etc if float
|
2015-07-29 14:47:37 +01:00 |
|
Tom Deakin
|
e3de82d63c
|
Make 3.0 floating point in kernel if float
|
2015-07-29 14:31:26 +01:00 |
|
Tom Deakin
|
dcf734387e
|
Free host arrays at end of program
|
2015-07-28 13:59:31 +01:00 |
|
Tom Deakin
|
a9a087622d
|
Use large try/catch in main function for clarity
|
2015-07-28 13:37:15 +01:00 |
|
Tom Deakin
|
3c248195ea
|
Add try/catch around kernel executions
|
2015-07-28 13:18:02 +01:00 |
|
Tom Deakin
|
807af7d679
|
Remove cuda check in main line Makefile - only needs to be in CUDA part
|
2015-07-28 12:09:23 +01:00 |
|
Tom Deakin
|
dea95fe68e
|
Move common exceptions to common.h
|
2015-07-28 12:06:21 +01:00 |
|
Tom Deakin
|
201bc4638f
|
Bug fix: library linker error in Makefile on OS X
|
2015-07-28 12:04:25 +01:00 |
|
Tom Deakin
|
2b00245e63
|
Remove large try/catch block in OpenCL - wrap around each OCL call
|
2015-07-28 12:03:44 +01:00 |
|
michael boulton
|
e605b056a6
|
Fixed overflow in total array size
|
2015-07-28 11:42:40 +01:00 |
|
Michael Boulton
|
0a89282d57
|
new files
|
2015-07-28 11:37:50 +01:00 |
|
Michael Boulton
|
b43eb9cf16
|
Merge changes
|
2015-07-28 11:37:20 +01:00 |
|
Tom Deakin
|
bb0dcce28b
|
Check for nvcc before building CUDA version
|
2015-07-27 17:40:24 +01:00 |
|
Tom Deakin
|
ad93754c3b
|
Print out memory sizes
|
2015-07-27 17:32:47 +01:00 |
|
Tom Deakin
|
7ba0b8fc18
|
Display precision in the header output
|
2015-07-27 17:19:19 +01:00 |
|
Tom Deakin
|
80955ff2dc
|
Add CUDA error checking
|
2015-07-27 17:07:07 +01:00 |
|
Tom Deakin
|
ec9d73b8d5
|
CUDA bug: ARRAY_SIZE on CLI not being resized
The ARRAY_SIZE was being resized before the CLI args were read
|
2015-07-27 16:49:36 +01:00 |
|
Tom Deakin
|
a74e7cee4c
|
Bug fix: sizes array wasn't updating with ARRAY_SIZE argument
sizes array was fixed at compile time not run time.
Moved the declaration to the correct scope.
|
2015-07-27 13:44:07 +01:00 |
|
Tom Deakin
|
c3ad5edcb3
|
Port float code to CUDA version
|
2015-07-23 12:49:25 +01:00 |
|
Tom Deakin
|
e7fc832771
|
Add --float option to cli
|
2015-07-23 12:15:58 +01:00 |
|
Tom Deakin
|
ae7a352809
|
Add a float option to the kernel
|
2015-07-23 12:12:02 +01:00 |
|
Tom Deakin
|
99d2fe8526
|
Add S9150 result
|
2015-07-22 12:44:07 +01:00 |
|
Tom Deakin
|
cd786dbe25
|
CUDA fix geting device name with the device index
|
2015-07-20 17:09:26 +01:00 |
|
Tom Deakin
|
91c804edcd
|
Add NVIDIA GTX 980 Ti and 780 Ti results
|
2015-07-16 18:47:12 +01:00 |
|
Tom Deakin
|
7e0821278a
|
Add NVIDIA GTX 980 and 580 results
|
2015-07-16 18:43:41 +01:00 |
|
Tom Deakin
|
5fc14bbba1
|
Change binary name in cuda help
|
2015-07-16 18:32:23 +01:00 |
|
Tom Deakin
|
30305d83e0
|
CUDA: select number to times from CLI
|
2015-07-16 18:31:54 +01:00 |
|
Tom Deakin
|
0edd0646a6
|
OCL: select number to times from CLI
|
2015-07-16 18:29:42 +01:00 |
|
Tom Deakin
|
c52c7f290c
|
Update README
|
2015-07-16 18:23:28 +01:00 |
|
Tom Deakin
|
0aaceef49d
|
Update Makefile with all and cuda stuff
|
2015-07-16 18:16:27 +01:00 |
|
Tom Deakin
|
39a63b7d32
|
CUDA fix bug with listing device names
|
2015-07-16 18:14:11 +01:00 |
|
Tom Deakin
|
153b6f2105
|
Add results directory with some sample results
|
2015-07-16 18:12:26 +01:00 |
|
Tom Deakin
|
f2536f8764
|
Enqueue CUDA kernels
|
2015-07-16 18:06:35 +01:00 |
|
Tom Deakin
|
2c2dbf2c3f
|
You have to specify block size in CUDA, so make sure ARRAY_SIZE divides 1024
|
2015-07-16 18:06:07 +01:00 |
|
Tom Deakin
|
9da2cd6d09
|
CUDA: make device buffers arrays not vectors
|
2015-07-16 17:53:09 +01:00 |
|
Tom Deakin
|
e76cc9fb62
|
Uncomment kernels in CUDA - have to remove restrict keyword
|
2015-07-16 17:46:50 +01:00 |
|
Tom Deakin
|
fa865dca4f
|
Enumerate and select CUDA devices
|
2015-07-16 17:46:03 +01:00 |
|