-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 33554432 (elements), Offset = 0 (elements)
Memory per array = 256.0 MiB (= 0.2 GiB).
Total memory required = 768.0 MiB (= 0.8 GiB).
Each kernel will be executed 100 times.
 The *best* time for each kernel (excluding the first iteration)
 will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 32
Number of Threads counted = 32
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 4685 microseconds.
   (= 4685 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function    Best Rate MB/s  Avg time     Min time     Max time
Copy:           84400.3     0.006386     0.006361     0.006415
Scale:          99272.6     0.005457     0.005408     0.005546
Add:           118080.7     0.006854     0.006820     0.006892
Triad:         116271.9     0.006969     0.006926     0.007042
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
Application 1396734 resources: utime ~84s, stime ~2s, Rss ~791400, inblocks ~490, outblocks ~54