Tom Deakin
ffac9fc352
[OMP45] Use alloc instead to to allocate device memory
...
This fixes #11
2016-11-16 12:50:20 -06:00
Tom Deakin
d42bcd4675
Merge remote-tracking branch 'origin/init-arrays' into devel
2016-11-04 09:17:54 +00:00
James Price
7f4761ae52
Replace write_arrays with init_arrays
...
This allows each model to initialise their arrays with a parallel
approach, which yields the first touch required for good performance
on NUMA architectures.
2016-11-02 11:22:01 +00:00
Tom Deakin
f32cf3bad3
Merge branch 'master' into kernel-dot
...
Conflicts:
main.cpp
2016-10-24 13:53:58 +01:00
Tom Deakin
b120acaf87
[OMP45] Use global defined scalar value
2016-10-24 13:23:20 +01:00
Tom Deakin
8a100f07b4
Add dot kernel to OpenMP 4.5 - tested with clang-ykt
2016-10-14 15:19:25 +01:00
Tom Deakin
942188d836
Add copyright header to source with it missing
2016-05-12 12:53:26 +01:00
Tom Deakin
31cb567e21
Switch data from 1.0, 2.0 and 3.0 to 0.1, 0.2, and 0.3 resp.
...
Using integers for maths gets unstable past 38 interations even
in double precision. Using the original values/10 is safe up to
the default 100 iterations.
2016-05-11 15:51:19 +01:00
Tom Deakin
4954ef7cf0
Add map clauses to OpenMP 4.5 kernels
2016-05-11 15:17:06 +01:00
Tom Deakin
eb10c716f2
First attempt at OpenMP 4.5
2016-05-11 15:08:08 +01:00