Commit Graph

818 Commits

Author SHA1 Message Date
Tom Lin
2e957d3f60 Inline blocks in CUDAStream 2021-06-30 19:20:37 +01:00
Tom Lin
418315543c Use -p 2 and no arg for JuliaStream in CI 2021-06-30 19:09:37 +01:00
Tom Lin
d675875dcd Switch back to -p for DistributedStream 2021-06-30 19:03:39 +01:00
Tom Lin
fe180656d1 Merge branch 'main' into julia 2021-06-30 18:44:17 +01:00
Tom Lin
4e6c56729b Inline AMDGPU's hard_wait
Show the selected implementation and not a constant "threaded"
2021-06-30 18:09:54 +01:00
Tom Lin
6fe81e1955 Update CUDA to 11.3 for CI script 2021-06-30 16:31:14 +01:00
Tom Lin
ce7f013a8e Update NVHPC to 2.15 w/ CUDA 11.3 2021-06-30 16:04:27 +01:00
Tom Lin
cd367c7da3 Mirror Fujitsu flags for CMake 2021-06-29 17:53:32 +01:00
Tom Deakin
fa6433bab1 update changelog 2021-06-25 09:45:38 -05:00
Tom Deakin
eba2e79eab [OpenMP] Add Fujitsu compiler flags
For best performance on the A64FX with the Fujitsu compiler,
the array pointers also need to be labeled __restrict and const
as appropriate.

Closes #94.
2021-06-25 09:44:16 -05:00
Tom Lin
e3bd58378f Don't debug print args 2021-06-16 01:16:10 +01:00
Tom Lin
ce4d6cfbfb Add integration tests and CI
Fix wrong nstream in plain_stream
2021-06-16 01:11:40 +01:00
Tom Lin
fdb2c181cc Add Crossbeam implementation
Add rustfmt and use target-cpu=native
Add option for libc malloc, basic thread pinning, touch-free allocation
Split modules
2021-06-15 23:13:14 +01:00
Tom Lin
c70a5da45b Merge branch 'main' into rust 2021-06-10 05:37:03 +01:00
Tom Lin
d799535c96 Larger arraysize for CI 2021-06-10 05:06:48 +01:00
Tom Lin
c5ad3f34d9 Drop -p N for DistributedStream.jl CI 2021-06-10 05:01:24 +01:00
Tom Lin
2cf8ca5f8c Use addprocs() for DistributedStream 2021-06-10 04:57:52 +01:00
Tom Lin
63f471f880 set pwd to JuliaStream.jl for CI run 2021-06-10 04:33:12 +01:00
Tom Lin
b3efa6af67 Initial Julia implementation 2021-06-10 04:20:40 +01:00
Tom Deakin
5d9e408a06 [SYCL 2020] Make array size a size_t 2021-06-04 16:42:49 +00:00
Tom Deakin
25e021caa3 Update CHANGELOG.md 2021-06-03 16:08:14 +01:00
Tom Deakin
dd90598e20 Merge pull request #105 from UoB-HPC/tbb
Initial TBB implementation
2021-06-03 16:07:38 +01:00
Tom Lin
0e3727d8f8 Make partitioner a compile option
Inline all abstractions
Add intel compilers for Make
2021-06-03 13:43:12 +01:00
Tom Lin
0867115d8d Remove references to oneapi/tbb.h 2021-05-27 10:51:45 +01:00
Tom Lin
d3b676cb37 Include CL_MEM_CHANNEL_INTEL directly to avoid header precedence issues 2021-05-27 10:47:46 +01:00
Tom Lin
7a130a59bc Don't tie implementation to oneTBB specific headers
Fix wrong TBB_ROOT detection
2021-05-27 10:23:06 +01:00
Tom Lin
4d00a8699e Don't point to the CL dir for SYCL 2021-05-27 09:41:41 +01:00
Tom Lin
742f0629be Initial TBB implementation 2021-05-27 09:28:40 +01:00
Tom Lin
5318404249 Use ./src instead of ./cpp
Create subdir for each cpp-based implementation
2021-05-26 17:46:07 +01:00
Tom Lin
cc86998f50 Initial Scala implementation 2021-05-23 08:03:30 +01:00
Tom Lin
60d4cb8c87 Enter ./cpp first before CI tasks, take 2 2021-05-23 07:39:04 +01:00
Tom Lin
67a7447924 Enter ./cpp first before CI tasks 2021-05-23 07:36:08 +01:00
Tom Lin
e2f07c056d Merge branch 'main' into top-level-src 2021-05-23 07:27:26 +01:00
Tom Lin
717cc40c2c Move all C++ impl. to ./cpp 2021-05-23 07:21:21 +01:00
Tom Deakin
2ab68ab39e Update changelog 2021-05-18 16:47:56 +01:00
Tom Deakin
82dedad676 [SYCL 1.2.1] Add check for FP64 support
Fixes #98
2021-05-18 16:47:36 +01:00
Tom Deakin
d3269ece59 Merge pull request #101 from UoB-HPC/revert-100-issue-98
Revert "Add check for FP64 support"
2021-05-18 16:44:20 +01:00
Tom Deakin
b772d00fe4 Revert "Add check for FP64 support" 2021-05-18 16:44:06 +01:00
Tom Deakin
a209f16c16 Merge pull request #100 from UoB-HPC/issue-98
Add check for FP64 support
2021-05-18 11:31:36 +01:00
Tom Lin
0616068ade Merge branch 'main' into issue-98 2021-05-17 20:16:50 +01:00
Tom Lin
45ebd09ef2 Don't use hardcoded rocm path 2021-05-17 20:00:00 +01:00
Tom Deakin
6581ee63b8 Update changelog 2021-05-17 15:33:54 +01:00
Tom Deakin
9f38177e1b [SYCL 2020] Add check for FP64 support using device aspects.
This will resolve #98 in the future SYCL 2020 version.
2021-05-17 15:32:42 +01:00
Tom Deakin
e20aecd845 [SYCL 1.2.1] Add check for FP64 support
Fixes #98
2021-05-17 15:25:43 +01:00
Tom Lin
cc16547e4d Add install target for CMake 2021-05-10 17:50:36 +01:00
Tom Deakin
e5c9f313cb Merge pull request #96 from tob2/stdlib_fix
Include stdlib.h for aligned_alloc
2021-04-28 11:19:27 +01:00
Tobias Burnus
75a4394830 Include stdlib.h for aligned_alloc
Silence "error: there are no arguments to 'aligned_alloc' that depend
on a template parameter, so a declaration of 'aligned_alloc' must be
available"

* OMPStream.cpp: #include <cstdlib>.
* RAJAStream.cpp: Likewise.
2021-04-22 09:00:55 +02:00
Tom Lin
2eca3974e6 Disable CI for RAJA on gcc-10+CUDA due to ICE
Update changelog to include RAJA 0.13.x
2021-04-21 16:28:12 +01:00
Tom Lin
2ff883f2f7 Merge branch 'main' into rust
# Conflicts:
#	README.md
2021-04-21 15:58:43 +01:00
Tom Deakin
6ecd10bf98 Deprecate HC 2021-04-20 16:16:56 +01:00