Commit Graph

907 Commits

Author SHA1 Message Date
Tom Lin
949071b4b9 Add CI debugging 2021-12-03 12:20:40 +00:00
Tom Lin
c4ee2af23a Add CI badge 2021-12-02 23:21:31 +00:00
Tom Lin
c378f93178 Fix action cache key 2021-12-02 18:32:58 +00:00
Tom Deakin
ed960d88a3
Update CHANGELOG.md 2021-12-02 10:41:55 +00:00
Tom Deakin
4ab6eac3a5
Merge pull request #102 from UoB-HPC/top-level-src
Move all C++ impl. to ./cpp and remove Makefiles
2021-12-02 10:30:46 +00:00
Tom Lin
7bf0b212d9 Fix CI missing omp library
Fix thrust model typo
2021-12-01 16:40:05 +00:00
Tom Lin
53f3b7b8a4 Fetch CL headers on-demand for ComputeCpp and OCL build
Minor CI adjustments for act
Fix TBB and Thrust cmake builds
2021-12-01 16:19:50 +00:00
Tom Lin
565c8c7f95 Fix cache path 2021-11-30 19:03:04 +00:00
Tom Lin
f5fe55c204 [WIP] Drop CL headers and Makefiles
Update README
Move new models to /src
2021-11-30 18:22:55 +00:00
Tom Lin
9326e98172 Merge branch 'main' into top-level-src
# Conflicts:
#	.github/workflows/main.yaml
#	.gitignore
#	README.md
#	src/CMakeLists.txt
#	src/ci-test-compile.sh
2021-11-30 14:39:48 +00:00
Tom Deakin
53547ff664
Merge pull request #111 from UoB-HPC/thrust
Thrust Implementation
2021-11-25 13:17:48 +00:00
Tom Deakin
2ea930a0f5
Merge pull request #108 from UoB-HPC/java
Java implementation
2021-11-25 13:14:30 +00:00
Tom Lin
3d2f55da4a
Merge branch 'main' into java 2021-11-25 13:14:06 +00:00
Tom Deakin
1b147963d1
Merge pull request #103 from UoB-HPC/scala
Initial Scala implementation
2021-11-25 13:11:46 +00:00
Tom Deakin
d1b1722acf
Merge pull request #106 from UoB-HPC/julia
Julia implementation
2021-11-25 13:07:27 +00:00
Tom Lin
6c47b22cc3 Capture vectors by reference directly
Add custom range implementation
2021-11-12 12:38:54 +00:00
Tom Lin
dc42388df3 Fix CXX recognition issues for rocThrust
Fix CI check for min CMake version on CUDA Thrust
Temporarily disable CUDA Thrust w/ TBB for now
2021-11-12 03:25:18 +00:00
Tom Lin
fe4007b446 Fix CI ROCm quirks
Fix CI CUDA path
2021-11-12 02:26:31 +00:00
Tom Lin
0d55a7261b Fix CI not installing rocThrust
Fix CI CUDA flag version
2021-11-12 00:14:07 +00:00
Tom Lin
a463e88895 Fix CI rocThrust build variables
Fix CI CUDA cmake module include path
Bump CI NVHPC version
2021-11-11 23:50:27 +00:00
Tom Lin
c2f75b90b3 Fix CI NVHPC path
Fix CI ROCm install sources
2021-11-11 23:30:04 +00:00
Tom Lin
a66696d971 Initial Thrust implementation 2021-11-11 23:11:04 +00:00
Tom Lin
78b52a496c Use @simd instead of @fastmath for CPU reduction 2021-08-28 11:39:08 +01:00
Tom Lin
41f1767365 Pause GC during benchmark to reduce noise 2021-08-28 11:16:19 +01:00
Tom Lin
13cb8ffced Use custom static reduction for CPU 2021-08-28 11:10:49 +01:00
Tom Lin
4853457dca Add type annotation for all kernels
Update dependencies
2021-08-27 14:04:58 +01:00
Tom Lin
c445b64690 Address CUDA comments
Drop soft=false for AMDGPU as this option was removed
Update dependencies
2021-08-18 02:00:50 +01:00
Tom Lin
bb271dd046 Update PlainStream with context 2021-08-18 01:59:06 +01:00
Tom Lin
a26699c5b5 Add oneAPI and KA implementation
Isolate projects to avoid transitive dependency
Add parameter for passing devices
Incorporate further reviews
Update all dependencies
2021-08-17 14:28:47 +01:00
Tom Deakin
8f9ca7baa7 update references in README 2021-07-28 10:37:25 +01:00
Tom Deakin
b4d01160cb
CITATION cannot yet handle external references 2021-07-28 10:31:39 +01:00
Tom Deakin
d8eba00132
Update CITATION.cff 2021-07-28 10:30:51 +01:00
Tom Deakin
064743299b
Update CITATION.cff 2021-07-28 10:30:18 +01:00
Tom Deakin
b766b0c707
Update CITATION.cff 2021-07-28 10:29:56 +01:00
Tom Deakin
537ad3650e Add CITATION file 2021-07-28 10:28:34 +01:00
Tom Lin
867a8a32ee Use older fmt-maven-plugin for Java 8 compat. 2021-07-01 06:05:10 +01:00
Tom Lin
82084d407b +x for mvnw executable 2021-07-01 06:01:29 +01:00
Tom Lin
ab41475f10 Initial Java implementation 2021-07-01 05:59:48 +01:00
Tom Lin
7c1e04a42b Add comment about blockIdx/workgroupIdx in Julia 2021-06-30 19:31:42 +01:00
Tom Lin
2e957d3f60 Inline blocks in CUDAStream 2021-06-30 19:20:37 +01:00
Tom Lin
418315543c Use -p 2 and no arg for JuliaStream in CI 2021-06-30 19:09:37 +01:00
Tom Lin
d675875dcd Switch back to -p for DistributedStream 2021-06-30 19:03:39 +01:00
Tom Lin
fe180656d1 Merge branch 'main' into julia 2021-06-30 18:44:17 +01:00
Tom Lin
4e6c56729b Inline AMDGPU's hard_wait
Show the selected implementation and not a constant "threaded"
2021-06-30 18:09:54 +01:00
Tom Lin
6fe81e1955 Update CUDA to 11.3 for CI script 2021-06-30 16:31:14 +01:00
Tom Lin
ce7f013a8e Update NVHPC to 2.15 w/ CUDA 11.3 2021-06-30 16:04:27 +01:00
Tom Lin
cd367c7da3 Mirror Fujitsu flags for CMake 2021-06-29 17:53:32 +01:00
Tom Deakin
fa6433bab1 update changelog 2021-06-25 09:45:38 -05:00
Tom Deakin
eba2e79eab [OpenMP] Add Fujitsu compiler flags
For best performance on the A64FX with the Fujitsu compiler,
the array pointers also need to be labeled __restrict and const
as appropriate.

Closes #94.
2021-06-25 09:44:16 -05:00
Tom Lin
e3bd58378f Don't debug print args 2021-06-16 01:16:10 +01:00