Tom Lin
|
65b8b51b35
|
Move CMakeList.txt to top level
|
2021-12-03 13:26:09 +00:00 |
|
Tom Lin
|
365d1a3601
|
Fix CI cache path
|
2021-12-03 12:25:57 +00:00 |
|
Tom Lin
|
19c59cba73
|
Merge remote-tracking branch 'origin/main' into main
|
2021-12-03 12:20:59 +00:00 |
|
Tom Lin
|
949071b4b9
|
Add CI debugging
|
2021-12-03 12:20:40 +00:00 |
|
Tom Lin
|
c4ee2af23a
|
Add CI badge
|
2021-12-02 23:21:31 +00:00 |
|
Tom Lin
|
c378f93178
|
Fix action cache key
|
2021-12-02 18:32:58 +00:00 |
|
Tom Deakin
|
ed960d88a3
|
Update CHANGELOG.md
|
2021-12-02 10:41:55 +00:00 |
|
Tom Deakin
|
4ab6eac3a5
|
Merge pull request #102 from UoB-HPC/top-level-src
Move all C++ impl. to ./cpp and remove Makefiles
|
2021-12-02 10:30:46 +00:00 |
|
Tom Lin
|
7bf0b212d9
|
Fix CI missing omp library
Fix thrust model typo
|
2021-12-01 16:40:05 +00:00 |
|
Tom Lin
|
53f3b7b8a4
|
Fetch CL headers on-demand for ComputeCpp and OCL build
Minor CI adjustments for act
Fix TBB and Thrust cmake builds
|
2021-12-01 16:19:50 +00:00 |
|
Tom Lin
|
565c8c7f95
|
Fix cache path
|
2021-11-30 19:03:04 +00:00 |
|
Tom Lin
|
f5fe55c204
|
[WIP] Drop CL headers and Makefiles
Update README
Move new models to /src
|
2021-11-30 18:22:55 +00:00 |
|
Tom Lin
|
9326e98172
|
Merge branch 'main' into top-level-src
# Conflicts:
# .github/workflows/main.yaml
# .gitignore
# README.md
# src/CMakeLists.txt
# src/ci-test-compile.sh
|
2021-11-30 14:39:48 +00:00 |
|
Tom Deakin
|
53547ff664
|
Merge pull request #111 from UoB-HPC/thrust
Thrust Implementation
|
2021-11-25 13:17:48 +00:00 |
|
Tom Deakin
|
2ea930a0f5
|
Merge pull request #108 from UoB-HPC/java
Java implementation
|
2021-11-25 13:14:30 +00:00 |
|
Tom Lin
|
3d2f55da4a
|
Merge branch 'main' into java
|
2021-11-25 13:14:06 +00:00 |
|
Tom Deakin
|
1b147963d1
|
Merge pull request #103 from UoB-HPC/scala
Initial Scala implementation
|
2021-11-25 13:11:46 +00:00 |
|
Tom Deakin
|
d1b1722acf
|
Merge pull request #106 from UoB-HPC/julia
Julia implementation
|
2021-11-25 13:07:27 +00:00 |
|
Tom Lin
|
dc42388df3
|
Fix CXX recognition issues for rocThrust
Fix CI check for min CMake version on CUDA Thrust
Temporarily disable CUDA Thrust w/ TBB for now
|
2021-11-12 03:25:18 +00:00 |
|
Tom Lin
|
fe4007b446
|
Fix CI ROCm quirks
Fix CI CUDA path
|
2021-11-12 02:26:31 +00:00 |
|
Tom Lin
|
0d55a7261b
|
Fix CI not installing rocThrust
Fix CI CUDA flag version
|
2021-11-12 00:14:07 +00:00 |
|
Tom Lin
|
a463e88895
|
Fix CI rocThrust build variables
Fix CI CUDA cmake module include path
Bump CI NVHPC version
|
2021-11-11 23:50:27 +00:00 |
|
Tom Lin
|
c2f75b90b3
|
Fix CI NVHPC path
Fix CI ROCm install sources
|
2021-11-11 23:30:04 +00:00 |
|
Tom Lin
|
a66696d971
|
Initial Thrust implementation
|
2021-11-11 23:11:04 +00:00 |
|
Tom Lin
|
78b52a496c
|
Use @simd instead of @fastmath for CPU reduction
|
2021-08-28 11:39:08 +01:00 |
|
Tom Lin
|
41f1767365
|
Pause GC during benchmark to reduce noise
|
2021-08-28 11:16:19 +01:00 |
|
Tom Lin
|
13cb8ffced
|
Use custom static reduction for CPU
|
2021-08-28 11:10:49 +01:00 |
|
Tom Lin
|
4853457dca
|
Add type annotation for all kernels
Update dependencies
|
2021-08-27 14:04:58 +01:00 |
|
Tom Lin
|
c445b64690
|
Address CUDA comments
Drop soft=false for AMDGPU as this option was removed
Update dependencies
|
2021-08-18 02:00:50 +01:00 |
|
Tom Lin
|
bb271dd046
|
Update PlainStream with context
|
2021-08-18 01:59:06 +01:00 |
|
Tom Lin
|
a26699c5b5
|
Add oneAPI and KA implementation
Isolate projects to avoid transitive dependency
Add parameter for passing devices
Incorporate further reviews
Update all dependencies
|
2021-08-17 14:28:47 +01:00 |
|
Tom Deakin
|
8f9ca7baa7
|
update references in README
|
2021-07-28 10:37:25 +01:00 |
|
Tom Deakin
|
b4d01160cb
|
CITATION cannot yet handle external references
|
2021-07-28 10:31:39 +01:00 |
|
Tom Deakin
|
d8eba00132
|
Update CITATION.cff
|
2021-07-28 10:30:51 +01:00 |
|
Tom Deakin
|
064743299b
|
Update CITATION.cff
|
2021-07-28 10:30:18 +01:00 |
|
Tom Deakin
|
b766b0c707
|
Update CITATION.cff
|
2021-07-28 10:29:56 +01:00 |
|
Tom Deakin
|
537ad3650e
|
Add CITATION file
|
2021-07-28 10:28:34 +01:00 |
|
Tom Lin
|
867a8a32ee
|
Use older fmt-maven-plugin for Java 8 compat.
|
2021-07-01 06:05:10 +01:00 |
|
Tom Lin
|
82084d407b
|
+x for mvnw executable
|
2021-07-01 06:01:29 +01:00 |
|
Tom Lin
|
ab41475f10
|
Initial Java implementation
|
2021-07-01 05:59:48 +01:00 |
|
Tom Lin
|
7c1e04a42b
|
Add comment about blockIdx/workgroupIdx in Julia
|
2021-06-30 19:31:42 +01:00 |
|
Tom Lin
|
2e957d3f60
|
Inline blocks in CUDAStream
|
2021-06-30 19:20:37 +01:00 |
|
Tom Lin
|
418315543c
|
Use -p 2 and no arg for JuliaStream in CI
|
2021-06-30 19:09:37 +01:00 |
|
Tom Lin
|
d675875dcd
|
Switch back to -p for DistributedStream
|
2021-06-30 19:03:39 +01:00 |
|
Tom Lin
|
fe180656d1
|
Merge branch 'main' into julia
|
2021-06-30 18:44:17 +01:00 |
|
Tom Lin
|
4e6c56729b
|
Inline AMDGPU's hard_wait
Show the selected implementation and not a constant "threaded"
|
2021-06-30 18:09:54 +01:00 |
|
Tom Lin
|
6fe81e1955
|
Update CUDA to 11.3 for CI script
|
2021-06-30 16:31:14 +01:00 |
|
Tom Lin
|
ce7f013a8e
|
Update NVHPC to 2.15 w/ CUDA 11.3
|
2021-06-30 16:04:27 +01:00 |
|
Tom Lin
|
cd367c7da3
|
Mirror Fujitsu flags for CMake
|
2021-06-29 17:53:32 +01:00 |
|
Tom Deakin
|
fa6433bab1
|
update changelog
|
2021-06-25 09:45:38 -05:00 |
|