Tom Deakin
|
ad61a0dba1
|
Merge pull request #178 from UoB-HPC/revert-177-patch-5
Revert "Remove AdaptiveCpp workaround in dpl_shim.h to allow for automatic prefetch optimization"
|
2023-12-04 12:43:17 +00:00 |
|
Tom Deakin
|
438e210867
|
Revert "Remove AdaptiveCpp workaround in dpl_shim.h to allow for automatic prefetch optimization (#177)"
This reverts commit 06c3d534dd.
|
2023-12-04 12:42:33 +00:00 |
|
Aksel Alpay
|
06c3d534dd
|
Remove AdaptiveCpp workaround in dpl_shim.h to allow for automatic prefetch optimization (#177)
|
2023-12-04 11:13:37 +00:00 |
|
Mosè Giordano
|
78ba4ff37a
|
Rename Julia package to BabelStream.jl and add compats (#175)
|
2023-11-24 17:18:43 +00:00 |
|
kaanolgu
|
8389fc43a3
|
Update README.md (#172)
The sycl2020 model variant names needs update with the Babelstream 5.0
|
2023-11-05 21:43:33 +00:00 |
|
Tom Deakin
|
f6ae48de89
|
Merge pull request #170 from UoB-HPC/develop
Update CHANGELOG.md
|
2023-10-12 11:12:28 +01:00 |
|
Tom Deakin
|
7b6ee78ec4
|
Update CHANGELOG.md
|
2023-10-12 11:12:05 +01:00 |
|
Tom Deakin
|
f3801aeac2
|
Merge pull request #169 from UoB-HPC/develop
Merge develop for v5.0
|
2023-10-12 11:11:33 +01:00 |
|
Tom Deakin
|
773814f0f2
|
Update CHANGELOG.md
|
2023-10-12 11:11:19 +01:00 |
|
Tom Deakin
|
f3aaca06dd
|
Add Read and Init timings for Fortran (not csv)
|
2023-10-09 11:16:14 +01:00 |
|
Tom Lin
|
165db1749c
|
Update changelog for v5
|
2023-10-07 15:26:52 +01:00 |
|
Tom Lin
|
01ef17e8b4
|
Bump version to 5.0, resolves #167
|
2023-10-07 15:14:28 +01:00 |
|
Tom Lin
|
a27abfe296
|
Merge branch 'time_init_read' into develop
|
2023-10-07 15:09:52 +01:00 |
|
Tom Lin
|
145e2a0649
|
Merge branch 'fix_num_type' into develop
|
2023-10-07 15:09:44 +01:00 |
|
Tom Lin
|
3cb01e76a8
|
Add init/read timing for Julia
|
2023-10-07 14:59:26 +01:00 |
|
Tom Lin
|
e7774c1372
|
Update changelog for timing and version bump updates
|
2023-10-07 13:58:34 +01:00 |
|
Tom Lin
|
3de019c156
|
Add init/read timing for Java
Upgrade to TornadoVM 0.15 API
|
2023-10-07 13:50:58 +01:00 |
|
Tom Lin
|
971d1e8ac7
|
Add init/read timing for Scala
|
2023-10-07 12:10:08 +01:00 |
|
Tom Lin
|
512a6fac0c
|
Add init/read timing for Rust
|
2023-10-07 11:16:46 +01:00 |
|
Tom Lin
|
f2f7f3a3de
|
Fix bad dot group initialiser in HIP and CUDA
|
2023-10-07 11:12:08 +01:00 |
|
Tom Lin
|
5f3741e404
|
Add init/read timing for C++ models
|
2023-10-07 10:23:31 +01:00 |
|
Tom Lin
|
ffae3ba83f
|
Fix CMAKE_CUDA_FLAGS, resolves #166
|
2023-10-07 09:45:16 +01:00 |
|
Tom Lin
|
e347d2ff6c
|
Aggregate initialise numeric types, resolves #134
|
2023-10-07 09:41:18 +01:00 |
|
Tom Deakin
|
9954b7d38c
|
Set CUDA dot kernel to use number of blocks relative to device property
This aligns with the approach implemented in other models (SYCL 1.2.1 and HIP)
Cherry-picks the CUDA updates from lmeadows in #122
|
2023-10-06 17:56:42 +01:00 |
|
Troels Henriksen
|
92fed7082b
|
Add Futhark implementation (#146)
* Add Futhark.
|
2023-10-03 13:57:20 +01:00 |
|
Tom Lin
|
2e3ebeecab
|
Don't set CMAKE_BUILD_TYPE unconditionally
|
2023-10-01 21:48:27 +01:00 |
|
Tom Lin
|
369785c96a
|
Add HIP managed memory support, resolves #162
|
2023-09-25 01:41:06 +01:00 |
|
Tom Lin
|
bd6bb09b5d
|
Fix MEM flag for CUDA, resolves #163
|
2023-09-25 01:39:23 +01:00 |
|
Tom Lin
|
29b03be572
|
Update readme and changelog
|
2023-09-25 01:12:08 +01:00 |
|
Tom Lin
|
def6dadfd7
|
Merge remote-tracking branch 'origin/main' into develop
|
2023-09-25 01:06:22 +01:00 |
|
Tom Lin
|
42de93076c
|
Increase CI root reserve for ROCm and tmp spills 6
|
2023-09-25 00:10:12 +01:00 |
|
Tom Lin
|
e88043a5af
|
Increase CI root reserve for ROCm and tmp spills 5
|
2023-09-24 23:50:56 +01:00 |
|
Tom Lin
|
177416229c
|
Increase CI root reserve for ROCm and tmp spills 4
|
2023-09-24 23:23:40 +01:00 |
|
Tom Lin
|
5c9cb660ec
|
Increase CI root reserve for ROCm and tmp spills 3
|
2023-09-24 23:15:58 +01:00 |
|
Tom Lin
|
717c40cb3d
|
Increase CI root reserve for ROCm and tmp spills 2
|
2023-09-24 23:04:53 +01:00 |
|
Tom Lin
|
f47d27980f
|
Increase CI root reserve for ROCm and tmp spills
|
2023-09-24 22:44:47 +01:00 |
|
Tom Lin
|
5a1be9399c
|
Bump Julia lockfiles and compat to 1.9
|
2023-09-24 22:40:04 +01:00 |
|
Tom Lin
|
154ad9f297
|
Bump rust lockfiles
|
2023-09-24 21:51:26 +01:00 |
|
Tom Lin
|
aea4e7d2a4
|
Fix CI cache version
|
2023-09-24 21:45:47 +01:00 |
|
Tom Lin
|
a542f3bf67
|
Add experimental CI action for more disk space
Don't use CI action dependencies
Bump CI checkout/cache versions
|
2023-09-24 21:41:31 +01:00 |
|
Tom Lin
|
89a84fbbea
|
Merge branch 'develop' into option_for_vec
# Conflicts:
# src/std-indices/STDIndicesStream.h
|
2023-09-24 21:17:47 +01:00 |
|
Tom Lin
|
324361aa83
|
Merge pull request #157 from illuhad/patch-3
std-indices: Use forward iterator tag to align with the implemented operators
|
2023-09-24 21:13:27 +01:00 |
|
Tom Lin
|
a0c5c00c9c
|
Merge pull request #155 from illuhad/patch-1
std-indices: Fix infinite recursion in ranged::operator!=
|
2023-09-24 21:13:02 +01:00 |
|
Tom Lin
|
72be9f6980
|
Fix up CI, resolves #145, supersedes #154
Drop vector for std-* models
|
2023-09-24 21:11:35 +01:00 |
|
Tom Lin
|
3dcafd1af1
|
Fix max element guard overflow for CUDA, resolves #136
|
2023-09-22 02:31:14 +01:00 |
|
Tom Lin
|
87a38e949d
|
Fix SYCL2020 accessors typo
|
2023-09-05 03:35:17 +01:00 |
|
Tom Lin
|
3f7bb631e1
|
Initial SYCL2020 USM implementation
|
2023-09-05 03:29:16 +01:00 |
|
Tom Lin
|
b25fd755a6
|
Merge branch 'develop' into option_for_vec
|
2023-09-05 02:44:43 +01:00 |
|
Tom Lin
|
28dcf6f962
|
Bump oneDPL version
|
2023-09-01 03:48:19 +01:00 |
|
Tom Lin
|
eef3221df6
|
Qualify all math calls
|
2023-08-31 17:53:50 +01:00 |
|