Commit Graph

10 Commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Tom Lin | 4853457dca | Add type annotation for all kernels; Update dependencies | 2021-08-27 14:04:58 +01:00 |
| Tom Lin | c445b64690 | Address CUDA comments; Drop soft=false for AMDGPU as this option was removed; Update dependencies | 2021-08-18 02:00:50 +01:00 |
| Tom Lin | bb271dd046 | Update PlainStream with context | 2021-08-18 01:59:06 +01:00 |
| Tom Lin | a26699c5b5 | Add oneAPI and KA implementation; Isolate projects to avoid transitive dependency; Add parameter for passing devices; Incorporate further reviews; Update all dependencies | 2021-08-17 14:28:47 +01:00 |
| Tom Lin | 7c1e04a42b | Add comment about blockIdx/workgroupIdx in Julia | 2021-06-30 19:31:42 +01:00 |
| Tom Lin | 2e957d3f60 | Inline blocks in CUDAStream | 2021-06-30 19:20:37 +01:00 |
| Tom Lin | d675875dcd | Switch back to -p for DistributedStream | 2021-06-30 19:03:39 +01:00 |
| Tom Lin | 4e6c56729b | Inline AMDGPU's hard_wait; Show the selected implementation and not a constant "threaded" | 2021-06-30 18:09:54 +01:00 |
| Tom Lin | 2cf8ca5f8c | Use addprocs() for DistributedStream | 2021-06-10 04:57:52 +01:00 |
| Tom Lin | b3efa6af67 | Initial Julia implementation | 2021-06-10 04:20:40 +01:00 |