Skip to content

[WIP] ThreadVector implementations kernels for SET, SCAL, GEMV and GEMM#1556

Draft
mzuzek wants to merge 23 commits intokokkos:developfrom
NexGenAnalytics:threadvector-kernels
Draft

[WIP] ThreadVector implementations kernels for SET, SCAL, GEMV and GEMM#1556
mzuzek wants to merge 23 commits intokokkos:developfrom
NexGenAnalytics:threadvector-kernels

Conversation

@mzuzek
Copy link
Copy Markdown

@mzuzek mzuzek commented Sep 29, 2022

WIP: based on #1540 and #1550 - please review and merge those first


This PR introduces ThreadVector{Set,Scale,Gemv,Gemm} implementations for functor-level kernels, which use ThreadVectorRange and thus can be executed under TeamPolicy from within TeamThreadRange in application.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant