Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SYCL] Implement SYCL part of sycl_ext_oneapi_prefetch #11458
[SYCL] Implement SYCL part of sycl_ext_oneapi_prefetch #11458
Changes from 1 commit
6ab79b1
c4123e9
549f9e8
19e19a8
30e531a
42de79f
2d5a360
c40e4bd
5de267d
7afe1c9
886f2ab
a8e907b
01a8b5a
154660a
82badcc
7ecdbb1
ebc3b23
a3dbe96
9e81d89
3e554ac
024c901
c796b9e
bf3d6c9
6d9de5a
9ff283e
9b2ecc8
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This doesn't feel right to me, but it's outside of SYCL RT...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks like currently it's the only way to transform these properties into spirv decorations for this pointer. I tried to create
spirv.Decorations
metadata instead, but the compiler eliminates them with optimization flags.Annotation also may be eliminated, but it's much less likely.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we know for sure that
count
elements are consecutive in memory?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point. @Pennycook I'm not sure how it's intended to work in case of N-dim offset. Should we call the __spirv_ocl_prefetch spirv instruction several times for different memory segments in such case or there should be some constraint?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We struggled a bit with multi-dimensional prefetches (see the issues).
Where we landed (for now) is that the block being prefetched is assumed to be contiguous, and only the offset itself is multi-dimensional. It's effectively a shorthand to avoid computing the linear offset from the start of the buffer. Note that the specification says for the multi-dimensional cases:
If somebody requests a multi-dimensional prefetch later, we can describe it with a
range
parameter in place of asize_t
count, and implement it the way you suggested (by calling the instruction multiple times).There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are these manually written? If so, do we have an utility script to generate them automatically? If yes, one might need to disable instrumentation/use
-O1
to get a more readable IR.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this can be
single_task
. I'd also want to see an E2E test with this used in non-uniform control flow (I don't think the spec prohibits that).There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Moved to single task - 7afe1c9. I'm going to add E2E tests a bit later when the whole feature (incl. llvm-spirv translator part) is done. Not sure what do you mean by "non-uniform control". Could you please explain a bit?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Now that you've added
joint_prefetch
that just delegates to the per-WI one, I'm even more concerned about non-uniform control flow scenario...