SYCL: SOFTMAX F16 mask support and other fixes #11261

qnixsynapse · 2025-01-16T07:53:40Z

Implemented ggml_sycl_op_soft_max() F16 src1(mask) support for which a pragma deprecation warning was added during #5021.
To do this, had to decouple it from ggml_sycl_op_flatten which always considered src1 to be of fp32 type(many OP functions are dependent on it).

Also, replaced std::max with sycl::max in the softmax kernel. There was not a single test with F16 mask in the test-backend-ops so I manually had to add such a test locally and I can confirm that it passed on my machine. This PR did not add that test. Reviewers are requested to test it thoroughly on their machines.

Not sure why this was necessary. The models which I tested do not use F16 mask.
Also did few cleanups.

github-actions bot added ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Jan 16, 2025

qnixsynapse force-pushed the softmax branch from 9af1835 to 90e7db9 Compare January 17, 2025 12:22

SYCL: SOFTMAX F16 mask support and other fixes

1e2fe41

qnixsynapse force-pushed the softmax branch from 90e7db9 to 1e2fe41 Compare January 19, 2025 14:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SYCL: SOFTMAX F16 mask support and other fixes #11261

SYCL: SOFTMAX F16 mask support and other fixes #11261

qnixsynapse commented Jan 16, 2025

SYCL: SOFTMAX F16 mask support and other fixes #11261

Are you sure you want to change the base?

SYCL: SOFTMAX F16 mask support and other fixes #11261

Conversation

qnixsynapse commented Jan 16, 2025