Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convolution FP32 in oneDNN Changed from gemm:acl to gemm:ref in TensorFlow 2.16 #4068

Open
rpushkarr opened this issue Feb 20, 2025 · 0 comments

Comments

@rpushkarr
Copy link

In TensorFlow version 2.16 (and later), the convolution operation implementation for FP32 data type in oneDNN was changed from gemm:acl to gemm:ref. However, this change has resulted in performance degradation compared to TensorFlow version 2.15, where gemm:acl was used.

System Information:

  • TensorFlow Version: 2.16
  • Previous Working Version: 2.15
  • oneDNN Version: 3.2.1
  • Hardware: Aarch64
  • Operating System: Ubuntu 22.04

Issue Summary:

  • In TensorFlow 2.15, the convolution operation for FP32 data type was routed through gemm:acl in oneDNN, which provided better performance.
  • In TensorFlow 2.16 (and later), the implementation was changed to use gemm:ref, leading to a noticeable performance drop.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant