-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Paddle Inference] Implement conv2d_fusion NHWC format using cutlass #47989
[Paddle Inference] Implement conv2d_fusion NHWC format using cutlass #47989
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
paddle/phi/kernels/fusion/cutlass/conv2d/conv2d_bias_add_relu.cu
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
需要增加单测
paddle/phi/kernels/fusion/cutlass/conv2d/conv2d_bias_relu_few_channels.cu
Show resolved
Hide resolved
ceacbf9
ceacbf9
to
9dd83bf
Compare
… brought by others
done! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for type registration. We approve conv for not having int kernel
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for including fluid header in phi
PR types
Performance optimization
PR changes
Others
Describe
config.exp_enable_use_cutlass()
或C++ APIconfig.Exp_EnableUseCutlass()
即可使用cutlass来进行fp16 推理。Paddle-Inference Demo resnet50测试数据,算上DTH和HTD