[Phi] Support cudnn kernel moving & move softmax kernels #39547
Conversation
Thanks for your contribution!
LGTM for op benchmark
 public:
  using Type = float;
};
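For context, a fragment like this typically belongs to a mixed-precision type trait that maps fp16 to float for intermediate accumulation. A hypothetical self-contained sketch (the float16 stand-in and trait layout here are assumptions for illustration, not copied from the diff):

```cpp
#include <type_traits>

// Hypothetical fp16 stand-in; Paddle defines its own float16 type.
struct float16 {};

// Maps a compute type to the type used for intermediate accumulation:
// most types accumulate in themselves, but fp16 accumulates in float
// to avoid precision loss.
template <typename T>
class MPTypeTrait {
 public:
  using Type = T;
};

template <>
class MPTypeTrait<float16> {
 public:
  using Type = float;  // the specialization shown in the diff above
};
```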
This duplicates paddle/fluid/operators/amp/fp16_type_traits.h; consider merging the two later.
Yes, this was moved over. Going forward, using paddle/fluid/operators/amp/fp16_type_traits.h under fluid is not recommended, and that file can be deleted.
LGTM
LGTM
LGTM for op benchmark
PR types
New features
PR changes
Others
Describe
[Phi] Support cudnn kernel moving
Support registering GPUDNN kernels in phi, and migrate the softmax op kernel as verification.
Since there is no standalone GPUDNNContext, this introduces a problem into the current system. When we write using CUDNNContext = GPUContext, GPU and cuDNN kernel functions that share the same name collide at instantiation. For example,
SoftmaxKernel<float, GPUContext>
and SoftmaxKernel<float, GPUDNNContext>
are in fact one and the same instantiation, even though their implementations differ. At compile time the GPUDNN kernel instantiation is therefore overridden: what GPUDNN actually registers is the GPU kernel, which produces incorrect computation results at runtime. Because the design of GPUDNNContext is not yet settled, and it is essentially identical to the current GPUContext, the conflict is resolved for now by renaming: the GPUDNN kernel is named SoftmaxGPUDNNKernel.
The downside is that
SoftmaxKernel<T, Context>
can no longer dispatch to SoftmaxGPUDNNKernel, so the gpudnn kernels form a small subsystem of their own. The impact is limited, since there are very few cudnn kernels (single digits). TODO: confirm whether the original CUDA SoftmaxKernel still needs to be kept; if not, it would be more appropriate to delete it and move the cudnn kernel under gpu.