add maximum limit for grid of reduce, elementwise, gather and scatter #40813

FlyingQianMM · 2022-03-22T06:39:23Z

PR types

Bug fixes

PR changes

OPs

Describe

The grid number of reduce、elementwise and masked_select has not been limited, which may raise a bug like:

:parallel_for failed: cudaErrorInvalidConfiguration: invalid configuration argument.

So we add a maximum limit for grid of reduce, elementwise, gather and scatter kernel.

… develop_limit_grid

ZzSean · 2022-03-23T12:52:10Z

paddle/fluid/platform/device/gpu/gpu_launch_config.h

@@ -128,6 +128,8 @@ inline GpuLaunchConfig GetGpuLaunchConfig1D(
  // Number of threads per block shall be larger than 64.
  threads = std::max(64, threads);
  int blocks = DivUp(DivUp(numel, vec_size), threads);
+  int limit_blocks = context.GetCUDAMaxGridDimSize()[0];
+  if (blocks > limit_blocks) blocks = limit_blocks;


根据C++ Style要求，if条件最好加上大括号吧

已加上{}，感谢～

ZzSean · 2022-03-23T13:00:21Z

paddle/phi/kernels/funcs/reduce_function.h

@@ -1044,7 +1056,7 @@ void ReduceKernel(const KPDevice& dev_ctx,

  auto x_dim = phi::vectorize<int>(x.dims());
  auto config = ReduceConfig<Ty>(origin_reduce_dims, x_dim);
-  config.Run();
+  config.Run(x.place());


可不可以把这个LimitGridDim写在外部，就可以直接用dev_ctx了

config.Run()里面有对block数量做限制，所以把thread数量限制一起放在config.Run()里面了，这样可被复用

ZzSean

LGTM

Xreki · 2022-03-25T07:40:56Z

能不能给一下出错的算子配置、修改前的线程数和修改后的线程数？

FlyingQianMM added 2 commits March 22, 2022 06:37

add maximum limit for grid of reduce, elementwise and gather

7aa62c8

Merge branch 'develop' of /~https://github.com/PaddlePaddle/Paddle into…

dd1ece3

… develop_limit_grid

ZzSean reviewed Mar 23, 2022

View reviewed changes

add {} after if

669da77

ZzSean approved these changes Mar 25, 2022

View reviewed changes

FlyingQianMM merged commit 608a5f5 into PaddlePaddle:develop Mar 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add maximum limit for grid of reduce, elementwise, gather and scatter #40813

add maximum limit for grid of reduce, elementwise, gather and scatter #40813

FlyingQianMM commented Mar 22, 2022

ZzSean Mar 23, 2022

FlyingQianMM Mar 24, 2022

ZzSean Mar 23, 2022

FlyingQianMM Mar 24, 2022

ZzSean left a comment

Xreki commented Mar 25, 2022

add maximum limit for grid of reduce, elementwise, gather and scatter #40813

add maximum limit for grid of reduce, elementwise, gather and scatter #40813

Conversation

FlyingQianMM commented Mar 22, 2022

PR types

PR changes

Describe

ZzSean Mar 23, 2022

Choose a reason for hiding this comment

FlyingQianMM Mar 24, 2022

Choose a reason for hiding this comment

ZzSean Mar 23, 2022

Choose a reason for hiding this comment

FlyingQianMM Mar 24, 2022

Choose a reason for hiding this comment

ZzSean left a comment

Choose a reason for hiding this comment

Xreki commented Mar 25, 2022