
Memory optimization in convolution layer by the grouped im2col and gemm. #6802

Closed. 5 commits.

Conversation


@qingqing01 (Contributor) commented Dec 20, 2017

Fix #6801

  • Reduce the memory usage of the convolution layer by using grouped im2col+gemm.
    • Add a flag, FLAGS_conv_workspace_limit_in_mb, that bounds the workspace and determines whether this path is enabled.
  • Add unit tests.
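To illustrate the idea behind the grouped approach: standard im2col materializes a column buffer of shape (inChannels * kH * kW) x (outH * outW) for a whole image at once, while the grouped variant processes output positions in chunks so only a bounded slice of that buffer is live at a time. A minimal sketch of the workspace arithmetic (the helper names here are illustrative, not PaddlePaddle's actual API):

```cpp
#include <cassert>
#include <cstddef>

// Elements needed by the full (ungrouped) im2col column buffer:
// one row of (inC * kH * kW) values per output spatial position.
size_t fullColElements(size_t inC, size_t kH, size_t kW,
                       size_t outH, size_t outW) {
  return inC * kH * kW * outH * outW;
}

// Pick the largest chunk of output positions whose column buffer fits
// within workspaceLimitBytes; fall back to one position per step.
// (Hypothetical helper for illustration only.)
size_t chunkForLimit(size_t inC, size_t kH, size_t kW, size_t outPositions,
                     size_t workspaceLimitBytes, size_t elemSize) {
  size_t rowBytes = inC * kH * kW * elemSize;  // bytes per output position
  size_t chunk = workspaceLimitBytes / rowBytes;
  if (chunk == 0) chunk = 1;                   // always make progress
  if (chunk > outPositions) chunk = outPositions;
  return chunk;
}
```

With this bound, the convolution runs im2col and gemm once per chunk, trading a few extra gemm calls for a workspace that never exceeds the configured limit.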

@qingqing01 (Contributor, Author) commented:

In addition to the unit tests, this was also tested on a real model.

resizeBuffer<Device>(colShape.getElements());
size_t workspaceSize = colShape.getElements() * sizeof(real);
double memoryLimitBytes =
    static_cast<double>(1LL << 20) * FLAGS_conv_workspace_limit_in_mb;
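The snippet above converts the MB-denominated flag into bytes via `1LL << 20` (1 MB = 1048576 bytes) and compares it against the col buffer's size. A minimal sketch of that decision (the function name is illustrative; only the flag name comes from the PR):

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Returns true when the full col buffer would exceed the configured
// workspace limit, in which case the grouped im2col path would be taken.
// (Illustrative sketch, not the PR's exact code.)
bool exceedsWorkspaceLimit(size_t colElements, size_t elemSize,
                           int64_t limitInMb) {
  double workspaceBytes =
      static_cast<double>(colElements) * static_cast<double>(elemSize);
  double limitBytes = static_cast<double>(1LL << 20) * limitInMb;  // MB -> B
  return workspaceBytes > limitBytes;
}
```

For example, a buffer of 300000 floats (1.2 MB at 4 bytes each) exceeds a 1 MB limit, while 200000 floats (0.8 MB) does not.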
A Contributor commented:

Ordinary users will not know how to set FLAGS_conv_workspace_limit_in_mb. This implementation does reduce temporary memory usage, but the code is not user-friendly and is hard to use. Building on this idea, the implementation can be further optimized; see #7034.

@qingqing01 qingqing01 closed this Dec 27, 2017
@qingqing01 qingqing01 deleted the mobile_mem branch November 14, 2019 05:26