[oneDNN] Second fix to #33021 #33471
Conversation
Thanks for your contribution!
@@ -353,6 +351,9 @@ void AnalysisPredictor::MkldnnPreSet(
     VLOG(2) << "Set input shape=" << ss.str();
     platform::MKLDNNDeviceContext::tls().set_cur_input_shape_str(ss.str());
   }
+  platform::MKLDNNDeviceContext::tls().set_cur_input_shape_cache_capacity(
+      config_.mkldnn_cache_capacity_);
Is there any difference from moving the position of this line of code? What is influenced?
Previously, cache_capacity was only set when it was non-zero (cache clearing mode). Since zero is the default value, it was not set in that case. But suppose you have two predictors: one working with a restricted cache (clearing mode) and the other without this limitation. If you run the executor with a restricted cache size and then go on to execute the second executor, which runs on default settings, you would actually use restricted caching again. We used to revert to the defaults in MkldnnPostReset(), but we cannot do that anymore, so I had to move this resetting to the point where execution begins.
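To make the scenario concrete, here is a minimal sketch (an assumption for illustration, not code from this PR) of two predictors sharing a thread, one in cache clearing mode and one on defaults; the model paths, function name, and capacity value are placeholders:

```cpp
#include "paddle_inference_api.h"

// Assumed illustration of the scenario described above, using Paddle's
// C++ AnalysisConfig API. Paths and the capacity value are placeholders.
void TwoPredictorsOneThread() {
  paddle::AnalysisConfig cfg_a;
  cfg_a.SetModel("/path/to/model_a");  // placeholder
  cfg_a.EnableMKLDNN();
  cfg_a.SetMkldnnCacheCapacity(10);    // non-zero: cache clearing mode

  paddle::AnalysisConfig cfg_b;
  cfg_b.SetModel("/path/to/model_b");  // placeholder
  cfg_b.EnableMKLDNN();                // capacity left at the default 0

  auto pred_a = paddle::CreatePaddlePredictor(cfg_a);
  auto pred_b = paddle::CreatePaddlePredictor(cfg_b);

  pred_a->ZeroCopyRun();  // thread-local cache capacity becomes 10
  // Before this fix, a zero capacity was never written back, so this run
  // still saw A's restricted capacity instead of B's default settings.
  pred_b->ZeroCopyRun();
}
```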
Oh, OK, it is now in the PreSet function. Sorry, I did not notice. Thanks for your explanation.
    out_data.resize(out_num);
    output_t->CopyToCpu(out_data.data());

    // Release predictor (relevant cache should be emptied)
"predictor.reset(nullptr);" I feel we don't call often call this in API tests. Usually we just do predictor.Run(), that's all and seems assuming that cache will be release during deconstructor of some object?
What will be the result if we remove this predictor.reset(nullptr) ?
predictor.reset(nullptr) enforces that the predictor is released, and its cached objects with it. Normally you do not need to do this, as the predictor is released when it goes out of scope, but here I want to check that the cache is released, so I could not wait until the end of the scope.
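For illustration, a hedged sketch of that test pattern (the helper name is hypothetical, and it assumes CreatePaddlePredictor returns a std::unique_ptr, which is what makes reset(nullptr) valid):

```cpp
#include <memory>
#include "paddle_inference_api.h"

// Hypothetical test helper illustrating the explicit release discussed
// above; this is not the PR's actual test code.
void RunAndReleaseEarly(const paddle::AnalysisConfig& config) {
  auto predictor = paddle::CreatePaddlePredictor(config);
  predictor->ZeroCopyRun();

  // Destroy the predictor now instead of waiting for scope exit, so the
  // test can verify that its cached oneDNN objects were freed with it.
  predictor.reset(nullptr);

  // ... an assertion on the oneDNN cache size would go here ...
}
```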
LGTM
LGTM.
LGTM
As agreed with @juncaipeng: after passing internal review (two reviewers), PRs specific to oneDNN are allowed to be merged.
PR types
Bug fixes
PR changes
Others
Describe
This is the second part of the fixes to #33021. The problem addressed here was that the Predictor, after finishing execution of Run(), restores the default cache settings. The default setting is an unrestricted cache size (cache_capacity = 0). However, after Run() is completed, Tensor::CopyToCpu() is sometimes called; it uses oneDNN objects and could make the cache grow.
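A sketch of the problematic sequence, assuming the output is fetched after Run() as in the test snippet above (tensor handling simplified; the function name is hypothetical):

```cpp
#include <functional>
#include <numeric>
#include <vector>
#include "paddle_inference_api.h"

// Assumed illustration: CopyToCpu() runs after Run() has finished, so a
// post-Run reset of cache_capacity back to 0 would let it grow the cache.
void RunThenFetch(paddle::PaddlePredictor* predictor) {
  predictor->ZeroCopyRun();  // restricted cache capacity in effect here

  auto names = predictor->GetOutputNames();
  auto output_t = predictor->GetOutputTensor(names[0]);
  auto shape = output_t->shape();
  int out_num = std::accumulate(shape.begin(), shape.end(), 1,
                                std::multiplies<int>());

  std::vector<float> out_data(out_num);
  // Uses oneDNN objects even though Run() is over; with the old post-Run
  // reset this happened under the default unrestricted capacity.
  output_t->CopyToCpu(out_data.data());
}
```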