
Enhance the unit test framework to explicitly test whether the operator correctly handles gradients for multiple inputs. #3857

Merged · 4 commits · Sep 5, 2017

Conversation

@qingqing01 (Contributor) commented Sep 4, 2017

Fix #3797

If none of the forward operator's input gradients need to be computed, the backward operator becomes a NOP operator: /~https://github.com/PaddlePaddle/Paddle/blob/develop/paddle/framework/backward.cc#L75

So there is no need to handle this case for a backward op with only one output.
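The NOP fallback described above can be sketched in Python (a minimal sketch; `NopOp`, `build_backward_op`, and the `@GRAD` suffix convention are illustrative stand-ins, not Paddle's actual C++ API):

```python
class NopOp:
    """Placeholder backward operator that performs no computation."""
    def run(self):
        pass

def build_backward_op(input_names, no_grad_set, make_grad_op):
    # If every input gradient of the forward op is marked as not
    # needed, the backward op degenerates into a NOP, mirroring the
    # check in paddle/framework/backward.cc.
    grad_names = {name + "@GRAD" for name in input_names}
    if grad_names.issubset(no_grad_set):
        return NopOp()
    return make_grad_op(no_grad_set)
```

With all gradients ignored the builder returns a `NopOp`; with any gradient still required it delegates to the real gradient-op factory.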

# input_vars, check_names, place)
# In fact, the above two lines could replace the following code.
# But most gradient operators need to handle the case where one or
# more input gradients are not needed.
A collaborator suggested:

one of more of the gradient of the input is --> some input gradients are

@reyoung (Collaborator) commented Sep 4, 2017

Awesome work making mul/rowwise_add able to ignore some input gradients (IGs).

However, I think the gradient unit-test framework should not be changed, for the following reasons.

If a user wants to check a mul operator's gradient while ignoring some IGs, he can invoke check_grad multiple times, like check_grad("mul", input_to_check=["X"], ignore=["Y"]) and check_grad("mul", input_to_check=["Y"], ignore=["X"]).

  • The disadvantage of invoking check_grad many times is that the numeric gradient is computed repeatedly. This can be slow, but unit tests are not run by end users, so it is acceptable.
  • Having users invoke check_grad themselves handles the case where some IGs cannot be ignored. For example, if OpA has two inputs 'X' and 'Y' and the gradient of 'Y' must always be calculated, then YG should never be nullptr.
  • Having users invoke check_grad makes the error log clearer. The error could read: check mul gradient error when calculating X and ignoring Y.
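The two separate invocations proposed above might look like this in a test (a sketch; `FakeChecker` and this `check_grad` signature are hypothetical stand-ins modeled on the discussion, not the framework's exact API):

```python
class FakeChecker:
    """Stand-in for the gradient checker; records each invocation."""
    def __init__(self):
        self.calls = []

    def check_grad(self, op_type, input_to_check, no_grad_set):
        # A real checker would compare numeric and analytic gradients
        # for each name in input_to_check, skipping no_grad_set.
        self.calls.append(
            (op_type, tuple(input_to_check), frozenset(no_grad_set)))

checker = FakeChecker()
# Check the gradient of X while ignoring Y's gradient, then vice versa.
checker.check_grad("mul", input_to_check=["X"], no_grad_set={"Y"})
checker.check_grad("mul", input_to_check=["Y"], no_grad_set={"X"})
```

Each call exercises one "some IGs ignored" configuration, and a failure points directly at the configuration that broke.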

A collaborator commented:

Please review qingqing01#1

@qingqing01 (Author) commented Sep 5, 2017

@reyoung Thanks! I also thought at first that it was not good to change the check_grad code. I have merged your patch.

@reyoung (Collaborator) left a review:

LGTM.

@reyoung reyoung merged commit b64aac5 into PaddlePaddle:develop Sep 5, 2017
self.inputs, ["X"],
"Out",
max_relative_error=0.5,
no_grad_set={"Y"})
A collaborator commented:

@qingqing01 The framework should automatically test cases with certain input gradients ignored, without explicitly writing code here. We should change GradientChecker.check_grad() to do this.

@qingqing01 (Author) commented Sep 7, 2017

@emailweixu Thanks for your comments. OK — in the first commit (4470332), check_grad did this testing automatically. I'll change it again :).

A collaborator replied:

> The framework should automatically test cases with certain input gradients ignored without explicitly writing code here. We should change GradientChecker.check_grad() to do this.

  • Having users invoke check_grad themselves handles the case where some IGs cannot be ignored. For example, if OpA has two inputs 'X' and 'Y' and the gradient of 'Y' must always be calculated, then YG should never be nullptr.
    • Also, check_grad is used not only to check a single operator's gradient but also the gradients inside a whole neural network. It is hard to test a whole network automatically.
  • Having users invoke check_grad makes the error log clearer. The error would be a test_mul_grad_ignore_x failure.
  • To keep the user interface simple, there could be a parameter in check_grad, such as test_no_input_grad. If test_no_input_grad=True, multiple check_grads would be invoked.
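The proposed `test_no_input_grad` behavior could be sketched as a loop over the inputs (a sketch; `RecordingChecker`, `check_all_ignored_combinations`, and this `check_grad` signature are hypothetical, chosen only to illustrate the idea):

```python
class RecordingChecker:
    """Minimal stand-in that records check_grad invocations."""
    def __init__(self):
        self.calls = []

    def check_grad(self, op_type, input_to_check, no_grad_set):
        self.calls.append(
            (op_type, tuple(input_to_check), frozenset(no_grad_set)))

def check_all_ignored_combinations(checker, op_type, input_names,
                                   test_no_input_grad=True):
    # First check all input gradients together; then, when enabled,
    # check each input with every other input's gradient ignored,
    # one check_grad call per input.
    checker.check_grad(op_type, input_to_check=list(input_names),
                       no_grad_set=set())
    if test_no_input_grad:
        for name in input_names:
            others = set(input_names) - {name}
            checker.check_grad(op_type, input_to_check=[name],
                               no_grad_set=others)

checker = RecordingChecker()
check_all_ignored_combinations(checker, "mul", ["X", "Y"])
```

For a two-input operator this issues three checks: the full check plus one per ignored input, without the test author writing each case by hand.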

@qingqing01 qingqing01 deleted the grad_test_for_multi_inputs branch November 14, 2019 05:20