This repository has been archived by the owner on Nov 17, 2023. It is now read-only.
Register fake grad to subgraph and quantized operators #14275
Merged
+114
−21
9 commits
8a3c106 add fake grad (xinyu-intel)
3da11fb Merge remote-tracking branch 'upstream/master' into fake_grad (xinyu-intel)
c9e7116 Merge remote-tracking branch 'upstream/master' into fake_grad (xinyu-intel)
237376d Skip inference only subgraph pass when gradient is needed. (xinyu-intel)
4d16e29 add fake grad to quantizev2 (xinyu-intel)
6f06baa add TODO (xinyu-intel)
a1a31c0 modify prop_name to property_name (xinyu-intel)
869b601 Merge remote-tracking branch 'upstream/master' into fake_grad (xinyu-intel)
61d4d16 add test case (xinyu-intel)
@@ -82,6 +82,9 @@ where
 .set_attr<nnvm::FInferShape>("FInferShape", QuantizeShape)
 .set_attr<nnvm::FInferType>("FInferType", QuantizeType)
 .set_attr<FInferStorageType>("FInferStorageType", QuantizeStorageType)
+// TODO(Xinyu): a temp solution to enable GluonCV INT8 flow,
+// will be reverted after the improvement of CachedOP is done.
+.set_attr<nnvm::FGradient>("FGradient", MakeZeroGradNodes)
 #if MXNET_USE_MKLDNN == 1
 .set_attr<bool>("TIsMKLDNN", true)
 .set_attr<FComputeEx>("FComputeEx<cpu>", MKLDNNQuantizeCompute)

Review comment (on the TODO): Is this currently WIP? Can you open an issue for this?
Reply: #14331 added :)
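To illustrate what registering a zero ("fake") gradient buys, here is a minimal sketch in plain Python. It is a toy model, not MXNet code: `GRAD_REGISTRY`, `zero_grad`, and `backward` are hypothetical names invented for this example. The assumption is that an operator with no registered gradient makes the backward pass fail, while a zero-gradient placeholder lets the backward pass be built even though no useful signal flows through the operator.

```python
# Toy model only: these names are hypothetical and are not MXNet APIs.
from typing import Callable, Dict, List

Vector = List[float]
GradFn = Callable[[Vector, Vector], Vector]

# Hypothetical registry mapping an operator name to its gradient function.
GRAD_REGISTRY: Dict[str, GradFn] = {}


def zero_grad(inputs: Vector, out_grad: Vector) -> Vector:
    """Placeholder gradient: ignore out_grad, return zeros shaped like inputs."""
    return [0.0 for _ in inputs]


def backward(op_name: str, inputs: Vector, out_grad: Vector) -> Vector:
    """Look up the gradient for op_name; fail if none was registered."""
    if op_name not in GRAD_REGISTRY:
        raise RuntimeError(f"operator {op_name!r} is not differentiable")
    return GRAD_REGISTRY[op_name](inputs, out_grad)


# Without a registered gradient, asking for a backward pass fails.
try:
    backward("quantize", [0.5, -1.25], [1.0, 1.0])
except RuntimeError as err:
    print(err)

# With the placeholder registered, backward succeeds but carries no signal.
GRAD_REGISTRY["quantize"] = zero_grad
print(backward("quantize", [0.5, -1.25], [1.0, 1.0]))
```

The placeholder trades correctness of the gradient for the ability to construct a backward graph at all, which is consistent with the TODO describing it as a temporary solution.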
@@ -107,7 +107,11 @@ class SgMKLDNNConvPostQuantizeProperty : public SubgraphProperty {
     }
   }
   static SubgraphPropertyPtr Create() {
-    return std::make_shared<SgMKLDNNConvPostQuantizeProperty>();
+    auto property = std::make_shared<SgMKLDNNConvPostQuantizeProperty>();
+    property->SetAttr<std::string>("prop_name",
+                                   "MKLDNN Convolution post-quantization optimization pass");
+    property->SetAttr<bool>("inference_only", true);
+    return property;
   }
   nnvm::NodePtr CreateSubgraphNode(const nnvm::Symbol &sym,
                                    const int subgraph_id = 0) const override {

Review comment (on the "prop_name" line): Nit: can we use "property_name" or just "name" here? Because "prop" also stands for "propagation" somewhere.
Can we add a test for this case: a module bound with for_training=True, with the symbol then reused to invoke graph partitioning with MKLDNN? The returned symbol shouldn't contain MKLDNN subgraph ops.
The problem is that we can't get the symbol after the module is bound, so we can't check whether MKLDNN graph partitioning happened or not.
Why can't we use mod._sym and call get_backend_symbol("MKLDNN") on it?
Because there's no Python API that can return the partitioned symbol after bind; even mod._sym holds the original symbol. And get_backend_symbol("MKLDNN") will apply partitioning and won't skip the inference_only pass. inference_only only works for bind.
makes sense.
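The distinction drawn in this thread can be sketched with a small toy model. This is plain Python, not MXNet: `ToyProperty`, `passes_for_bind`, and `passes_for_explicit_partition` are hypothetical names, and the attribute keys simply mirror the ones set in the diff. The assumption, taken from the thread, is that the bind path skips passes marked inference_only when gradients are needed, while an explicit partition call runs every pass regardless of the flag.

```python
# Toy model only: class and function names are hypothetical, not MXNet APIs.
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class ToyProperty:
    """Stand-in for a subgraph property carrying string-keyed attributes."""
    attrs: Dict[str, Any] = field(default_factory=dict)

    def is_inference_only(self) -> bool:
        # A property without the attribute is treated as safe for training.
        return bool(self.attrs.get("inference_only", False))


def passes_for_bind(props: List[ToyProperty], need_grad: bool) -> List[str]:
    """Bind path: drop inference-only passes when gradients are needed."""
    return [p.attrs["prop_name"] for p in props
            if not (need_grad and p.is_inference_only())]


def passes_for_explicit_partition(props: List[ToyProperty]) -> List[str]:
    """Explicit partition path: every pass runs, regardless of the flag."""
    return [p.attrs["prop_name"] for p in props]


props = [
    ToyProperty({"prop_name": "post-quantization fusion", "inference_only": True}),
    ToyProperty({"prop_name": "training-safe pass"}),
]

print(passes_for_bind(props, need_grad=True))
print(passes_for_bind(props, need_grad=False))
print(passes_for_explicit_partition(props))
```

Under this model, a test that calls the explicit partition path on a symbol taken from a training-bound module would still see the fused pass applied, which is why the proposed check cannot distinguish the two cases.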