MKLDNN Perplexity Issue #13515
Comments
@mxnet-label-bot add [MKLDNN,Performance]
This is no longer about performance, it is an issue with model perplexity.
@mxnet-label-bot remove [Performance]
Adding @pengzhao-intel - can you look into it on the Intel side and discuss it with the MKLDNN team?
@azai91 @lupesko commit d772a4b
commit a29185a
@pengzhao-intel
@Vikas89 there are different ways to implement and train an LSTM-based language model on the PTB dataset. Can you please share the training script?
@Vikas89 could you please share how to reproduce the issue?
@mseth10 please share the test training script, the steps to run it, and the instance type (I think it's c5.18x, but please confirm) with Patric.
@pengzhao-intel Manu has started the training script to verify. I think a bug was introduced in our benchmarking setup that caused it to conclude results after 5 epochs. We fixed the benchmark setup and the graphs seem to be back to normal. We will observe a few more runs and then close this.
Good to know the problem is fixed :) Feel free to ping me if you need our help with anything.
@azai91 please close this issue. The benchmark is back to normal.
The recent upgrade to 0.17.1 (#13369) has addressed the throughput issue. However, the perplexity of the LSTM increases dramatically (https://www.dropbox.com/s/lnp1dc9uvwhfcqh/Screenshot%202018-12-03%2011.22.10.png?dl=0).
For Q & A and discussion, please start a discussion thread at https://discuss.mxnet.io
Description
(Brief description of the problem in no more than 2 sentences.)
Environment info (Required)
Package used (Python/R/Scala/Julia):
(I'm using ...)
For Scala user, please provide:
java -version
mvn -version
scala -version
For R user, please provide R sessionInfo():
Build info (Required if built from source)
Compiler (gcc/clang/mingw/visual studio):
MXNet commit hash:
(Paste the output of git rev-parse HEAD here.)
Build config:
(Paste the content of config.mk, or the build command.)
Error Message:
(Paste the complete error message, including stack trace.)
Minimum reproducible example
(If you are using your own code, please provide a short script that reproduces the error. Otherwise, please provide link to the existing example.)
python word_language_model/lstm_bucketing.py --num-hidden 650 --num-embed 650 --gpus 0 --epochs 30 --kv-store local
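For anyone comparing runs: language-model perplexity is conventionally the exponential of the mean per-token negative log-likelihood, so a small rise in average loss shows up as a much larger rise in perplexity. The sketch below illustrates that definition only. The function name and inputs are hypothetical and are not taken from lstm_bucketing.py, and it is an assumption that the benchmark reports perplexity this way.

```python
import math

def perplexity(token_nll):
    """Return exp(mean per-token negative log-likelihood, in nats).

    `token_nll` is a hypothetical list of per-token losses; it is not
    an interface of the training script referenced above.
    """
    if not token_nll:
        raise ValueError("need at least one token loss")
    return math.exp(sum(token_nll) / len(token_nll))

# A model that assigns probability 1/4 to every target token
# has a perplexity of approximately 4.
uniform_losses = [-math.log(0.25)] * 10
print(perplexity(uniform_losses))
```

Since perplexity typically keeps falling over the course of training, values read after 5 epochs and after 30 epochs are not directly comparable.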