-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Conversation
@mxnet-label-bot add[Example, Gluon, pr-awaiting-review] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I haven't been able to test this end-to-end yet. I've tried the data download process a couple of times, but have had to restart due to connectivity and space issues. I'll try again later, but I thought I'd at least give some initial feedback.
I'm looking forward to seeing this work. It seems like a very cool example. Thanks for sharing it.
"args = dict()\n", | ||
"args['batch_size'] = 64\n", | ||
"args['epochs'] = 100\n", | ||
"args['image_path'] = '/home/ubuntu/works/2018/lips_model/data/datasets/'\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Might make it easier if you used paths relative to where this is in the examples folder and where the data gets downloaded.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changed
example/gluon/lipnet/utils/multi.py
Outdated
|
||
def split_seq(sam_num, n_tile): | ||
""" | ||
Spli the number(sam_num) into numbers by n_tile |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Spli the number(sam_num) into numbers by n_tile | |
Split the number(sam_num) into numbers by n_tile |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed
Co-Authored-By: seujung <digit82@gmail.com>
Co-Authored-By: seujung <digit82@gmail.com>
Co-Authored-By: seujung <digit82@gmail.com>
Co-Authored-By: seujung <digit82@gmail.com>
Co-Authored-By: seujung <digit82@gmail.com>
Co-Authored-By: seujung <digit82@gmail.com>
Please add these to your prerequisites list:
Also, I tried to run it without a GPU and couldn't get it to work with:
Raises this error:
|
I built the project on a GPU instance this time and was able to run main.py. However, I immediately get a dump of a lot of these errors:
Looks like it ran that 200 times and failed each time. |
* Split data into train and valid * Update Readme * Add infer.py * Remove ipynb * Apply to continual learning
is this PR good to go @thomelane @larroy ? |
@aaronmarkham any chance you could zip up all of the preprocessed files, to avoid And then we need to get those instructions into the README, so people don't try to download and preprocess the data themselves. |
@thomelane Ok, I'm creating the tar files now. The reason I didn't do that is that I kept getting disconnected and wanted to be able to resume a sync. If you're pulling a 15gb file and have to start over, well, that's no fun. |
I put the tar files in a separate bucket so you can pick how you want to download. To get the tar files:
Or to download them by link: To get the folders (unzipped):
|
@aaronmarkham thanks for uploading! sure, you can add the instructions to the readme in a different commit, wouldn't that need a CI run anyway, or has that been optimised now to ignore markdown changes? @seujung the model seems to be training okay (i.e. loss going down), but still noticeable differences between target and prediction. How good are the predictions on a correctly trained model? Also noticed that things like learning rate aren't explicitly defined, are the defaults correct for this model? |
@aaronmarkham I checked the file you uploaded. It was nice as intended. It takes too long time to train this model. The main reason is the decode part (beam search) on validation data (def infer_batch). Actually, we do not need decode all the validation examples during training, so we skip this or check only one mini batch example. To speed up, we only check the decoded result using infer.py not main.py (train). (3) Although the loss is still decreasing, I attach the pre-trained model. You can get the result.
Or
|
Great, thanks for clarifying @soeque1! |
LGTM |
@@ -0,0 +1,194 @@ | |||
# LipNet: End-to-End Sentence-level Lipreading |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# LipNet: End-to-End Sentence-level Lipreading | |
<!--- | |
Licensed under the Apache License, Version 2.0 (the "License"); | |
you may not use this file except in compliance with the License. | |
You may obtain a copy of the License at | |
http://www.apache.org/licenses/LICENSE-2.0 | |
Unless required by applicable law or agreed to in writing, software | |
distributed under the License is distributed on an "AS IS" BASIS, | |
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | |
See the License for the specific language governing permissions and | |
limitations under the License. See accompanying LICENSE file. | |
--> | |
# LipNet: End-to-End Sentence-level Lipreading |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
License isn't required on readme files. @szha if you feel strongly about adding it, I'm going to modify the readme in another PR later today and I can add it then.
* update lipnet * update utils * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/preprocess_data.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/download_data.py Co-Authored-By: seujung <digit82@gmail.com> * fix error for using gpu mode * Add requirements * Remove unnecessary requirements * Update .gitignore * Remove inappropriate license file * Changed relative path * Fix description * Fix description * Fix description * Fix description * Change doc strings and add url reference * Fix align_path * Remove zip files * Fix bugs: source_path, n_process * Fix target_path * Fix exception handler and resume the preprocess * Pass the output when it fails to detect the mouth * Add exception during collecting images * Add the disk space and fix default align_path * Change optimizer * Update readme for pip * Update README * Add checkpoint folder * Apply to train using multiprocess * update network.py * delete batchnorm comment *fix dropout * fix loading ndarray as F * add space * Update readme * Add the info of GRID Data * Add the info of word alignments * Add total download size * Add time for preprocessing * Add test code for beamsearch * add space * delete line and fix code * Add shebang in BeamSearch * Fix trainer * Add space line * Fix appeding losses * Fix trainer * Delete debug line in data_loader * Move transpose of input into data_loader * Delete trailing-whitespace * Hybridize lip model * Hybridize model * Refactor the len of input sequence * Fix the shape of model * Apply to split train and validation * Split data into train and valid * Update Readme * Add infer.py * Remove ipynb * Apply to continual learning * Add images * Update readme * Fix typo and pylint * Fix loss digits of save_file and typo * Add info of data split and batch size
* update lipnet * update utils * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/preprocess_data.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/download_data.py Co-Authored-By: seujung <digit82@gmail.com> * fix error for using gpu mode * Add requirements * Remove unnecessary requirements * Update .gitignore * Remove inappropriate license file * Changed relative path * Fix description * Fix description * Fix description * Fix description * Change doc strings and add url reference * Fix align_path * Remove zip files * Fix bugs: source_path, n_process * Fix target_path * Fix exception handler and resume the preprocess * Pass the output when it fails to detect the mouth * Add exception during collecting images * Add the disk space and fix default align_path * Change optimizer * Update readme for pip * Update README * Add checkpoint folder * Apply to train using multiprocess * update network.py * delete batchnorm comment *fix dropout * fix loading ndarray as F * add space * Update readme * Add the info of GRID Data * Add the info of word alignments * Add total download size * Add time for preprocessing * Add test code for beamsearch * add space * delete line and fix code * Add shebang in BeamSearch * Fix trainer * Add space line * Fix appeding losses * Fix trainer * Delete debug line in data_loader * Move transpose of input into data_loader * Delete trailing-whitespace * Hybridize lip model * Hybridize model * Refactor the len of input sequence * Fix the shape of model * Apply to split train and validation * Split data into train and valid * Update Readme * Add infer.py * Remove ipynb * Apply to continual learning * Add images * Update readme * Fix typo and pylint * Fix loss digits of save_file and typo * Add info of data split and batch size
* update lipnet * update utils * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/preprocess_data.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/download_data.py Co-Authored-By: seujung <digit82@gmail.com> * fix error for using gpu mode * Add requirements * Remove unnecessary requirements * Update .gitignore * Remove inappropriate license file * Changed relative path * Fix description * Fix description * Fix description * Fix description * Change doc strings and add url reference * Fix align_path * Remove zip files * Fix bugs: source_path, n_process * Fix target_path * Fix exception handler and resume the preprocess * Pass the output when it fails to detect the mouth * Add exception during collecting images * Add the disk space and fix default align_path * Change optimizer * Update readme for pip * Update README * Add checkpoint folder * Apply to train using multiprocess * update network.py * delete batchnorm comment *fix dropout * fix loading ndarray as F * add space * Update readme * Add the info of GRID Data * Add the info of word alignments * Add total download size * Add time for preprocessing * Add test code for beamsearch * add space * delete line and fix code * Add shebang in BeamSearch * Fix trainer * Add space line * Fix appeding losses * Fix trainer * Delete debug line in data_loader * Move transpose of input into data_loader * Delete trailing-whitespace * Hybridize lip model * Hybridize model * Refactor the len of input sequence * Fix the shape of model * Apply to split train and validation * Split data into train and valid * Update Readme * Add infer.py * Remove ipynb * Apply to continual learning * Add images * Update readme * Fix typo and pylint * Fix loss digits of save_file and typo * Add info of data split and batch size
* update lipnet * update utils * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/preprocess_data.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/download_data.py Co-Authored-By: seujung <digit82@gmail.com> * fix error for using gpu mode * Add requirements * Remove unnecessary requirements * Update .gitignore * Remove inappropriate license file * Changed relative path * Fix description * Fix description * Fix description * Fix description * Change doc strings and add url reference * Fix align_path * Remove zip files * Fix bugs: source_path, n_process * Fix target_path * Fix exception handler and resume the preprocess * Pass the output when it fails to detect the mouth * Add exception during collecting images * Add the disk space and fix default align_path * Change optimizer * Update readme for pip * Update README * Add checkpoint folder * Apply to train using multiprocess * update network.py * delete batchnorm comment *fix dropout * fix loading ndarray as F * add space * Update readme * Add the info of GRID Data * Add the info of word alignments * Add total download size * Add time for preprocessing * Add test code for beamsearch * add space * delete line and fix code * Add shebang in BeamSearch * Fix trainer * Add space line * Fix appeding losses * Fix trainer * Delete debug line in data_loader * Move transpose of input into data_loader * Delete trailing-whitespace * Hybridize lip model * Hybridize model * Refactor the len of input sequence * Fix the shape of model * Apply to split train and validation * Split data into train and valid * Update Readme * Add infer.py * Remove ipynb * Apply to continual learning * Add images * Update readme * Fix typo and pylint * Fix loss digits of save_file and typo * Add info of data split and batch size
* update lipnet * update utils * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/README.md Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/preprocess_data.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/multi.py Co-Authored-By: seujung <digit82@gmail.com> * Update example/gluon/lipnet/utils/download_data.py Co-Authored-By: seujung <digit82@gmail.com> * fix error for using gpu mode * Add requirements * Remove unnecessary requirements * Update .gitignore * Remove inappropriate license file * Changed relative path * Fix description * Fix description * Fix description * Fix description * Change doc strings and add url reference * Fix align_path * Remove zip files * Fix bugs: source_path, n_process * Fix target_path * Fix exception handler and resume the preprocess * Pass the output when it fails to detect the mouth * Add exception during collecting images * Add the disk space and fix default align_path * Change optimizer * Update readme for pip * Update README * Add checkpoint folder * Apply to train using multiprocess * update network.py * delete batchnorm comment *fix dropout * fix loading ndarray as F * add space * Update readme * Add the info of GRID Data * Add the info of word alignments * Add total download size * Add time for preprocessing * Add test code for beamsearch * add space * delete line and fix code * Add shebang in BeamSearch * Fix trainer * Add space line * Fix appeding losses * Fix trainer * Delete debug line in data_loader * Move transpose of input into data_loader * Delete trailing-whitespace * Hybridize lip model * Hybridize model * Refactor the len of input sequence * Fix the shape of model * Apply to split train and validation * Split data into train and valid * Update Readme * Add infer.py * Remove ipynb * Apply to continual learning * Add images * Update readme * Fix typo and pylint * Fix loss digits of save_file and typo * Add info of data split and batch size
Description
Add lip reading model using gluon
Checklist
Essentials
Please feel free to remove inapplicable items for your PR.
Changes
Comments