-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Progress Report] Implementation of final decision layer #14
Comments
From keras-team/keras#7581 PS: |
Keras provide Inception V3 weight and model in their library, along with other network models such as VGG and ResNet. Currently there's issue found in Keras inception V3 model. Current structure is the following
By this we cast a grayscale image to rgb channel and feed it into the image.(I also tried with reading the grayscale image as rgb, it doesn't change much in terms of result) However, I face a strange bug where the network itself doesn't seem to learn from the inception v3 network.
From graph we see that although training loss is decreasing, validation loss is not. Also we get val accuracy around 20% which is very close to random guessing. I've tried with various method like changing the size of input, change number of layers. However results are similar or even worse. Here's result after adding another dense layer of size 200: Inception V3 is not the only model that can classify image. I also tried with VGG16 model. Similar result achieved. Possible error:
For the potential bug exist in Batch Normalization Layer of Inception v3 model, 231/231 [==============================] - 0s 2ms/step I add another Dense layer of size 1024 and dropout of 0.5.The result shows over-fitting. 231/231 [==============================] - 0s 2ms/step |
This is to document the implement process of constructing final decision layer using both Spectrogram+CNN and audio+LSTM models.
Current two possible flow can be:
Current Implementation will be option 1
The text was updated successfully, but these errors were encountered: