resnet-image-captioning

Show, Attend and Tell with ResNet
Adapted from Show, Attend and Tell: VGG is replaced with ResNet, and the ResNet can be trained jointly with the LSTM. The ResNet code is borrowed and adapted from models/official/resnet/. The output of block_layer3 in ResNet-50 is used as the image features, giving the following results.

val set:

Bleu_1: 0.660386
Bleu_2: 0.447982
Bleu_3: 0.305375
Bleu_4: 0.213699
METEOR: 0.213692
ROUGE_L: 0.515579
CIDEr: 0.665676

test set:

Bleu_1: 0.623284
Bleu_2: 0.399399
Bleu_3: 0.260130
Bleu_4: 0.174152
METEOR: 0.191364
ROUGE_L: 0.486628
CIDEr: 0.501525
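
To illustrate how the block_layer3 output can serve as image features for the attention LSTM, here is a minimal NumPy sketch of the soft attention step from Show, Attend and Tell. It is not the code in this repository. The feature map shape (14 x 14 x 1024, which is what ResNet-50 produces at that stage for a 224 x 224 input), the hidden size, the attention size, and all names (`soft_attention`, `W_f`, `W_h`, `w_att`) are assumptions chosen for illustration, and random values stand in for real activations and learned weights.

```python
import numpy as np


def soft_attention(features, hidden, W_f, W_h, w_att):
    """Soft attention over image regions (illustrative, not the repo's code).

    features: (L, D) one D-dim vector per spatial location
    hidden:   (H,)   current LSTM hidden state
    Returns the context vector (D,) and attention weights (L,).
    """
    # Score each location from its feature vector and the hidden state.
    scores = np.tanh(features @ W_f + hidden @ W_h) @ w_att  # (L,)
    # Numerically stable softmax over the L locations.
    scores = scores - scores.max()
    alpha = np.exp(scores) / np.exp(scores).sum()
    # Context vector: attention-weighted average of the location features.
    context = alpha @ features  # (D,)
    return context, alpha


rng = np.random.default_rng(0)

# Assumed block_layer3 output for one image: 14 x 14 grid, 1024 channels.
feature_map = rng.standard_normal((14, 14, 1024))
# Flatten the spatial grid into L = 196 annotation vectors of size D = 1024.
features = feature_map.reshape(-1, feature_map.shape[-1])

hidden = rng.standard_normal(512)                # assumed LSTM hidden size
W_f = rng.standard_normal((1024, 256)) * 0.01    # feature projection
W_h = rng.standard_normal((512, 256)) * 0.01     # hidden-state projection
w_att = rng.standard_normal(256) * 0.01          # scoring vector

context, alpha = soft_attention(features, hidden, W_f, W_h, w_att)
print(features.shape, context.shape, alpha.shape)
```

At each decoding step the context vector would be fed to the LSTM together with the previous word embedding. Compared with the 512-channel VGG features of the original paper, the only change on the attention side under these assumptions is the larger feature dimension D.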



Robootx/resnet-image-captioning
