Keras Image Caption

A image catpioning model for the 2017 DatalabCup: Image Caption competition in NTHU CS565500.

The model is trained on 2014 Microsoft COCO dataset. It can reach a 0.765 CIDErD score, and it also win the 1st place with a score 0.67822 Root Mean Square Percentage Error.

We are still working on the repo, so all the info you see now is a temporary version, and the completed version will come out soon.

Prerequisites

Python 3.4+
Keras
Pandas
Numpy

Features

Most of the features used in the model can be found in the 2017 Datalab Cup: Image Caption, while the image feature is extracted by Inception-v3. You can find the pre-trained weight file on tensorflow site.

We take the result from the last pooling layer, which is a 2048-D vector, as image feature.

Training

The training set contains 102,739 images selected by TAs from MSCOCO 2014.

You can start training by:

python train.py [initial weights]

Evaluation

The result is evaluated on 20,548 testing images selected by TAs from MSCOCO2014, and use CIDErD as the metric.

We have offered a simple evaluation code that will randomly pick images from testing set, produce and print out the generated captions and the ground truth:

python evaluation.py [weights file path]

Demo

python demo.py [weighs file path]

We are able to reach 0.765 CIDErD school, and win the 1st place in the competition.

Acknowledgements

Image extractor : mjhucla : TF-mRNN Beamsearch : udibr : beamsearch

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
dataset		dataset
pre_trained		pre_trained
README.md		README.md
beamsearch.py		beamsearch.py
demo.py		demo.py
evaluate.py		evaluate.py
extractor.py		extractor.py
model.py		model.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Keras Image Caption

Prerequisites

Features

Training

Evaluation

Demo

Acknowledgements

About

Releases

Packages

Languages

LemonATsu/Keras-Image-Caption

Folders and files

Latest commit

History

Repository files navigation

Keras Image Caption

Prerequisites

Features

Training

Evaluation

Demo

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages