This is a deep learning project that uses the Flickr8k dataset. It implements a model inspired by the well-known Show and Tell model for automatic image captioning. The model block diagram is shown below.
- Download and extract Flickr8k_Dataset.zip and Flickr8k_text.zip.
- Create a folder named 'ML' in the root directory of your Google Drive.
- Upload the extracted `Flickr8k_Dataset` and `Flickr8k_text` folders into the ML folder. This will take some time, depending on your internet connection.
- Also upload `testing_caption_generator.py` into the ML folder.
- Create a notebook in Colab with any name you like, and copy the contents of `ImageCaption.ipynb` into it.
- Run all the cells. Happy CODING :)
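
Before running the notebook, it can help to confirm that the uploads landed where the steps above put them. The following is a minimal sketch, not part of the project itself: the helper name `missing_items` is hypothetical, and the path `/content/drive/MyDrive/ML` assumes Google Drive has been mounted in Colab at `/content/drive`. Adjust the path if your mount point differs.

```python
from pathlib import Path

# Items that the setup steps place in the ML folder.
EXPECTED_ITEMS = (
    "Flickr8k_Dataset",
    "Flickr8k_text",
    "testing_caption_generator.py",
)


def missing_items(ml_dir):
    """Return the expected names that are not present in ml_dir."""
    ml_dir = Path(ml_dir)
    return [name for name in EXPECTED_ITEMS if not (ml_dir / name).exists()]


if __name__ == "__main__":
    # Assumption: Drive was mounted in Colab beforehand, for example with
    #   from google.colab import drive
    #   drive.mount("/content/drive")
    # so that the ML folder appears under MyDrive.
    ml_dir = Path("/content/drive/MyDrive/ML")
    missing = missing_items(ml_dir)
    if missing:
        print("Missing from", ml_dir, ":", ", ".join(missing))
    else:
        print("All expected items found in", ml_dir)
```

If the check reports a missing item, the upload is probably still in progress or the file went into a different Drive folder.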