Image Caption Generator

This project implements an image caption generator using a combination of LSTM for text generation and VGG16 for image feature extraction. The model is trained on the Flickr 8k dataset.

Model Architecture

Image Feature Extraction: VGG16 pre-trained on ImageNet
Text Generation: Long Short-Term Memory (LSTM) network

Dataset

The model is trained on the Flickr 8k dataset, which contains:

8,000 images
5 captions per image

Requirements

Python 3.7+
TensorFlow 2.0
Keras
NumPy
Matplotlib

Usage

Clone the repository: git clone https://github.com/Roronoa-17/Image_Caption_Generator.git
Install the required packages: pip install -r requirements.txt
Run the caption generator: streamlit run app.py

Training

Download the Flickr 8k dataset
Open the python notebook for further instructions.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
features		features
models		models
notebook		notebook
README.md		README.md
app.py		app.py
command		command
requirements.txt		requirements.txt
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Caption Generator

Model Architecture

Dataset

Requirements

Usage

Training

About

Releases

Packages

Languages

Roronoa-17/Image_Caption_Generator

Folders and files

Latest commit

History

Repository files navigation

Image Caption Generator

Model Architecture

Dataset

Requirements

Usage

Training

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages