This project addresses the challenge of converting visual information into descriptive language by integrating Computer Vision and Natural Language Processing. I developed a multimodal image captioning system using an Encoder-Decoder framework, with CNNs (VGG19 and ResNet50) for extracting image features and LSTMs for generating captions. The system achieved a BLEU score of 0.52, demonstrating its effectiveness in producing accurate and descriptive captions for images.
-
Notifications
You must be signed in to change notification settings - Fork 1
rohit-mapakshi97/ImageCaptioning
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published