Akash Shetty ,Abhiram Srivathsa K H, O S Sumukh, Kavitha S N
Information Science and Engineering , R V College of Engineering , Bangalore, Karnataka, India
Abstract
Generating accurate captions for an image has remained one of the major challenges in Artificial Intelligence with plenty of applications ranging from robotic vision to helping the visually impaired.Long term applications also involve providing accurate captions for videos in scenarios such as security system.The aim is to build an optimal system which can generate semantically and grammatically accurate captions for an image and also to suggest captions which can be used on social media platforms.In this system, images are preprocessed and captions are generated. Then the image features are extracted using Resnet50.Then the captions are generated word by word using LSTM. The application hopes to be useful for visually impaired people and to be useful for generating the social media captions which can be used on various social media platforms.
Keywords: CNN, RNN, LSTM, Resnet50
Journal Name :
EPRA International Journal of Multidisciplinary Research (IJMR)

VIEW PDF
Published on : 2022-08-11

Vol : 8
Issue : 8
Month : August
Year : 2022
Copyright © 2024 EPRA JOURNALS. All rights reserved
Developed by Peace Soft