EconPapers    
Economics at your fingertips  
 

A Deep Learning Model to Generate Image Captions

Shashank Parmar, Raman Tyagi and Prince Kumar Dhankar
Additional contact information
Shashank Parmar: Department of Software Engineering, Delhi Technological University, New Delhi, India
Raman Tyagi: Department of Software Engineering, Delhi Technological University, New Delhi, India
Prince Kumar Dhankar: Department of Software Engineering, Delhi Technological University, New Delhi, India

International Journal of Research and Scientific Innovation, 2023, vol. 10, issue 11, 634-643

Abstract: How computers can automatically describe the substance of photographs using human language is a topic that interests us greatly. We choose to use the most advanced image caption generator currently available to obtain a deeper understanding of this computer vision topic. Show, attend and tell Visually attentive neural image caption generator [12]. Our machine learning-based neural network picture description generator is created in Python using the Pytorch ML framework. In our workflow, we’ve determined ï¬ ve key elements: Data preparation, a convolutional neural network (CNN) it helps in encoding, a recurrent neural network (RNN) it helps in decoding, a beam search to determine the best description, generation of sentences, and assessment make up the R1-R6 framework. The quality and correctness of the generated caption are evaluated using the BLEU-4 score. Each member of our group contributed equally to moving the project forward as we distributed the ï¬ ve elements mentioned above equally among ourselves. All ï¬ ve components have been completed successfully, and we can now use Kaggle Notebook to train our network. After the network has been trained and is performing satisfactorily, we continue to see the attention mechanism.

Date: 2023
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.rsisinternational.org/journals/ijrsi/d ... issue-11/634-643.pdf (application/pdf)
https://rsisinternational.org/journals/ijrsi/artic ... rate-image-captions/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bjc:journl:v:10:y:2023:i:11:p:634-643

Access Statistics for this article

International Journal of Research and Scientific Innovation is currently edited by Dr. Renu Malsaria

More articles in International Journal of Research and Scientific Innovation from International Journal of Research and Scientific Innovation (IJRSI)
Bibliographic data for series maintained by Dr. Renu Malsaria ().

 
Page updated 2025-03-19
Handle: RePEc:bjc:journl:v:10:y:2023:i:11:p:634-643