Proceedings of the International e-Conference on Advances in Computer Engineering and Communication Systems (ICACECS 2023)

Image Caption Generator

Authors
B. Deepika¹, S. Pushpanjali Reddy¹,*, S. Gouthami Satya¹, K. Rushil Kumar¹
¹VNR Vignana Jyothi Institute of Engineering & Technology, Hyderabad, India
*Corresponding author. Email: pushpaseelamreddy@gmail.com
Corresponding Author
S. Pushpanjali Reddy
Available Online 21 December 2023.
DOI
10.2991/978-94-6463-314-6_35
Keywords
LSTM; RNN; CNN; Deep Learning; Natural Language Processing
Abstract

Image captioning, the task of describing an image in natural language, has long attracted the interest of expert-system researchers, and producing an accurate description of an image remains a significant challenge. An image caption generator describes the characteristics and attributes of an image. It has many applications, including robotic vision, storytelling from album uploads, and business. For instance, it can be used in image segmentation, as in Google Photos, and its application can be extended to video frames. It has become one of the most widely used tools today. This paper aims to employ computer vision and machine translation to caption images. The task involves recognizing the objects, actions, and attributes in an image and identifying the relationship between the objects and the generated descriptions. Most captioning approaches use an encoder-decoder framework, in which the input image is encoded into an intermediate representation of its content and then decoded into a sequence of descriptive text. The dataset used is Flickr8k, and the programming language is Python. The project involves developing an app, built with Flutter, that takes an input image, extracts features, and generates accurate descriptions. It has immense potential for helping the visually impaired, and it can also help automate the work of radiologists.

Copyright
© 2023 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Volume Title
Proceedings of the International e-Conference on Advances in Computer Engineering and Communication Systems (ICACECS 2023)
Series
Atlantis Highlights in Computer Sciences
Publication Date
21 December 2023
ISBN
978-94-6463-314-6
ISSN
2589-4900

Cite this article

TY  - CONF
AU  - B. Deepika
AU  - S. Pushpanjali Reddy
AU  - S. Gouthami Satya
AU  - K. Rushil Kumar
PY  - 2023
DA  - 2023/12/21
TI  - Image Caption Generator
BT  - Proceedings of the International e-Conference on Advances in Computer Engineering and Communication Systems (ICACECS 2023)
PB  - Atlantis Press
SP  - 360
EP  - 370
SN  - 2589-4900
UR  - https://doi.org/10.2991/978-94-6463-314-6_35
DO  - 10.2991/978-94-6463-314-6_35
ID  - Deepika2023
ER  -