Image Caption Generator
- DOI
- 10.2991/978-94-6463-314-6_35
- Keywords
- LSTM; RNN; CNN; Deep Learning; Natural Language Processing
- Abstract
Image captioning, the task of describing an image in natural language, has long attracted the interest of expert-system researchers, and generating an accurate description of an image remains a significant challenge. An image caption generator describes the characteristics and attributes of an image. It has many applications, including robotic vision, storytelling from album uploads, and business. For instance, it can be used in image segmentation, as in Google Photos, and its application can also be extended to video frames. It has grown into one of the most prevalent tools of the contemporary period. This paper aims to employ computer vision and machine translation to caption images. The task involves recognizing the objects, actions, and attributes in an image and identifying the relations between the objects and the generated descriptions. Most approaches use an encoder-decoder framework, in which the input image is encoded into an intermediate representation of its information and then decoded into descriptive text. The dataset used is Flickr8k, and the programming language is Python. The project involves developing an app, built with Flutter, that takes an input image, extracts features, and generates accurate descriptions. Image captioning has immense potential for helping the visually impaired, and it can also help automate the work of radiologists.
- Copyright
- © 2023 The Author(s)
- Open Access
- This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF
AU - B. Deepika
AU - S. Pushpanjali Reddy
AU - S. Gouthami Satya
AU - K. Rushil Kumar
PY - 2023
DA - 2023/12/21
TI - Image Caption Generator
BT - Proceedings of the International e-Conference on Advances in Computer Engineering and Communication Systems (ICACECS 2023)
PB - Atlantis Press
SP - 360
EP - 370
SN - 2589-4900
UR - https://doi.org/10.2991/978-94-6463-314-6_35
DO - 10.2991/978-94-6463-314-6_35
ID - Deepika2023
ER -
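
Illustrative sketch

The encoder-decoder framework mentioned in the abstract can be illustrated with a small, dependency-free Python sketch. This is not the authors' implementation: the averaging "encoder" is only a stand-in for a CNN, the tanh recurrent cell is only a stand-in for an LSTM, the weights are random and untrained, and the vocabulary and all names (`encode`, `ToyDecoder`, `generate_caption`) are hypothetical. It shows only the data flow: image, then feature vector, then a token-by-token greedy decoding loop.

```python
import math
import random

# Hypothetical toy vocabulary; a real system would build one from Flickr8k captions.
VOCAB = ["<start>", "<end>", "a", "dog", "runs", "on", "grass"]
START, END = VOCAB.index("<start>"), VOCAB.index("<end>")


def encode(image):
    """Stand-in for a CNN encoder: average the RGB channels into a 3-value feature vector."""
    n = len(image)
    return [sum(pixel[c] for pixel in image) / n for c in range(3)]


def make_weights(rows, cols, seed):
    """Random, untrained weights (assumption: a real model would learn these)."""
    rng = random.Random(seed)
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class ToyDecoder:
    """Stand-in for an LSTM decoder: a plain tanh recurrent cell."""

    def __init__(self, feat_dim=3, hidden=8, seed=0):
        self.w_img = make_weights(hidden, feat_dim, seed)        # features -> initial state
        self.w_hh = make_weights(hidden, hidden, seed + 1)       # state -> state
        self.w_emb = make_weights(hidden, len(VOCAB), seed + 2)  # token -> embedding
        self.w_out = make_weights(len(VOCAB), hidden, seed + 3)  # state -> word scores

    def init_state(self, features):
        return [math.tanh(x) for x in matvec(self.w_img, features)]

    def step(self, state, token_id):
        # Embedding lookup is the column of w_emb selected by the token id.
        emb = [row[token_id] for row in self.w_emb]
        rec = matvec(self.w_hh, state)
        new_state = [math.tanh(e + r) for e, r in zip(emb, rec)]
        return new_state, matvec(self.w_out, new_state)


def generate_caption(image, decoder, max_len=10):
    """Greedy decoding: feed back the highest-scoring word until <end> or max_len."""
    state = decoder.init_state(encode(image))
    token, words = START, []
    for _ in range(max_len):
        state, scores = decoder.step(state, token)
        scores[START] = float("-inf")  # never emit the start marker
        token = max(range(len(VOCAB)), key=scores.__getitem__)
        if token == END:
            break
        words.append(VOCAB[token])
    return words


if __name__ == "__main__":
    # A tiny fake "image": a flat list of (r, g, b) pixels with values in [0, 1].
    image = [(0.1, 0.8, 0.2), (0.2, 0.7, 0.1), (0.6, 0.5, 0.4)]
    print(" ".join(generate_caption(image, ToyDecoder())))
```

Because the weights are untrained, the printed words are arbitrary. In a trained system the same loop would run over a learned CNN feature extractor and a learned LSTM, and beam search is a common alternative to the greedy word choice used here.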