A Novel Bi-LSTM Based Automatic Image Description Generation

Image captioning is the process of creating a textual description of an image. Due to its importance in various fields, it has emerged as the latest and hot research problem. It uses Computer Vision techniques to process an image and Natural Language Processing to generate the caption. Our proposed...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Ingénierie des systèmes d'Information 2023-04, Vol.28 (2), p.527-534
1. Verfasser: Ravulaplli, Lakshmi Tulasi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Image captioning is the process of creating a textual description of an image. Due to its importance in various fields, it has emerged as the latest and hot research problem. It uses Computer Vision techniques to process an image and Natural Language Processing to generate the caption. Our proposed approach uses the Bi-LSTM (Bi directional Long Short Term Memory) approach to generate the image description. We also propose the Novel Moth Flame Optimization (NMFO). This model uses the correlation-based logarithmic spiral update. The novel proposed model is demonstrated on standard datasets like Flicker 8k, Flicker 30k, and MSCOCO datasets using standard metrics likeBLEU, CIDEr. Performance of various metrics on various datasets shows that our novel Bi-LSTM approach gives better performance when compared to our traditional approaches.
ISSN:1633-1311
2116-7125
DOI:10.18280/isi.280230