A Novel Bi-LSTM Based Automatic Image Description Generation
Image captioning is the process of creating a textual description of an image. Due to its importance in various fields, it has emerged as the latest and hot research problem. It uses Computer Vision techniques to process an image and Natural Language Processing to generate the caption. Our proposed...
Gespeichert in:
Veröffentlicht in: | Ingénierie des systèmes d'Information 2023-04, Vol.28 (2), p.527-534 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Image captioning is the process of creating a textual description of an image. Due to its importance in various fields, it has emerged as the latest and hot research problem. It uses Computer Vision techniques to process an image and Natural Language Processing to generate the caption. Our proposed approach uses the Bi-LSTM (Bi directional Long Short Term Memory) approach to generate the image description. We also propose the Novel Moth Flame Optimization (NMFO). This model uses the correlation-based logarithmic spiral update. The novel proposed model is demonstrated on standard datasets like Flicker 8k, Flicker 30k, and MSCOCO datasets using standard metrics likeBLEU, CIDEr. Performance of various metrics on various datasets shows that our novel Bi-LSTM approach gives better performance when compared to our traditional approaches. |
---|---|
ISSN: | 1633-1311 2116-7125 |
DOI: | 10.18280/isi.280230 |