Few-shot vegetable disease recognition model based on image text collaborative representation learning

•A model based on image text collaborative representation learning was proposed.•Text improves the learning ability of the model on small samples.•The back leaf image is added to the model to support the disease diagnosis process.•All the disease images were collected from the field environment. Aut...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers and electronics in agriculture 2021-05, Vol.184, p.106098, Article 106098
Hauptverfasser: Wang, Chunshan, Zhou, Ji, Zhao, Chunjiang, Li, Jiuxi, Teng, Guifa, Wu, Huarui
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•A model based on image text collaborative representation learning was proposed.•Text improves the learning ability of the model on small samples.•The back leaf image is added to the model to support the disease diagnosis process.•All the disease images were collected from the field environment. Automatic recognition of vegetable diseases in complex backgrounds is an urgent need in the field of agricultural informatization. The recognition methods based on deep learning have achieved excellent performance in disease diagnosis and therefore have gradually become a research hotspot. However, the disease recognition models established based on deep convolutional neural networks usually need to be trained on huge disease image datasets so as to achieve an ideal outcome. Building such a kind of dataset requires a large amount of disease images and labeling information, which is often technically or economically infeasible. In this paper, a small-sample recognition model of vegetable diseases in complex backgrounds based on image text collaborative representation learning (ITC-Net) was proposed. This model combined the disease image modal information with the disease text modal information, so as to achieve collaborative recognition of disease features by utilizing the correlation and complementarity between the two types of disease information. Eventually, the ITC-Net achieved better results than either the image model or text model alone on a small dataset. To be more specific, its accuracy, precision, sensitivity and specificity are 99.48%, 98.90%, 98.78% and 99.66%, respectively. This paper proves that the multi-modal collaborative representation learning using both disease images and disease texts is an effective method to solve the problem of vegetable disease recognition in complex backgrounds with few-shot.
ISSN:0168-1699
1872-7107
DOI:10.1016/j.compag.2021.106098