Fine-grained image classification method based on multi-modal learning

The invention discloses a multi-modal learning-based fine-grained image classification method, which comprises the following steps of: downloading original pictures of different species and corresponding additional information files from a known data set, preprocessing the additional information fil...

Bibliographic details
Main authors:	XU JIE, GENG ZILI, FENG YUREN, ZHANG XIAOQIAN, ZHENG HAO, LIU HENG
Format:	Patent
Language:	chi ; eng
Subjects:	CALCULATING ; COMPUTING ; COUNTING ; PHYSICS
Online access:	Order full text

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	XU JIE ; GENG ZILI ; FENG YUREN ; ZHANG XIAOQIAN ; ZHENG HAO ; LIU HENG
description	The invention discloses a fine-grained image classification method based on multi-modal learning. Original pictures of different species and their corresponding additional information files are first downloaded from a known data set. The additional information files are preprocessed and then used to train, until convergence, the neural networks that extract multi-modal features and fusion features. The converged neural networks then predict label probabilities for the fine-grained image, a decision correction is applied to the prediction probabilities of the two neural networks, and the category of the species in the image is finally output according to the correction result. 本发明公开了一种基于多模态学习的细粒度图像分类方法，先从已知数据集中下载不同物种的原始图片及对应的附加信息文件，通过对附加信息文件进行预处理后，用于训练提取多模态特征和融合特征的神经网络并收敛，然后通过收敛的神经网络对应细粒度图像进行标签概率预测，再对两个神经网络的预测概率进行决策修正，最后根据修正结果输出图像中物种的类别。
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN116740420A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN116740420A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN116740420A3</originalsourceid><addsrcrecordid>eNrjZHBzy8xL1U0vSgRSKQqZuYnpqQrJOYnFxZlpmcmJJZn5eQq5qSUZ-SkKSYnFQBUgfmlOSaZubn5KYo5CTmpiUV5mXjoPA2taYk5xKi-U5mZQdHMNcfbQTS3Ij08tLkhMTs1LLYl39jM0NDM3MTAxMnA0JkYNAB0lM2k</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Fine-grained image classification method based on multi-modal learning</title><source>esp@cenet</source><creator>XU JIE ; GENG ZILI ; FENG YUREN ; ZHANG XIAOQIAN ; ZHENG HAO ; LIU HENG</creator><creatorcontrib>XU JIE ; GENG ZILI ; FENG YUREN ; ZHANG XIAOQIAN ; ZHENG HAO ; LIU HENG</creatorcontrib><description>The invention discloses a multi-modal learning-based fine-grained image classification method, which comprises the following steps of: downloading original pictures of different species and corresponding additional information files from a known data set, preprocessing the additional information files, training a neural network for extracting multi-modal features and fusion features, and converging the neural network, performing label probability prediction on the fine-grained image through the converged neural network, performing decision correction on the prediction probabilities of the two neural networks, and finally outputting the category of the species in the image according to the correction result. 
本发明公开了一种基于多模态学习的细粒度图像分类方法，先从已知数据集中下载不同物种的原始图片及对应的附加信息文件，通过对附加信息文件进行预处理后，用于训练提取多模态特征和融合特征的神经网络并收敛，然后通过收敛的神经网络对应细粒度图像进行标签概率预测，再对两个神经网络的预测概率进行决策修正，最后根据修正结果输出图像中物种的类别。</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230912&DB=EPODOC&CC=CN&NR=116740420A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230912&DB=EPODOC&CC=CN&NR=116740420A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>XU JIE</creatorcontrib><creatorcontrib>GENG ZILI</creatorcontrib><creatorcontrib>FENG YUREN</creatorcontrib><creatorcontrib>ZHANG XIAOQIAN</creatorcontrib><creatorcontrib>ZHENG HAO</creatorcontrib><creatorcontrib>LIU HENG</creatorcontrib><title>Fine-grained image classification method based on multi-modal learning</title><description>The invention discloses a multi-modal learning-based fine-grained image classification method, which comprises the following steps of: downloading original pictures of different species and corresponding additional information files from a known data set, preprocessing the additional information files, training a neural network for extracting multi-modal features and fusion features, and converging the neural network, performing label probability prediction on the fine-grained image through the converged neural network, performing decision correction on the prediction probabilities of the two neural networks, and 
finally outputting the category of the species in the image according to the correction result. 本发明公开了一种基于多模态学习的细粒度图像分类方法，先从已知数据集中下载不同物种的原始图片及对应的附加信息文件，通过对附加信息文件进行预处理后，用于训练提取多模态特征和融合特征的神经网络并收敛，然后通过收敛的神经网络对应细粒度图像进行标签概率预测，再对两个神经网络的预测概率进行决策修正，最后根据修正结果输出图像中物种的类别。</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHBzy8xL1U0vSgRSKQqZuYnpqQrJOYnFxZlpmcmJJZn5eQq5qSUZ-SkKSYnFQBUgfmlOSaZubn5KYo5CTmpiUV5mXjoPA2taYk5xKi-U5mZQdHMNcfbQTS3Ij08tLkhMTs1LLYl39jM0NDM3MTAxMnA0JkYNAB0lM2k</recordid><startdate>20230912</startdate><enddate>20230912</enddate><creator>XU JIE</creator><creator>GENG ZILI</creator><creator>FENG YUREN</creator><creator>ZHANG XIAOQIAN</creator><creator>ZHENG HAO</creator><creator>LIU HENG</creator><scope>EVB</scope></search><sort><creationdate>20230912</creationdate><title>Fine-grained image classification method based on multi-modal learning</title><author>XU JIE ; GENG ZILI ; FENG YUREN ; ZHANG XIAOQIAN ; ZHENG HAO ; LIU HENG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN116740420A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>XU JIE</creatorcontrib><creatorcontrib>GENG ZILI</creatorcontrib><creatorcontrib>FENG YUREN</creatorcontrib><creatorcontrib>ZHANG XIAOQIAN</creatorcontrib><creatorcontrib>ZHENG HAO</creatorcontrib><creatorcontrib>LIU HENG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>XU JIE</au><au>GENG ZILI</au><au>FENG 
YUREN</au><au>ZHANG XIAOQIAN</au><au>ZHENG HAO</au><au>LIU HENG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Fine-grained image classification method based on multi-modal learning</title><date>2023-09-12</date><risdate>2023</risdate><abstract>The invention discloses a multi-modal learning-based fine-grained image classification method, which comprises the following steps of: downloading original pictures of different species and corresponding additional information files from a known data set, preprocessing the additional information files, training a neural network for extracting multi-modal features and fusion features, and converging the neural network, performing label probability prediction on the fine-grained image through the converged neural network, performing decision correction on the prediction probabilities of the two neural networks, and finally outputting the category of the species in the image according to the correction result. 本发明公开了一种基于多模态学习的细粒度图像分类方法，先从已知数据集中下载不同物种的原始图片及对应的附加信息文件，通过对附加信息文件进行预处理后，用于训练提取多模态特征和融合特征的神经网络并收敛，然后通过收敛的神经网络对应细粒度图像进行标签概率预测，再对两个神经网络的预测概率进行决策修正，最后根据修正结果输出图像中物种的类别。</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN116740420A
source	esp@cenet
subjects	CALCULATING ; COMPUTING ; COUNTING ; PHYSICS
title	Fine-grained image classification method based on multi-modal learning
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T04%3A10%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=XU%20JIE&rft.date=2023-09-12&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN116740420A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true