DETERMINING REGIONS OF INTEREST IN AN IMAGE

In some examples, a system for determining regions of interest in an image comprises a convolutional neural network (CNN). The CNN can comprise a down sampling pathway defining an encoder comprising a set of convolutional layers configured to output a down sampled image representation of the image,...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	VO, Nhat, XIA, Baiqiang
Format:	Patent
Sprache:	eng ; fre
Schlagworte:	CALCULATING COMPUTING COUNTING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In some examples, a system for determining regions of interest in an image comprises a convolutional neural network (CNN). The CNN can comprise a down sampling pathway defining an encoder comprising a set of convolutional layers configured to output a down sampled image representation of the image, and an up sampling pathway defining a decoder configured to output classified image data for the image representing the regions of interest, the encoder configured to receive image data representing the image, and generate, using the image data and a set of kernels, output data comprising the down sampled image representation, the decoder comprising a set of layers configured to perform transposed convolutions on the down sampled image representation to generate the classified image data, wherein the decoder is further configured to receive region data representing region information for the image, the region data comprising a vector encoding one or more characteristics of at least a portion of a region of interest of the image, and determined using the image data and a mask comprising an output of the convolutional neural network in the form of candidate classified image data. L'invention concerne, selon dans certains exemples, un système pour déterminer des régions d'intérêt dans une image comprenant un réseau de neurones à convolution (CNN). Le CNN peut comprendre un chemin d'échantillonnage descendant définissant un codeur comprenant un ensemble de couches de convolution configuré pour sortir une représentation d'image échantillonnée descendante de l'image, et un chemin d'échantillonnage ascendant définissant un décodeur configuré pour sortir des données d'image classées pour l'image représentant les régions d'intérêt, le codeur étant configuré pour recevoir des données d'image représentant l'image, et générer, en utilisant les données d'image et un ensemble de noyaux, des données de sortie comprenant la représentation d'image échantillonnée descendante, le décodeur comprenant un ensemble de couches configurées pour effectuer des convolutions transposées sur la représentation d'image échantillonnée vers le bas pour générer les données d'image classées, le décodeur étant en outre configuré pour recevoir des données de région représentant des informations de région pour l'image, les données de région comprenant un vecteur codant une ou plusieurs caractéristiques d'au moins une partie d'une région d'intérêt de l'image, et déterminées en utilisant les données d'