Multimodal entity identification
A machine learning based system can identify an entity as the likely subject of a multimodal message (e.g., a social media post having a short text phrase overlaid on an image) by creating embeddings for an image of the multimodal message and one or more string embeddings from text of the multimodal...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A machine learning based system can identify an entity as the likely subject of a multimodal message (e.g., a social media post having a short text phrase overlaid on an image) by creating embeddings for an image of the multimodal message and one or more string embeddings from text of the multimodal message. The embeddings can be weighted to maximize information gain, then recombined and compared against a result embedding database to identify an entity as the subject of the multimodal message. |
---|