ITERATIVE TRAINING FOR TEXT-IMAGE-LAYOUT TRANSFORMER

Disclosed herein is a system and method for Natural Language Processing (NLP) of real world documents. The system and method combines various models not previously combined and overcomes the challenges of this combination. Models include an encoder-decoder model, a spatial model, and a multi-modal m...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	BORCHMANN, Lukasz Konrad, DANCEWICZ, Adam, GRALINSKI, Filip
Format:	Patent
Sprache:	eng ; fre
Schlagworte:	CALCULATING COMPUTING COUNTING HANDLING RECORD CARRIERS PHYSICS PRESENTATION OF DATA RECOGNITION OF DATA RECORD CARRIERS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	BORCHMANN, Lukasz Konrad DANCEWICZ, Adam GRALINSKI, Filip
description	Disclosed herein is a system and method for Natural Language Processing (NLP) of real world documents. The system and method combines various models not previously combined and overcomes the challenges of this combination. Models include an encoder-decoder model, a spatial model, and a multi-modal model. An iterative training process receives documents and generates outputs, wherein the iterative training process comprises enabling information retrieval from documents without training data. Sont divulgués un système et un procédé de traitement de langage naturel (TLN) de documents du monde réel. Le système et le procédé combinent divers modèles n'ayant jamais été combinés et surmontent les défis de cette combinaison. Les modèles comprennent un modèle codeur-décodeur, un modèle spatial et un modèle multimodal. Un processus d'entraînement itératif reçoit des documents et génère des sorties, le processus d'entraînement itératif comprenant l'activation de l'extraction d'informations à partir de documents sans données d'entraînement.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_WO2022175849A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>WO2022175849A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_WO2022175849A13</originalsourceid><addsrcrecordid>eNrjZDDxDHENcgzxDHNVCAly9PTz9HNXcPMPUghxjQjR9fR1dHfV9XGM9A8NAUn7BQOlfF2DeBhY0xJzilN5oTQ3g7Kba4izh25qQX58anFBYnJqXmpJfLi_kYGRkaG5qYWJpaOhMXGqAJAmKL4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>ITERATIVE TRAINING FOR TEXT-IMAGE-LAYOUT TRANSFORMER</title><source>esp@cenet</source><creator>BORCHMANN, Lukasz Konrad ; DANCEWICZ, Adam ; GRALINSKI, Filip</creator><creatorcontrib>BORCHMANN, Lukasz Konrad ; DANCEWICZ, Adam ; GRALINSKI, Filip</creatorcontrib><description>Disclosed herein is a system and method for Natural Language Processing (NLP) of real world documents. The system and method combines various models not previously combined and overcomes the challenges of this combination. Models include an encoder-decoder model, a spatial model, and a multi-modal model. An iterative training process receives documents and generates outputs, wherein the iterative training process comprises enabling information retrieval from documents without training data. Sont divulgués un système et un procédé de traitement de langage naturel (TLN) de documents du monde réel. Le système et le procédé combinent divers modèles n'ayant jamais été combinés et surmontent les défis de cette combinaison. Les modèles comprennent un modèle codeur-décodeur, un modèle spatial et un modèle multimodal. Un processus d'entraînement itératif reçoit des documents et génère des sorties, le processus d'entraînement itératif comprenant l'activation de l'extraction d'informations à partir de documents sans données d'entraînement.</description><language>eng ; fre</language><subject>CALCULATING ; COMPUTING ; COUNTING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220825&DB=EPODOC&CC=WO&NR=2022175849A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,778,883,25547,76298</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220825&DB=EPODOC&CC=WO&NR=2022175849A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>BORCHMANN, Lukasz Konrad</creatorcontrib><creatorcontrib>DANCEWICZ, Adam</creatorcontrib><creatorcontrib>GRALINSKI, Filip</creatorcontrib><title>ITERATIVE TRAINING FOR TEXT-IMAGE-LAYOUT TRANSFORMER</title><description>Disclosed herein is a system and method for Natural Language Processing (NLP) of real world documents. The system and method combines various models not previously combined and overcomes the challenges of this combination. Models include an encoder-decoder model, a spatial model, and a multi-modal model. An iterative training process receives documents and generates outputs, wherein the iterative training process comprises enabling information retrieval from documents without training data. Sont divulgués un système et un procédé de traitement de langage naturel (TLN) de documents du monde réel. Le système et le procédé combinent divers modèles n'ayant jamais été combinés et surmontent les défis de cette combinaison. Les modèles comprennent un modèle codeur-décodeur, un modèle spatial et un modèle multimodal. Un processus d'entraînement itératif reçoit des documents et génère des sorties, le processus d'entraînement itératif comprenant l'activation de l'extraction d'informations à partir de documents sans données d'entraînement.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDDxDHENcgzxDHNVCAly9PTz9HNXcPMPUghxjQjR9fR1dHfV9XGM9A8NAUn7BQOlfF2DeBhY0xJzilN5oTQ3g7Kba4izh25qQX58anFBYnJqXmpJfLi_kYGRkaG5qYWJpaOhMXGqAJAmKL4</recordid><startdate>20220825</startdate><enddate>20220825</enddate><creator>BORCHMANN, Lukasz Konrad</creator><creator>DANCEWICZ, Adam</creator><creator>GRALINSKI, Filip</creator><scope>EVB</scope></search><sort><creationdate>20220825</creationdate><title>ITERATIVE TRAINING FOR TEXT-IMAGE-LAYOUT TRANSFORMER</title><author>BORCHMANN, Lukasz Konrad ; DANCEWICZ, Adam ; GRALINSKI, Filip</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_WO2022175849A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>BORCHMANN, Lukasz Konrad</creatorcontrib><creatorcontrib>DANCEWICZ, Adam</creatorcontrib><creatorcontrib>GRALINSKI, Filip</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>BORCHMANN, Lukasz Konrad</au><au>DANCEWICZ, Adam</au><au>GRALINSKI, Filip</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>ITERATIVE TRAINING FOR TEXT-IMAGE-LAYOUT TRANSFORMER</title><date>2022-08-25</date><risdate>2022</risdate><abstract>Disclosed herein is a system and method for Natural Language Processing (NLP) of real world documents. The system and method combines various models not previously combined and overcomes the challenges of this combination. Models include an encoder-decoder model, a spatial model, and a multi-modal model. An iterative training process receives documents and generates outputs, wherein the iterative training process comprises enabling information retrieval from documents without training data. Sont divulgués un système et un procédé de traitement de langage naturel (TLN) de documents du monde réel. Le système et le procédé combinent divers modèles n'ayant jamais été combinés et surmontent les défis de cette combinaison. Les modèles comprennent un modèle codeur-décodeur, un modèle spatial et un modèle multimodal. Un processus d'entraînement itératif reçoit des documents et génère des sorties, le processus d'entraînement itératif comprenant l'activation de l'extraction d'informations à partir de documents sans données d'entraînement.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng ; fre
recordid	cdi_epo_espacenet_WO2022175849A1
source	esp@cenet
subjects	CALCULATING COMPUTING COUNTING HANDLING RECORD CARRIERS PHYSICS PRESENTATION OF DATA RECOGNITION OF DATA RECORD CARRIERS
title	ITERATIVE TRAINING FOR TEXT-IMAGE-LAYOUT TRANSFORMER
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T17%3A05%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=BORCHMANN,%20Lukasz%20Konrad&rft.date=2022-08-25&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EWO2022175849A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true