PatFig: Generating Short and Long Captions for Patent Figures
This paper introduces Qatent PatFig, a novel large-scale patent figure dataset comprising 30,000+ patent figures from over 11,000 European patent applications. For each figure, this dataset provides short and long captions, reference numerals, their corresponding terms, and the minimal claim set tha...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper introduces Qatent PatFig, a novel large-scale patent figure
dataset comprising 30,000+ patent figures from over 11,000 European patent
applications. For each figure, this dataset provides short and long captions,
reference numerals, their corresponding terms, and the minimal claim set that
describes the interactions between the components of the image. To assess the
usability of the dataset, we finetune an LVLM model on Qatent PatFig to
generate short and long descriptions, and we investigate the effects of
incorporating various text-based cues at the prediction stage of the patent
figure captioning process. |
---|---|
DOI: | 10.48550/arxiv.2309.08379 |