ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing

Most molecular diagram parsers recover chemical structure from raster images (e.g., PNGs). However, many PDFs include commands giving explicit locations and shapes for characters, lines, and polygons. We present a new parser that uses these born-digital PDF primitives as input. The parsing model is...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Shah, Ayush Kumar, Amador, Bryan Manrique, Dey, Abhisek, Creekmore, Ming, Ocampo, Blake, Denmark, Scott, Zanibbi, Richard
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!