Chapbooks_HTO: A Knowledge Graph for representing the "Chapbooks Printed In Scotland" (1671 - 1893) following the Heritage Textual Ontology
This Knowlege Graph represents the information of the "Chapbooks Printed In Scotland" (years: 1671 - 1893) collection in RDF (ttl format). This dataset comprises more than 3,000 chapbooks printed in Scotland from the 17th to 19th century. They form part of the Lauriston Castle Collection,...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Dataset |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This Knowlege Graph represents the information of the "Chapbooks Printed In Scotland" (years: 1671 - 1893) collection in RDF (ttl format). This dataset comprises more than 3,000 chapbooks printed in Scotland from the 17th to 19th century. They form part of the Lauriston Castle Collection, which was bequeathed to the Library in 1926. It includes some 500 chapbook volumes containing around 5,500 individual items, more than half of which were printed in Scotland. The raw dataset is provided by the NLS in this link. As other NLS data collections, they are originally provided using two XMLs schemas: METS for descriptive, structural, technical and administrative metadata (Title, Author, Publisher, etc); and ALTO for encoding the OCR text of a page.
The KG uses the HTO to represent the information extracted. Furthermore, during the information extraction phase, we have employed several techniques to mitigate two common OCR errors: long-S and the line-break hyphenation. |
---|---|
DOI: | 10.5281/zenodo.14051651 |