MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale
We present MapReader, a free, open-source software library written in Python for analyzing large map collections (scanned or born-digital). This library transforms the way historians can use maps by turning extensive, homogeneous map sets into searchable primary sources. MapReader allows users with...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Hosseini, Kasra Wilson, Daniel C. S Beelen, Kaspar McDonough, Katherine |
description | We present MapReader, a free, open-source software library written in Python
for analyzing large map collections (scanned or born-digital). This library
transforms the way historians can use maps by turning extensive, homogeneous
map sets into searchable primary sources. MapReader allows users with little or
no computer vision expertise to i) retrieve maps via web-servers; ii)
preprocess and divide them into patches; iii) annotate patches; iv) train,
fine-tune, and evaluate deep neural network models; and v) create structured
data about map content. We demonstrate how MapReader enables historians to
interpret a collection of $\approx$16K nineteenth-century Ordnance Survey map
sheets ($\approx$30.5M patches), foregrounding the challenge of translating
visual markers into machine-readable data. We present a case study focusing on
British rail infrastructure and buildings as depicted on these maps. We also
show how the outputs from the MapReader pipeline can be linked to other,
external datasets, which we use to evaluate as well as enrich and interpret the
results. We release $\approx$62K manually annotated patches used here for
training and evaluating the models. |
doi_str_mv | 10.48550/arxiv.2111.15592 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2111_15592</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2111_15592</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-63e00d5a1d21df957732dbb38f81c25b75076c0e0e46fdb0e1c83b3255c73a933</originalsourceid><addsrcrecordid>eNotz8tOwzAQhWFvWKDCA7BiXiDBlzpO2FVRuUitQKRiG03ssbCUxJETUHl7aGF1Nke_9DF2I3i-LrXmd5iO4SuXQohcaF3JS9bscXojdJTuYQN1HKbPhRK8hznEEV7DRH0YCXxMsHwQNDTguAQL2-PUx4TL6RU9_FZmwAUaiz1dsQuP_UzX_7tih4ftoX7Kdi-Pz_Vml2FhZFYo4txpFE4K5yttjJKu61TpS2Gl7ozmprCcOK0L7zpOwpaqU1JraxRWSq3Y7V_2rGqnFAZM3-1J15516geMdElt</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale</title><source>arXiv.org</source><creator>Hosseini, Kasra ; Wilson, Daniel C. S ; Beelen, Kaspar ; McDonough, Katherine</creator><creatorcontrib>Hosseini, Kasra ; Wilson, Daniel C. S ; Beelen, Kaspar ; McDonough, Katherine</creatorcontrib><description>We present MapReader, a free, open-source software library written in Python
for analyzing large map collections (scanned or born-digital). This library
transforms the way historians can use maps by turning extensive, homogeneous
map sets into searchable primary sources. MapReader allows users with little or
no computer vision expertise to i) retrieve maps via web-servers; ii)
preprocess and divide them into patches; iii) annotate patches; iv) train,
fine-tune, and evaluate deep neural network models; and v) create structured
data about map content. We demonstrate how MapReader enables historians to
interpret a collection of $\approx$16K nineteenth-century Ordnance Survey map
sheets ($\approx$30.5M patches), foregrounding the challenge of translating
visual markers into machine-readable data. We present a case study focusing on
British rail infrastructure and buildings as depicted on these maps. We also
show how the outputs from the MapReader pipeline can be linked to other,
external datasets, which we use to evaluate as well as enrich and interpret the
results. We release $\approx$62K manually annotated patches used here for
training and evaluating the models.</description><identifier>DOI: 10.48550/arxiv.2111.15592</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Learning</subject><creationdate>2021-11</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2111.15592$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2111.15592$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Hosseini, Kasra</creatorcontrib><creatorcontrib>Wilson, Daniel C. S</creatorcontrib><creatorcontrib>Beelen, Kaspar</creatorcontrib><creatorcontrib>McDonough, Katherine</creatorcontrib><title>MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale</title><description>We present MapReader, a free, open-source software library written in Python
for analyzing large map collections (scanned or born-digital). This library
transforms the way historians can use maps by turning extensive, homogeneous
map sets into searchable primary sources. MapReader allows users with little or
no computer vision expertise to i) retrieve maps via web-servers; ii)
preprocess and divide them into patches; iii) annotate patches; iv) train,
fine-tune, and evaluate deep neural network models; and v) create structured
data about map content. We demonstrate how MapReader enables historians to
interpret a collection of $\approx$16K nineteenth-century Ordnance Survey map
sheets ($\approx$30.5M patches), foregrounding the challenge of translating
visual markers into machine-readable data. We present a case study focusing on
British rail infrastructure and buildings as depicted on these maps. We also
show how the outputs from the MapReader pipeline can be linked to other,
external datasets, which we use to evaluate as well as enrich and interpret the
results. We release $\approx$62K manually annotated patches used here for
training and evaluating the models.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz8tOwzAQhWFvWKDCA7BiXiDBlzpO2FVRuUitQKRiG03ssbCUxJETUHl7aGF1Nke_9DF2I3i-LrXmd5iO4SuXQohcaF3JS9bscXojdJTuYQN1HKbPhRK8hznEEV7DRH0YCXxMsHwQNDTguAQL2-PUx4TL6RU9_FZmwAUaiz1dsQuP_UzX_7tih4ftoX7Kdi-Pz_Vml2FhZFYo4txpFE4K5yttjJKu61TpS2Gl7ozmprCcOK0L7zpOwpaqU1JraxRWSq3Y7V_2rGqnFAZM3-1J15516geMdElt</recordid><startdate>20211130</startdate><enddate>20211130</enddate><creator>Hosseini, Kasra</creator><creator>Wilson, Daniel C. S</creator><creator>Beelen, Kaspar</creator><creator>McDonough, Katherine</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20211130</creationdate><title>MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale</title><author>Hosseini, Kasra ; Wilson, Daniel C. S ; Beelen, Kaspar ; McDonough, Katherine</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-63e00d5a1d21df957732dbb38f81c25b75076c0e0e46fdb0e1c83b3255c73a933</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Hosseini, Kasra</creatorcontrib><creatorcontrib>Wilson, Daniel C. S</creatorcontrib><creatorcontrib>Beelen, Kaspar</creatorcontrib><creatorcontrib>McDonough, Katherine</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hosseini, Kasra</au><au>Wilson, Daniel C. S</au><au>Beelen, Kaspar</au><au>McDonough, Katherine</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale</atitle><date>2021-11-30</date><risdate>2021</risdate><abstract>We present MapReader, a free, open-source software library written in Python
for analyzing large map collections (scanned or born-digital). This library
transforms the way historians can use maps by turning extensive, homogeneous
map sets into searchable primary sources. MapReader allows users with little or
no computer vision expertise to i) retrieve maps via web-servers; ii)
preprocess and divide them into patches; iii) annotate patches; iv) train,
fine-tune, and evaluate deep neural network models; and v) create structured
data about map content. We demonstrate how MapReader enables historians to
interpret a collection of $\approx$16K nineteenth-century Ordnance Survey map
sheets ($\approx$30.5M patches), foregrounding the challenge of translating
visual markers into machine-readable data. We present a case study focusing on
British rail infrastructure and buildings as depicted on these maps. We also
show how the outputs from the MapReader pipeline can be linked to other,
external datasets, which we use to evaluate as well as enrich and interpret the
results. We release $\approx$62K manually annotated patches used here for
training and evaluating the models.</abstract><doi>10.48550/arxiv.2111.15592</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2111.15592 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2111_15592 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition Computer Science - Learning |
title | MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T20%3A40%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MapReader:%20A%20Computer%20Vision%20Pipeline%20for%20the%20Semantic%20Exploration%20of%20Maps%20at%20Scale&rft.au=Hosseini,%20Kasra&rft.date=2021-11-30&rft_id=info:doi/10.48550/arxiv.2111.15592&rft_dat=%3Carxiv_GOX%3E2111_15592%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |