Synthetic Image Data for Deep Learning
Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based o...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Anderson, Jason W Ziolkowski, Marcin Kennedy, Ken Apon, Amy W |
description | Realistic synthetic image data rendered from 3D models can be used to augment
image sets and train image classification semantic segmentation models. In this
work, we explore how high quality physically-based rendering and domain
randomization can efficiently create a large synthetic dataset based on
production 3D CAD models of a real vehicle. We use this dataset to quantify the
effectiveness of synthetic augmentation using U-net and Double-U-net models. We
found that, for this domain, synthetic images were an effective technique for
augmenting limited sets of real training data. We observed that models trained
on purely synthetic images had a very low mean prediction IoU on real
validation images. We also observed that adding even very small amounts of real
images to a synthetic dataset greatly improved accuracy, and that models
trained on datasets augmented with synthetic images were more accurate than
those trained on real images alone. Finally, we found that in use cases that
benefit from incremental training or model specialization, pretraining a base
model on synthetic images provided a sizeable reduction in the training cost of
transfer learning, allowing up to 90\% of the model training to be
front-loaded. |
doi_str_mv | 10.48550/arxiv.2212.06232 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2212_06232</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2212_06232</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-add2f43979ad33c53639feeaa6baabec8844658c85000276d6c970e814223e33</originalsourceid><addsrcrecordid>eNotzrsKwjAUgOEsDqI-gJOZ3FrTc5I0HcU7FBx0L8f0VAtaJRbRtxcv07_9fEIMExVrZ4yaUHjWjxgggVhZQOiK8e7VtCduay83FzqynFNLsroGOWe-yZwpNHVz7ItORec7D_7tid1ysZ-to3y72symeUQ2hYjKEiqNWZpRiegNWswqZiJ7IDqwd05ra5x3RikFqS2tz1LFLtEAyIg9Mfpdv87iFuoLhVfx8RZfL74BrTA45A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Synthetic Image Data for Deep Learning</title><source>arXiv.org</source><creator>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</creator><creatorcontrib>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</creatorcontrib><description>Realistic synthetic image data rendered from 3D models can be used to augment
image sets and train image classification semantic segmentation models. In this
work, we explore how high quality physically-based rendering and domain
randomization can efficiently create a large synthetic dataset based on
production 3D CAD models of a real vehicle. We use this dataset to quantify the
effectiveness of synthetic augmentation using U-net and Double-U-net models. We
found that, for this domain, synthetic images were an effective technique for
augmenting limited sets of real training data. We observed that models trained
on purely synthetic images had a very low mean prediction IoU on real
validation images. We also observed that adding even very small amounts of real
images to a synthetic dataset greatly improved accuracy, and that models
trained on datasets augmented with synthetic images were more accurate than
those trained on real images alone. Finally, we found that in use cases that
benefit from incremental training or model specialization, pretraining a base
model on synthetic images provided a sizeable reduction in the training cost of
transfer learning, allowing up to 90\% of the model training to be
front-loaded.</description><identifier>DOI: 10.48550/arxiv.2212.06232</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Learning</subject><creationdate>2022-12</creationdate><rights>http://creativecommons.org/licenses/by-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2212.06232$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2212.06232$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Anderson, Jason W</creatorcontrib><creatorcontrib>Ziolkowski, Marcin</creatorcontrib><creatorcontrib>Kennedy, Ken</creatorcontrib><creatorcontrib>Apon, Amy W</creatorcontrib><title>Synthetic Image Data for Deep Learning</title><description>Realistic synthetic image data rendered from 3D models can be used to augment
image sets and train image classification semantic segmentation models. In this
work, we explore how high quality physically-based rendering and domain
randomization can efficiently create a large synthetic dataset based on
production 3D CAD models of a real vehicle. We use this dataset to quantify the
effectiveness of synthetic augmentation using U-net and Double-U-net models. We
found that, for this domain, synthetic images were an effective technique for
augmenting limited sets of real training data. We observed that models trained
on purely synthetic images had a very low mean prediction IoU on real
validation images. We also observed that adding even very small amounts of real
images to a synthetic dataset greatly improved accuracy, and that models
trained on datasets augmented with synthetic images were more accurate than
those trained on real images alone. Finally, we found that in use cases that
benefit from incremental training or model specialization, pretraining a base
model on synthetic images provided a sizeable reduction in the training cost of
transfer learning, allowing up to 90\% of the model training to be
front-loaded.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzrsKwjAUgOEsDqI-gJOZ3FrTc5I0HcU7FBx0L8f0VAtaJRbRtxcv07_9fEIMExVrZ4yaUHjWjxgggVhZQOiK8e7VtCduay83FzqynFNLsroGOWe-yZwpNHVz7ItORec7D_7tid1ysZ-to3y72symeUQ2hYjKEiqNWZpRiegNWswqZiJ7IDqwd05ra5x3RikFqS2tz1LFLtEAyIg9Mfpdv87iFuoLhVfx8RZfL74BrTA45A</recordid><startdate>20221212</startdate><enddate>20221212</enddate><creator>Anderson, Jason W</creator><creator>Ziolkowski, Marcin</creator><creator>Kennedy, Ken</creator><creator>Apon, Amy W</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20221212</creationdate><title>Synthetic Image Data for Deep Learning</title><author>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-add2f43979ad33c53639feeaa6baabec8844658c85000276d6c970e814223e33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Anderson, Jason W</creatorcontrib><creatorcontrib>Ziolkowski, Marcin</creatorcontrib><creatorcontrib>Kennedy, Ken</creatorcontrib><creatorcontrib>Apon, Amy W</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Anderson, Jason W</au><au>Ziolkowski, Marcin</au><au>Kennedy, Ken</au><au>Apon, Amy W</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Synthetic Image Data for Deep Learning</atitle><date>2022-12-12</date><risdate>2022</risdate><abstract>Realistic synthetic image data rendered from 3D models can be used to augment
image sets and train image classification semantic segmentation models. In this
work, we explore how high quality physically-based rendering and domain
randomization can efficiently create a large synthetic dataset based on
production 3D CAD models of a real vehicle. We use this dataset to quantify the
effectiveness of synthetic augmentation using U-net and Double-U-net models. We
found that, for this domain, synthetic images were an effective technique for
augmenting limited sets of real training data. We observed that models trained
on purely synthetic images had a very low mean prediction IoU on real
validation images. We also observed that adding even very small amounts of real
images to a synthetic dataset greatly improved accuracy, and that models
trained on datasets augmented with synthetic images were more accurate than
those trained on real images alone. Finally, we found that in use cases that
benefit from incremental training or model specialization, pretraining a base
model on synthetic images provided a sizeable reduction in the training cost of
transfer learning, allowing up to 90\% of the model training to be
front-loaded.</abstract><doi>10.48550/arxiv.2212.06232</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2212.06232 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2212_06232 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition Computer Science - Learning |
title | Synthetic Image Data for Deep Learning |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T20%3A59%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Synthetic%20Image%20Data%20for%20Deep%20Learning&rft.au=Anderson,%20Jason%20W&rft.date=2022-12-12&rft_id=info:doi/10.48550/arxiv.2212.06232&rft_dat=%3Carxiv_GOX%3E2212_06232%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |