Synthetic Image Data for Deep Learning

Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based o...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Anderson, Jason W, Ziolkowski, Marcin, Kennedy, Ken, Apon, Amy W
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition Computer Science - Learning
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Anderson, Jason W Ziolkowski, Marcin Kennedy, Ken Apon, Amy W
description	Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90\% of the model training to be front-loaded.
doi_str_mv	10.48550/arxiv.2212.06232
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2212_06232</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2212_06232</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-add2f43979ad33c53639feeaa6baabec8844658c85000276d6c970e814223e33</originalsourceid><addsrcrecordid>eNotzrsKwjAUgOEsDqI-gJOZ3FrTc5I0HcU7FBx0L8f0VAtaJRbRtxcv07_9fEIMExVrZ4yaUHjWjxgggVhZQOiK8e7VtCduay83FzqynFNLsroGOWe-yZwpNHVz7ItORec7D_7tid1ysZ-to3y72symeUQ2hYjKEiqNWZpRiegNWswqZiJ7IDqwd05ra5x3RikFqS2tz1LFLtEAyIg9Mfpdv87iFuoLhVfx8RZfL74BrTA45A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Synthetic Image Data for Deep Learning</title><source>arXiv.org</source><creator>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</creator><creatorcontrib>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</creatorcontrib><description>Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90\% of the model training to be front-loaded.</description><identifier>DOI: 10.48550/arxiv.2212.06232</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Learning</subject><creationdate>2022-12</creationdate><rights>http://creativecommons.org/licenses/by-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2212.06232$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2212.06232$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Anderson, Jason W</creatorcontrib><creatorcontrib>Ziolkowski, Marcin</creatorcontrib><creatorcontrib>Kennedy, Ken</creatorcontrib><creatorcontrib>Apon, Amy W</creatorcontrib><title>Synthetic Image Data for Deep Learning</title><description>Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90\% of the model training to be front-loaded.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzrsKwjAUgOEsDqI-gJOZ3FrTc5I0HcU7FBx0L8f0VAtaJRbRtxcv07_9fEIMExVrZ4yaUHjWjxgggVhZQOiK8e7VtCduay83FzqynFNLsroGOWe-yZwpNHVz7ItORec7D_7tid1ysZ-to3y72symeUQ2hYjKEiqNWZpRiegNWswqZiJ7IDqwd05ra5x3RikFqS2tz1LFLtEAyIg9Mfpdv87iFuoLhVfx8RZfL74BrTA45A</recordid><startdate>20221212</startdate><enddate>20221212</enddate><creator>Anderson, Jason W</creator><creator>Ziolkowski, Marcin</creator><creator>Kennedy, Ken</creator><creator>Apon, Amy W</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20221212</creationdate><title>Synthetic Image Data for Deep Learning</title><author>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-add2f43979ad33c53639feeaa6baabec8844658c85000276d6c970e814223e33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Anderson, Jason W</creatorcontrib><creatorcontrib>Ziolkowski, Marcin</creatorcontrib><creatorcontrib>Kennedy, Ken</creatorcontrib><creatorcontrib>Apon, Amy W</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Anderson, Jason W</au><au>Ziolkowski, Marcin</au><au>Kennedy, Ken</au><au>Apon, Amy W</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Synthetic Image Data for Deep Learning</atitle><date>2022-12-12</date><risdate>2022</risdate><abstract>Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90\% of the model training to be front-loaded.</abstract><doi>10.48550/arxiv.2212.06232</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2212.06232
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2212_06232
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition Computer Science - Learning
title	Synthetic Image Data for Deep Learning
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T20%3A59%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Synthetic%20Image%20Data%20for%20Deep%20Learning&rft.au=Anderson,%20Jason%20W&rft.date=2022-12-12&rft_id=info:doi/10.48550/arxiv.2212.06232&rft_dat=%3Carxiv_GOX%3E2212_06232%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true