Synthetic Image Data for Deep Learning

Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based o...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Anderson, Jason W, Ziolkowski, Marcin, Kennedy, Ken, Apon, Amy W
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Anderson, Jason W
Ziolkowski, Marcin
Kennedy, Ken
Apon, Amy W
description Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90\% of the model training to be front-loaded.
doi_str_mv 10.48550/arxiv.2212.06232
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2212_06232</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2212_06232</sourcerecordid><originalsourceid>FETCH-LOGICAL-a672-add2f43979ad33c53639feeaa6baabec8844658c85000276d6c970e814223e33</originalsourceid><addsrcrecordid>eNotzrsKwjAUgOEsDqI-gJOZ3FrTc5I0HcU7FBx0L8f0VAtaJRbRtxcv07_9fEIMExVrZ4yaUHjWjxgggVhZQOiK8e7VtCduay83FzqynFNLsroGOWe-yZwpNHVz7ItORec7D_7tid1ysZ-to3y72symeUQ2hYjKEiqNWZpRiegNWswqZiJ7IDqwd05ra5x3RikFqS2tz1LFLtEAyIg9Mfpdv87iFuoLhVfx8RZfL74BrTA45A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Synthetic Image Data for Deep Learning</title><source>arXiv.org</source><creator>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</creator><creatorcontrib>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</creatorcontrib><description>Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90\% of the model training to be front-loaded.</description><identifier>DOI: 10.48550/arxiv.2212.06232</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Learning</subject><creationdate>2022-12</creationdate><rights>http://creativecommons.org/licenses/by-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2212.06232$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2212.06232$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Anderson, Jason W</creatorcontrib><creatorcontrib>Ziolkowski, Marcin</creatorcontrib><creatorcontrib>Kennedy, Ken</creatorcontrib><creatorcontrib>Apon, Amy W</creatorcontrib><title>Synthetic Image Data for Deep Learning</title><description>Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90\% of the model training to be front-loaded.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzrsKwjAUgOEsDqI-gJOZ3FrTc5I0HcU7FBx0L8f0VAtaJRbRtxcv07_9fEIMExVrZ4yaUHjWjxgggVhZQOiK8e7VtCduay83FzqynFNLsroGOWe-yZwpNHVz7ItORec7D_7tid1ysZ-to3y72symeUQ2hYjKEiqNWZpRiegNWswqZiJ7IDqwd05ra5x3RikFqS2tz1LFLtEAyIg9Mfpdv87iFuoLhVfx8RZfL74BrTA45A</recordid><startdate>20221212</startdate><enddate>20221212</enddate><creator>Anderson, Jason W</creator><creator>Ziolkowski, Marcin</creator><creator>Kennedy, Ken</creator><creator>Apon, Amy W</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20221212</creationdate><title>Synthetic Image Data for Deep Learning</title><author>Anderson, Jason W ; Ziolkowski, Marcin ; Kennedy, Ken ; Apon, Amy W</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a672-add2f43979ad33c53639feeaa6baabec8844658c85000276d6c970e814223e33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Anderson, Jason W</creatorcontrib><creatorcontrib>Ziolkowski, Marcin</creatorcontrib><creatorcontrib>Kennedy, Ken</creatorcontrib><creatorcontrib>Apon, Amy W</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Anderson, Jason W</au><au>Ziolkowski, Marcin</au><au>Kennedy, Ken</au><au>Apon, Amy W</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Synthetic Image Data for Deep Learning</atitle><date>2022-12-12</date><risdate>2022</risdate><abstract>Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90\% of the model training to be front-loaded.</abstract><doi>10.48550/arxiv.2212.06232</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2212.06232
ispartof
issn
language eng
recordid cdi_arxiv_primary_2212_06232
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
Computer Science - Learning
title Synthetic Image Data for Deep Learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T20%3A59%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Synthetic%20Image%20Data%20for%20Deep%20Learning&rft.au=Anderson,%20Jason%20W&rft.date=2022-12-12&rft_id=info:doi/10.48550/arxiv.2212.06232&rft_dat=%3Carxiv_GOX%3E2212_06232%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true