Embedding Human Knowledge into Deep Neural Network via Attention Map

In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model paramete...

Bibliographic details
Main authors:	Mitsuhara, Masahiro ; Fukui, Hiroshi ; Sakashita, Yusuke ; Ogata, Takanori ; Hirakawa, Tsubasa ; Yamashita, Takayoshi ; Fujiyoshi, Hironobu
Format:	Article
Language:	eng
Subjects:	Computer Science - Computer Vision and Pattern Recognition
Online access:	Order full text

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Mitsuhara, Masahiro ; Fukui, Hiroshi ; Sakashita, Yusuke ; Ogata, Takanori ; Hirakawa, Tsubasa ; Yamashita, Takayoshi ; Fujiyoshi, Hironobu
description	In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model parameters. To tackle this problem, we focus on the attention mechanism of an attention branch network (ABN). In this paper, we propose a fine-tuning method that utilizes a single-channel attention map which is manually edited by a human expert. Our fine-tuning method can train a network so that the output attention map corresponds to the edited ones. As a result, the fine-tuned network can output an attention map that takes into account human knowledge. Experimental results with ImageNet, CUB-200-2010, and IDRiD demonstrate that it is possible to obtain a clear attention map for a visual explanation and improve the classification performance. Our findings can be a novel framework for optimizing networks through human intuitive editing via a visual interface and suggest new possibilities for human-machine cooperation in addition to the improvement of visual explanations.
doi_str_mv	10.48550/arxiv.1905.03540
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1905_03540</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1905_03540</sourcerecordid><originalsourceid>FETCH-LOGICAL-a1150-7c7a34c4e47a3960801001111f2b8450aff747dd018fec35808abcb482adcfe23</originalsourceid><addsrcrecordid>eNotz7FOwzAUhWEvDKjwAEz4BRKuY5u4Y9UWiiiwwBzd2NeVReJExm3h7QktZ_m2I_2M3QgoldEa7jB9h0Mp5qBLkFrBJVut-5acC3HHN_seI3-Ow7EjtyMeYh74imjkr7RP2E3k45A--SEgX-RMMYch8hccr9iFx-6Lrv-dsY-H9ftyU2zfHp-Wi22BQmgoalujVFaRmpzfgwEBIKb5qjVKA3pfq9o5EMaTldqAwda2ylTorKdKztjt-ffU0Ywp9Jh-mr-e5tQjfwEJi0Sn</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Embedding Human Knowledge into Deep Neural Network via Attention Map</title><source>arXiv.org</source><creator>Mitsuhara, Masahiro ; Fukui, Hiroshi ; Sakashita, Yusuke ; Ogata, Takanori ; Hirakawa, Tsubasa ; Yamashita, Takayoshi ; Fujiyoshi, Hironobu</creator><creatorcontrib>Mitsuhara, Masahiro ; Fukui, Hiroshi ; Sakashita, Yusuke ; Ogata, Takanori ; Hirakawa, Tsubasa ; Yamashita, Takayoshi ; Fujiyoshi, Hironobu</creatorcontrib><description>In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model parameters. To tackle this problem, we focus on the attention mechanism of an attention branch network (ABN). In this paper, we propose a fine-tuning method that utilizes a single-channel attention map which is manually edited by a human expert. Our fine-tuning method can train a network so that the output attention map corresponds to the edited ones. As a result, the fine-tuned network can output an attention map that takes into account human knowledge. 
Experimental results with ImageNet, CUB-200-2010, and IDRiD demonstrate that it is possible to obtain a clear attention map for a visual explanation and improve the classification performance. Our findings can be a novel framework for optimizing networks through human intuitive editing via a visual interface and suggest new possibilities for human-machine cooperation in addition to the improvement of visual explanations.</description><identifier>DOI: 10.48550/arxiv.1905.03540</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2019-05</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a1150-7c7a34c4e47a3960801001111f2b8450aff747dd018fec35808abcb482adcfe23</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1905.03540$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1905.03540$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Mitsuhara, Masahiro</creatorcontrib><creatorcontrib>Fukui, Hiroshi</creatorcontrib><creatorcontrib>Sakashita, Yusuke</creatorcontrib><creatorcontrib>Ogata, Takanori</creatorcontrib><creatorcontrib>Hirakawa, Tsubasa</creatorcontrib><creatorcontrib>Yamashita, Takayoshi</creatorcontrib><creatorcontrib>Fujiyoshi, Hironobu</creatorcontrib><title>Embedding Human Knowledge into Deep Neural Network via Attention Map</title><description>In this work, we aim to realize a method for embedding human knowledge into deep neural networks. 
While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model parameters. To tackle this problem, we focus on the attention mechanism of an attention branch network (ABN). In this paper, we propose a fine-tuning method that utilizes a single-channel attention map which is manually edited by a human expert. Our fine-tuning method can train a network so that the output attention map corresponds to the edited ones. As a result, the fine-tuned network can output an attention map that takes into account human knowledge. Experimental results with ImageNet, CUB-200-2010, and IDRiD demonstrate that it is possible to obtain a clear attention map for a visual explanation and improve the classification performance. Our findings can be a novel framework for optimizing networks through human intuitive editing via a visual interface and suggest new possibilities for human-machine cooperation in addition to the improvement of visual explanations.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz7FOwzAUhWEvDKjwAEz4BRKuY5u4Y9UWiiiwwBzd2NeVReJExm3h7QktZ_m2I_2M3QgoldEa7jB9h0Mp5qBLkFrBJVut-5acC3HHN_seI3-Ow7EjtyMeYh74imjkr7RP2E3k45A--SEgX-RMMYch8hccr9iFx-6Lrv-dsY-H9ftyU2zfHp-Wi22BQmgoalujVFaRmpzfgwEBIKb5qjVKA3pfq9o5EMaTldqAwda2ylTorKdKztjt-ffU0Ywp9Jh-mr-e5tQjfwEJi0Sn</recordid><startdate>20190509</startdate><enddate>20190509</enddate><creator>Mitsuhara, Masahiro</creator><creator>Fukui, Hiroshi</creator><creator>Sakashita, Yusuke</creator><creator>Ogata, Takanori</creator><creator>Hirakawa, Tsubasa</creator><creator>Yamashita, Takayoshi</creator><creator>Fujiyoshi, 
Hironobu</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190509</creationdate><title>Embedding Human Knowledge into Deep Neural Network via Attention Map</title><author>Mitsuhara, Masahiro ; Fukui, Hiroshi ; Sakashita, Yusuke ; Ogata, Takanori ; Hirakawa, Tsubasa ; Yamashita, Takayoshi ; Fujiyoshi, Hironobu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a1150-7c7a34c4e47a3960801001111f2b8450aff747dd018fec35808abcb482adcfe23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Mitsuhara, Masahiro</creatorcontrib><creatorcontrib>Fukui, Hiroshi</creatorcontrib><creatorcontrib>Sakashita, Yusuke</creatorcontrib><creatorcontrib>Ogata, Takanori</creatorcontrib><creatorcontrib>Hirakawa, Tsubasa</creatorcontrib><creatorcontrib>Yamashita, Takayoshi</creatorcontrib><creatorcontrib>Fujiyoshi, Hironobu</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Mitsuhara, Masahiro</au><au>Fukui, Hiroshi</au><au>Sakashita, Yusuke</au><au>Ogata, Takanori</au><au>Hirakawa, Tsubasa</au><au>Yamashita, Takayoshi</au><au>Fujiyoshi, Hironobu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Embedding Human Knowledge into Deep Neural Network via Attention Map</atitle><date>2019-05-09</date><risdate>2019</risdate><abstract>In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model parameters. 
To tackle this problem, we focus on the attention mechanism of an attention branch network (ABN). In this paper, we propose a fine-tuning method that utilizes a single-channel attention map which is manually edited by a human expert. Our fine-tuning method can train a network so that the output attention map corresponds to the edited ones. As a result, the fine-tuned network can output an attention map that takes into account human knowledge. Experimental results with ImageNet, CUB-200-2010, and IDRiD demonstrate that it is possible to obtain a clear attention map for a visual explanation and improve the classification performance. Our findings can be a novel framework for optimizing networks through human intuitive editing via a visual interface and suggest new possibilities for human-machine cooperation in addition to the improvement of visual explanations.</abstract><doi>10.48550/arxiv.1905.03540</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1905.03540
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1905_03540
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	Embedding Human Knowledge into Deep Neural Network via Attention Map
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T08%3A54%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Embedding%20Human%20Knowledge%20into%20Deep%20Neural%20Network%20via%20Attention%20Map&rft.au=Mitsuhara,%20Masahiro&rft.date=2019-05-09&rft_id=info:doi/10.48550/arxiv.1905.03540&rft_dat=%3Carxiv_GOX%3E1905_03540%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true