Embedding Human Knowledge into Deep Neural Network via Attention Map

In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model paramete...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Mitsuhara, Masahiro, Fukui, Hiroshi, Sakashita, Yusuke, Ogata, Takanori, Hirakawa, Tsubasa, Yamashita, Takayoshi, Fujiyoshi, Hironobu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Mitsuhara, Masahiro
Fukui, Hiroshi
Sakashita, Yusuke
Ogata, Takanori
Hirakawa, Tsubasa
Yamashita, Takayoshi
Fujiyoshi, Hironobu
description In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model parameters. To tackle this problem, we focus on the attention mechanism of an attention branch network (ABN). In this paper, we propose a fine-tuning method that utilizes a single-channel attention map which is manually edited by a human expert. Our fine-tuning method can train a network so that the output attention map corresponds to the edited ones. As a result, the fine-tuned network can output an attention map that takes into account human knowledge. Experimental results with ImageNet, CUB-200-2010, and IDRiD demonstrate that it is possible to obtain a clear attention map for a visual explanation and improve the classification performance. Our findings can be a novel framework for optimizing networks through human intuitive editing via a visual interface and suggest new possibilities for human-machine cooperation in addition to the improvement of visual explanations.
doi_str_mv 10.48550/arxiv.1905.03540
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1905_03540</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1905_03540</sourcerecordid><originalsourceid>FETCH-LOGICAL-a1150-7c7a34c4e47a3960801001111f2b8450aff747dd018fec35808abcb482adcfe23</originalsourceid><addsrcrecordid>eNotz7FOwzAUhWEvDKjwAEz4BRKuY5u4Y9UWiiiwwBzd2NeVReJExm3h7QktZ_m2I_2M3QgoldEa7jB9h0Mp5qBLkFrBJVut-5acC3HHN_seI3-Ow7EjtyMeYh74imjkr7RP2E3k45A--SEgX-RMMYch8hccr9iFx-6Lrv-dsY-H9ftyU2zfHp-Wi22BQmgoalujVFaRmpzfgwEBIKb5qjVKA3pfq9o5EMaTldqAwda2ylTorKdKztjt-ffU0Ywp9Jh-mr-e5tQjfwEJi0Sn</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Embedding Human Knowledge into Deep Neural Network via Attention Map</title><source>arXiv.org</source><creator>Mitsuhara, Masahiro ; Fukui, Hiroshi ; Sakashita, Yusuke ; Ogata, Takanori ; Hirakawa, Tsubasa ; Yamashita, Takayoshi ; Fujiyoshi, Hironobu</creator><creatorcontrib>Mitsuhara, Masahiro ; Fukui, Hiroshi ; Sakashita, Yusuke ; Ogata, Takanori ; Hirakawa, Tsubasa ; Yamashita, Takayoshi ; Fujiyoshi, Hironobu</creatorcontrib><description>In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model parameters. To tackle this problem, we focus on the attention mechanism of an attention branch network (ABN). In this paper, we propose a fine-tuning method that utilizes a single-channel attention map which is manually edited by a human expert. Our fine-tuning method can train a network so that the output attention map corresponds to the edited ones. As a result, the fine-tuned network can output an attention map that takes into account human knowledge. Experimental results with ImageNet, CUB-200-2010, and IDRiD demonstrate that it is possible to obtain a clear attention map for a visual explanation and improve the classification performance. Our findings can be a novel framework for optimizing networks through human intuitive editing via a visual interface and suggest new possibilities for human-machine cooperation in addition to the improvement of visual explanations.</description><identifier>DOI: 10.48550/arxiv.1905.03540</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2019-05</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a1150-7c7a34c4e47a3960801001111f2b8450aff747dd018fec35808abcb482adcfe23</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1905.03540$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1905.03540$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Mitsuhara, Masahiro</creatorcontrib><creatorcontrib>Fukui, Hiroshi</creatorcontrib><creatorcontrib>Sakashita, Yusuke</creatorcontrib><creatorcontrib>Ogata, Takanori</creatorcontrib><creatorcontrib>Hirakawa, Tsubasa</creatorcontrib><creatorcontrib>Yamashita, Takayoshi</creatorcontrib><creatorcontrib>Fujiyoshi, Hironobu</creatorcontrib><title>Embedding Human Knowledge into Deep Neural Network via Attention Map</title><description>In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model parameters. To tackle this problem, we focus on the attention mechanism of an attention branch network (ABN). In this paper, we propose a fine-tuning method that utilizes a single-channel attention map which is manually edited by a human expert. Our fine-tuning method can train a network so that the output attention map corresponds to the edited ones. As a result, the fine-tuned network can output an attention map that takes into account human knowledge. Experimental results with ImageNet, CUB-200-2010, and IDRiD demonstrate that it is possible to obtain a clear attention map for a visual explanation and improve the classification performance. Our findings can be a novel framework for optimizing networks through human intuitive editing via a visual interface and suggest new possibilities for human-machine cooperation in addition to the improvement of visual explanations.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz7FOwzAUhWEvDKjwAEz4BRKuY5u4Y9UWiiiwwBzd2NeVReJExm3h7QktZ_m2I_2M3QgoldEa7jB9h0Mp5qBLkFrBJVut-5acC3HHN_seI3-Ow7EjtyMeYh74imjkr7RP2E3k45A--SEgX-RMMYch8hccr9iFx-6Lrv-dsY-H9ftyU2zfHp-Wi22BQmgoalujVFaRmpzfgwEBIKb5qjVKA3pfq9o5EMaTldqAwda2ylTorKdKztjt-ffU0Ywp9Jh-mr-e5tQjfwEJi0Sn</recordid><startdate>20190509</startdate><enddate>20190509</enddate><creator>Mitsuhara, Masahiro</creator><creator>Fukui, Hiroshi</creator><creator>Sakashita, Yusuke</creator><creator>Ogata, Takanori</creator><creator>Hirakawa, Tsubasa</creator><creator>Yamashita, Takayoshi</creator><creator>Fujiyoshi, Hironobu</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190509</creationdate><title>Embedding Human Knowledge into Deep Neural Network via Attention Map</title><author>Mitsuhara, Masahiro ; Fukui, Hiroshi ; Sakashita, Yusuke ; Ogata, Takanori ; Hirakawa, Tsubasa ; Yamashita, Takayoshi ; Fujiyoshi, Hironobu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a1150-7c7a34c4e47a3960801001111f2b8450aff747dd018fec35808abcb482adcfe23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Mitsuhara, Masahiro</creatorcontrib><creatorcontrib>Fukui, Hiroshi</creatorcontrib><creatorcontrib>Sakashita, Yusuke</creatorcontrib><creatorcontrib>Ogata, Takanori</creatorcontrib><creatorcontrib>Hirakawa, Tsubasa</creatorcontrib><creatorcontrib>Yamashita, Takayoshi</creatorcontrib><creatorcontrib>Fujiyoshi, Hironobu</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Mitsuhara, Masahiro</au><au>Fukui, Hiroshi</au><au>Sakashita, Yusuke</au><au>Ogata, Takanori</au><au>Hirakawa, Tsubasa</au><au>Yamashita, Takayoshi</au><au>Fujiyoshi, Hironobu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Embedding Human Knowledge into Deep Neural Network via Attention Map</atitle><date>2019-05-09</date><risdate>2019</risdate><abstract>In this work, we aim to realize a method for embedding human knowledge into deep neural networks. While the conventional method to embed human knowledge has been applied for non-deep machine learning, it is challenging to apply it for deep learning models due to the enormous number of model parameters. To tackle this problem, we focus on the attention mechanism of an attention branch network (ABN). In this paper, we propose a fine-tuning method that utilizes a single-channel attention map which is manually edited by a human expert. Our fine-tuning method can train a network so that the output attention map corresponds to the edited ones. As a result, the fine-tuned network can output an attention map that takes into account human knowledge. Experimental results with ImageNet, CUB-200-2010, and IDRiD demonstrate that it is possible to obtain a clear attention map for a visual explanation and improve the classification performance. Our findings can be a novel framework for optimizing networks through human intuitive editing via a visual interface and suggest new possibilities for human-machine cooperation in addition to the improvement of visual explanations.</abstract><doi>10.48550/arxiv.1905.03540</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1905.03540
ispartof
issn
language eng
recordid cdi_arxiv_primary_1905_03540
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title Embedding Human Knowledge into Deep Neural Network via Attention Map
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T08%3A54%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Embedding%20Human%20Knowledge%20into%20Deep%20Neural%20Network%20via%20Attention%20Map&rft.au=Mitsuhara,%20Masahiro&rft.date=2019-05-09&rft_id=info:doi/10.48550/arxiv.1905.03540&rft_dat=%3Carxiv_GOX%3E1905_03540%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true