Text-independent writer identification using convolutional neural network

•A novel method for text-independent writer identification.•Organization of training samples for Convolutional Neural Network.•Feature aggregation to form global features from local features.•99.97% accuracy to classify 100 writers by 200 characters for handwritten Japanese.•91.81% accuracy to class...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Pattern recognition letters 2019-04, Vol.121, p.104-112
Hauptverfasser:	Nguyen, Hung Tuan, Nguyen, Cuong Tuan, Ino, Takeya, Indurkhya, Bipin, Nakagawa, Masaki
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Algorithms Artificial neural networks Classification Clustering Feature extraction Handwriting Identification Machine learning Neural networks Teaching methods Training Writers
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	112
container_issue
container_start_page	104
container_title	Pattern recognition letters
container_volume	121
creator	Nguyen, Hung Tuan Nguyen, Cuong Tuan Ino, Takeya Indurkhya, Bipin Nakagawa, Masaki
description	•A novel method for text-independent writer identification.•Organization of training samples for Convolutional Neural Network.•Feature aggregation to form global features from local features.•99.97% accuracy to classify 100 writers by 200 characters for handwritten Japanese.•91.81% accuracy to classify 900 writers by one text page for handwritten English. The text-independent approach to writer identification does not require the writer to write some predetermined text. Previous research on text-independent writer identification has been based on identifying writer-specific features designed by experts. However, in the last decade, deep learning methods have been successfully applied to learn features from data automatically. We propose here an end-to-end deep-learning method for text-independent writer identification that does not require prior identification of features. A Convolutional Neural Network (CNN) is trained initially to extract local features, which represent characteristics of individual handwriting in the whole character images and their sub-regions. Randomly sampled tuples of images from the training set are used to train the CNN and aggregate the extracted local features of images from the tuples to form global features. For every training epoch, the process of randomly sampling tuples is repeated, which is equivalent to a large number of training patterns being prepared for training the CNN for text-independent writer identification. We conducted experiments on the JEITA-HP database of offline handwritten Japanese character patterns. With 200 characters, our method achieved an accuracy of 99.97% to classify 100 writers. Even when using 50 characters for 100 writers or 100 characters for 400 writers, our method achieved accuracy levels of 92.80% or 93.82%, respectively. We conducted further experiments on the Firemaker and IAM databases of offline handwritten English text. Using only one page per writer to train, our method achieved over 91.81% accuracy to classify 900 writers. Overall, we achieved a better performance than the previously published best result based on handcrafted features and clustering algorithms, which demonstrates the effectiveness of our method for handwritten English text also.
doi_str_mv	10.1016/j.patrec.2018.07.022
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2196510007</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167865518303180</els_id><sourcerecordid>2196510007</sourcerecordid><originalsourceid>FETCH-LOGICAL-c334t-ffe75b69cf7998a4157510b8e8371a5160c2ecb5331d31e32223e7628531bb6f3</originalsourceid><addsrcrecordid>eNp9kMtOwzAQRS0EEqXwBywisU7wI35kg4QqHpUqsSlrK3EmyKHEwXZa-HtcwprNXM3ozNXMReia4IJgIm77YqyjB1NQTFSBZYEpPUELoiTNJSvLU7RImMyV4PwcXYTQY4wFq9QCrbfwFXM7tDBCKkPMDt5G8Jk9Nrazpo7WDdkU7PCWGTfs3W46TupdNsDkfyUenH-_RGddvQtw9adL9Pr4sF0955uXp_XqfpMbxsqYdx1I3ojKdLKqVF0SLjnBjQLFJKk5EdhQMA1njLSMAKOUMpCCKs5I04iOLdHN7Dt69zlBiLp3k0_3BE1JJZIZxjJR5UwZ70Lw0OnR24_af2uC9TE03es5NH0MTWOpU2hp7W5eg_TB3oLXwVgYDLQ2oVG3zv5v8ANvIXe8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2196510007</pqid></control><display><type>article</type><title>Text-independent writer identification using convolutional neural network</title><source>Elsevier ScienceDirect Journals</source><creator>Nguyen, Hung Tuan ; Nguyen, Cuong Tuan ; Ino, Takeya ; Indurkhya, Bipin ; Nakagawa, Masaki</creator><creatorcontrib>Nguyen, Hung Tuan ; Nguyen, Cuong Tuan ; Ino, Takeya ; Indurkhya, Bipin ; Nakagawa, Masaki</creatorcontrib><description>•A novel method for text-independent writer identification.•Organization of training samples for Convolutional Neural Network.•Feature aggregation to form global features from local features.•99.97% accuracy to classify 100 writers by 200 characters for handwritten Japanese.•91.81% accuracy to classify 900 writers by one text page for handwritten English. The text-independent approach to writer identification does not require the writer to write some predetermined text. Previous research on text-independent writer identification has been based on identifying writer-specific features designed by experts. However, in the last decade, deep learning methods have been successfully applied to learn features from data automatically. We propose here an end-to-end deep-learning method for text-independent writer identification that does not require prior identification of features. A Convolutional Neural Network (CNN) is trained initially to extract local features, which represent characteristics of individual handwriting in the whole character images and their sub-regions. Randomly sampled tuples of images from the training set are used to train the CNN and aggregate the extracted local features of images from the tuples to form global features. For every training epoch, the process of randomly sampling tuples is repeated, which is equivalent to a large number of training patterns being prepared for training the CNN for text-independent writer identification. We conducted experiments on the JEITA-HP database of offline handwritten Japanese character patterns. With 200 characters, our method achieved an accuracy of 99.97% to classify 100 writers. Even when using 50 characters for 100 writers or 100 characters for 400 writers, our method achieved accuracy levels of 92.80% or 93.82%, respectively. We conducted further experiments on the Firemaker and IAM databases of offline handwritten English text. Using only one page per writer to train, our method achieved over 91.81% accuracy to classify 900 writers. Overall, we achieved a better performance than the previously published best result based on handcrafted features and clustering algorithms, which demonstrates the effectiveness of our method for handwritten English text also.</description><identifier>ISSN: 0167-8655</identifier><identifier>EISSN: 1872-7344</identifier><identifier>DOI: 10.1016/j.patrec.2018.07.022</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Accuracy ; Algorithms ; Artificial neural networks ; Classification ; Clustering ; Feature extraction ; Handwriting ; Identification ; Machine learning ; Neural networks ; Teaching methods ; Training ; Writers</subject><ispartof>Pattern recognition letters, 2019-04, Vol.121, p.104-112</ispartof><rights>2018 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Apr 15, 2019</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c334t-ffe75b69cf7998a4157510b8e8371a5160c2ecb5331d31e32223e7628531bb6f3</citedby><cites>FETCH-LOGICAL-c334t-ffe75b69cf7998a4157510b8e8371a5160c2ecb5331d31e32223e7628531bb6f3</cites><orcidid>0000-0003-4751-1302 ; 0000-0001-7872-156X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0167865518303180$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Nguyen, Hung Tuan</creatorcontrib><creatorcontrib>Nguyen, Cuong Tuan</creatorcontrib><creatorcontrib>Ino, Takeya</creatorcontrib><creatorcontrib>Indurkhya, Bipin</creatorcontrib><creatorcontrib>Nakagawa, Masaki</creatorcontrib><title>Text-independent writer identification using convolutional neural network</title><title>Pattern recognition letters</title><description>•A novel method for text-independent writer identification.•Organization of training samples for Convolutional Neural Network.•Feature aggregation to form global features from local features.•99.97% accuracy to classify 100 writers by 200 characters for handwritten Japanese.•91.81% accuracy to classify 900 writers by one text page for handwritten English. The text-independent approach to writer identification does not require the writer to write some predetermined text. Previous research on text-independent writer identification has been based on identifying writer-specific features designed by experts. However, in the last decade, deep learning methods have been successfully applied to learn features from data automatically. We propose here an end-to-end deep-learning method for text-independent writer identification that does not require prior identification of features. A Convolutional Neural Network (CNN) is trained initially to extract local features, which represent characteristics of individual handwriting in the whole character images and their sub-regions. Randomly sampled tuples of images from the training set are used to train the CNN and aggregate the extracted local features of images from the tuples to form global features. For every training epoch, the process of randomly sampling tuples is repeated, which is equivalent to a large number of training patterns being prepared for training the CNN for text-independent writer identification. We conducted experiments on the JEITA-HP database of offline handwritten Japanese character patterns. With 200 characters, our method achieved an accuracy of 99.97% to classify 100 writers. Even when using 50 characters for 100 writers or 100 characters for 400 writers, our method achieved accuracy levels of 92.80% or 93.82%, respectively. We conducted further experiments on the Firemaker and IAM databases of offline handwritten English text. Using only one page per writer to train, our method achieved over 91.81% accuracy to classify 900 writers. Overall, we achieved a better performance than the previously published best result based on handcrafted features and clustering algorithms, which demonstrates the effectiveness of our method for handwritten English text also.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>Classification</subject><subject>Clustering</subject><subject>Feature extraction</subject><subject>Handwriting</subject><subject>Identification</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Teaching methods</subject><subject>Training</subject><subject>Writers</subject><issn>0167-8655</issn><issn>1872-7344</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9kMtOwzAQRS0EEqXwBywisU7wI35kg4QqHpUqsSlrK3EmyKHEwXZa-HtcwprNXM3ozNXMReia4IJgIm77YqyjB1NQTFSBZYEpPUELoiTNJSvLU7RImMyV4PwcXYTQY4wFq9QCrbfwFXM7tDBCKkPMDt5G8Jk9Nrazpo7WDdkU7PCWGTfs3W46TupdNsDkfyUenH-_RGddvQtw9adL9Pr4sF0955uXp_XqfpMbxsqYdx1I3ojKdLKqVF0SLjnBjQLFJKk5EdhQMA1njLSMAKOUMpCCKs5I04iOLdHN7Dt69zlBiLp3k0_3BE1JJZIZxjJR5UwZ70Lw0OnR24_af2uC9TE03es5NH0MTWOpU2hp7W5eg_TB3oLXwVgYDLQ2oVG3zv5v8ANvIXe8</recordid><startdate>20190415</startdate><enddate>20190415</enddate><creator>Nguyen, Hung Tuan</creator><creator>Nguyen, Cuong Tuan</creator><creator>Ino, Takeya</creator><creator>Indurkhya, Bipin</creator><creator>Nakagawa, Masaki</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7TK</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0003-4751-1302</orcidid><orcidid>https://orcid.org/0000-0001-7872-156X</orcidid></search><sort><creationdate>20190415</creationdate><title>Text-independent writer identification using convolutional neural network</title><author>Nguyen, Hung Tuan ; Nguyen, Cuong Tuan ; Ino, Takeya ; Indurkhya, Bipin ; Nakagawa, Masaki</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c334t-ffe75b69cf7998a4157510b8e8371a5160c2ecb5331d31e32223e7628531bb6f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>Classification</topic><topic>Clustering</topic><topic>Feature extraction</topic><topic>Handwriting</topic><topic>Identification</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Teaching methods</topic><topic>Training</topic><topic>Writers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nguyen, Hung Tuan</creatorcontrib><creatorcontrib>Nguyen, Cuong Tuan</creatorcontrib><creatorcontrib>Ino, Takeya</creatorcontrib><creatorcontrib>Indurkhya, Bipin</creatorcontrib><creatorcontrib>Nakagawa, Masaki</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Pattern recognition letters</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Nguyen, Hung Tuan</au><au>Nguyen, Cuong Tuan</au><au>Ino, Takeya</au><au>Indurkhya, Bipin</au><au>Nakagawa, Masaki</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Text-independent writer identification using convolutional neural network</atitle><jtitle>Pattern recognition letters</jtitle><date>2019-04-15</date><risdate>2019</risdate><volume>121</volume><spage>104</spage><epage>112</epage><pages>104-112</pages><issn>0167-8655</issn><eissn>1872-7344</eissn><abstract>•A novel method for text-independent writer identification.•Organization of training samples for Convolutional Neural Network.•Feature aggregation to form global features from local features.•99.97% accuracy to classify 100 writers by 200 characters for handwritten Japanese.•91.81% accuracy to classify 900 writers by one text page for handwritten English. The text-independent approach to writer identification does not require the writer to write some predetermined text. Previous research on text-independent writer identification has been based on identifying writer-specific features designed by experts. However, in the last decade, deep learning methods have been successfully applied to learn features from data automatically. We propose here an end-to-end deep-learning method for text-independent writer identification that does not require prior identification of features. A Convolutional Neural Network (CNN) is trained initially to extract local features, which represent characteristics of individual handwriting in the whole character images and their sub-regions. Randomly sampled tuples of images from the training set are used to train the CNN and aggregate the extracted local features of images from the tuples to form global features. For every training epoch, the process of randomly sampling tuples is repeated, which is equivalent to a large number of training patterns being prepared for training the CNN for text-independent writer identification. We conducted experiments on the JEITA-HP database of offline handwritten Japanese character patterns. With 200 characters, our method achieved an accuracy of 99.97% to classify 100 writers. Even when using 50 characters for 100 writers or 100 characters for 400 writers, our method achieved accuracy levels of 92.80% or 93.82%, respectively. We conducted further experiments on the Firemaker and IAM databases of offline handwritten English text. Using only one page per writer to train, our method achieved over 91.81% accuracy to classify 900 writers. Overall, we achieved a better performance than the previously published best result based on handcrafted features and clustering algorithms, which demonstrates the effectiveness of our method for handwritten English text also.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.patrec.2018.07.022</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0003-4751-1302</orcidid><orcidid>https://orcid.org/0000-0001-7872-156X</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0167-8655
ispartof	Pattern recognition letters, 2019-04, Vol.121, p.104-112
issn	0167-8655 1872-7344
language	eng
recordid	cdi_proquest_journals_2196510007
source	Elsevier ScienceDirect Journals
subjects	Accuracy Algorithms Artificial neural networks Classification Clustering Feature extraction Handwriting Identification Machine learning Neural networks Teaching methods Training Writers
title	Text-independent writer identification using convolutional neural network
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T03%3A20%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Text-independent%20writer%20identification%20using%20convolutional%20neural%20network&rft.jtitle=Pattern%20recognition%20letters&rft.au=Nguyen,%20Hung%20Tuan&rft.date=2019-04-15&rft.volume=121&rft.spage=104&rft.epage=112&rft.pages=104-112&rft.issn=0167-8655&rft.eissn=1872-7344&rft_id=info:doi/10.1016/j.patrec.2018.07.022&rft_dat=%3Cproquest_cross%3E2196510007%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2196510007&rft_id=info:pmid/&rft_els_id=S0167865518303180&rfr_iscdi=true