Automatic zone identification in scientific papers via fusion techniques

Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientometrics 2019-05, Vol.119 (2), p.845-862
Hauptverfasser: Asadi, Nasrin, Badie, Kambiz, Mahmoudi, Maryam Tayefeh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 862
container_issue 2
container_start_page 845
container_title Scientometrics
container_volume 119
creator Asadi, Nasrin
Badie, Kambiz
Mahmoudi, Maryam Tayefeh
description Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification within which the first level is in charge of classifying the sentences in a given paper based on some semantic and lexical features. In this respect, several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization are applied. The second level is responsible for applying fusion to the classification results obtained for consecutive sentences of the first level in order to make the final decision. The proposed method is evaluated on ART and DRI corpora as two well-known data sets. Results obtained for the accuracy of zone identification for these corpora are respectively 65.75% and 84.15%, which seem to be quite promising compared to those obtained by previous approaches.
doi_str_mv 10.1007/s11192-019-03060-9
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2210554336</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2210554336</sourcerecordid><originalsourceid>FETCH-LOGICAL-c372t-495b887bc03284b05f96b7e0e966e9122cb19eed3b92fd97fe593fcd10154b983</originalsourceid><addsrcrecordid>eNp9kEFLxDAUhIMouK7-AU8Fz9H3kqZNjsuirrDgRc-hSRPN4rY1aQX99Wat4M3Tg3nzzcAQcolwjQD1TUJExSigosChAqqOyAKFlJTJCo_JApBLqpDDKTlLaQcZ4iAXZLOaxn7fjMEWX33nitC6bgw-2Cz1XRG6ItnwKxVDM7iYio_QFH5Kh__o7GsX3ieXzsmJb96Su_i9S_J8d_u03tDt4_3DerWlltdspKUSRsraWOBMlgaEV5WpHThVVU4hY9agcq7lRjHfqto7obi3LQKK0ijJl-Rqzh1if-gd9a6fYpcrNWMIQpScV9nFZpeNfUrReT3EsG_ip0bQh8X0vJjOi-mfxbTKEJ-hlM3di4t_0f9Q34kwboA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2210554336</pqid></control><display><type>article</type><title>Automatic zone identification in scientific papers via fusion techniques</title><source>SpringerLink Journals - AutoHoldings</source><creator>Asadi, Nasrin ; Badie, Kambiz ; Mahmoudi, Maryam Tayefeh</creator><creatorcontrib>Asadi, Nasrin ; Badie, Kambiz ; Mahmoudi, Maryam Tayefeh</creatorcontrib><description>Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification within which the first level is in charge of classifying the sentences in a given paper based on some semantic and lexical features. In this respect, several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization are applied. The second level is responsible for applying fusion to the classification results obtained for consecutive sentences of the first level in order to make the final decision. The proposed method is evaluated on ART and DRI corpora as two well-known data sets. Results obtained for the accuracy of zone identification for these corpora are respectively 65.75% and 84.15%, which seem to be quite promising compared to those obtained by previous approaches.</description><identifier>ISSN: 0138-9130</identifier><identifier>EISSN: 1588-2861</identifier><identifier>DOI: 10.1007/s11192-019-03060-9</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Algorithms ; Classification ; Computer Science ; Data mining ; Decision making ; Identification ; Information retrieval ; Information Storage and Retrieval ; Learning algorithms ; Library Science ; Logistics ; Machine learning ; Optimization ; Scientific papers ; Sentences</subject><ispartof>Scientometrics, 2019-05, Vol.119 (2), p.845-862</ispartof><rights>Akadémiai Kiadó, Budapest, Hungary 2019</rights><rights>Copyright Springer Nature B.V. 2019</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c372t-495b887bc03284b05f96b7e0e966e9122cb19eed3b92fd97fe593fcd10154b983</citedby><cites>FETCH-LOGICAL-c372t-495b887bc03284b05f96b7e0e966e9122cb19eed3b92fd97fe593fcd10154b983</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11192-019-03060-9$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11192-019-03060-9$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids></links><search><creatorcontrib>Asadi, Nasrin</creatorcontrib><creatorcontrib>Badie, Kambiz</creatorcontrib><creatorcontrib>Mahmoudi, Maryam Tayefeh</creatorcontrib><title>Automatic zone identification in scientific papers via fusion techniques</title><title>Scientometrics</title><addtitle>Scientometrics</addtitle><description>Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification within which the first level is in charge of classifying the sentences in a given paper based on some semantic and lexical features. In this respect, several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization are applied. The second level is responsible for applying fusion to the classification results obtained for consecutive sentences of the first level in order to make the final decision. The proposed method is evaluated on ART and DRI corpora as two well-known data sets. Results obtained for the accuracy of zone identification for these corpora are respectively 65.75% and 84.15%, which seem to be quite promising compared to those obtained by previous approaches.</description><subject>Algorithms</subject><subject>Classification</subject><subject>Computer Science</subject><subject>Data mining</subject><subject>Decision making</subject><subject>Identification</subject><subject>Information retrieval</subject><subject>Information Storage and Retrieval</subject><subject>Learning algorithms</subject><subject>Library Science</subject><subject>Logistics</subject><subject>Machine learning</subject><subject>Optimization</subject><subject>Scientific papers</subject><subject>Sentences</subject><issn>0138-9130</issn><issn>1588-2861</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9kEFLxDAUhIMouK7-AU8Fz9H3kqZNjsuirrDgRc-hSRPN4rY1aQX99Wat4M3Tg3nzzcAQcolwjQD1TUJExSigosChAqqOyAKFlJTJCo_JApBLqpDDKTlLaQcZ4iAXZLOaxn7fjMEWX33nitC6bgw-2Cz1XRG6ItnwKxVDM7iYio_QFH5Kh__o7GsX3ieXzsmJb96Su_i9S_J8d_u03tDt4_3DerWlltdspKUSRsraWOBMlgaEV5WpHThVVU4hY9agcq7lRjHfqto7obi3LQKK0ijJl-Rqzh1if-gd9a6fYpcrNWMIQpScV9nFZpeNfUrReT3EsG_ip0bQh8X0vJjOi-mfxbTKEJ-hlM3di4t_0f9Q34kwboA</recordid><startdate>20190515</startdate><enddate>20190515</enddate><creator>Asadi, Nasrin</creator><creator>Badie, Kambiz</creator><creator>Mahmoudi, Maryam Tayefeh</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope></search><sort><creationdate>20190515</creationdate><title>Automatic zone identification in scientific papers via fusion techniques</title><author>Asadi, Nasrin ; Badie, Kambiz ; Mahmoudi, Maryam Tayefeh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c372t-495b887bc03284b05f96b7e0e966e9122cb19eed3b92fd97fe593fcd10154b983</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Classification</topic><topic>Computer Science</topic><topic>Data mining</topic><topic>Decision making</topic><topic>Identification</topic><topic>Information retrieval</topic><topic>Information Storage and Retrieval</topic><topic>Learning algorithms</topic><topic>Library Science</topic><topic>Logistics</topic><topic>Machine learning</topic><topic>Optimization</topic><topic>Scientific papers</topic><topic>Sentences</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Asadi, Nasrin</creatorcontrib><creatorcontrib>Badie, Kambiz</creatorcontrib><creatorcontrib>Mahmoudi, Maryam Tayefeh</creatorcontrib><collection>CrossRef</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><jtitle>Scientometrics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Asadi, Nasrin</au><au>Badie, Kambiz</au><au>Mahmoudi, Maryam Tayefeh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic zone identification in scientific papers via fusion techniques</atitle><jtitle>Scientometrics</jtitle><stitle>Scientometrics</stitle><date>2019-05-15</date><risdate>2019</risdate><volume>119</volume><issue>2</issue><spage>845</spage><epage>862</epage><pages>845-862</pages><issn>0138-9130</issn><eissn>1588-2861</eissn><abstract>Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification within which the first level is in charge of classifying the sentences in a given paper based on some semantic and lexical features. In this respect, several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization are applied. The second level is responsible for applying fusion to the classification results obtained for consecutive sentences of the first level in order to make the final decision. The proposed method is evaluated on ART and DRI corpora as two well-known data sets. Results obtained for the accuracy of zone identification for these corpora are respectively 65.75% and 84.15%, which seem to be quite promising compared to those obtained by previous approaches.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><doi>10.1007/s11192-019-03060-9</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0138-9130
ispartof Scientometrics, 2019-05, Vol.119 (2), p.845-862
issn 0138-9130
1588-2861
language eng
recordid cdi_proquest_journals_2210554336
source SpringerLink Journals - AutoHoldings
subjects Algorithms
Classification
Computer Science
Data mining
Decision making
Identification
Information retrieval
Information Storage and Retrieval
Learning algorithms
Library Science
Logistics
Machine learning
Optimization
Scientific papers
Sentences
title Automatic zone identification in scientific papers via fusion techniques
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T03%3A12%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20zone%20identification%20in%20scientific%20papers%20via%20fusion%20techniques&rft.jtitle=Scientometrics&rft.au=Asadi,%20Nasrin&rft.date=2019-05-15&rft.volume=119&rft.issue=2&rft.spage=845&rft.epage=862&rft.pages=845-862&rft.issn=0138-9130&rft.eissn=1588-2861&rft_id=info:doi/10.1007/s11192-019-03060-9&rft_dat=%3Cproquest_cross%3E2210554336%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2210554336&rft_id=info:pmid/&rfr_iscdi=true