Automatic zone identification in scientific papers via fusion techniques
Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful...
Gespeichert in:
Veröffentlicht in: | Scientometrics 2019-05, Vol.119 (2), p.845-862 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 862 |
---|---|
container_issue | 2 |
container_start_page | 845 |
container_title | Scientometrics |
container_volume | 119 |
creator | Asadi, Nasrin Badie, Kambiz Mahmoudi, Maryam Tayefeh |
description | Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification within which the first level is in charge of classifying the sentences in a given paper based on some semantic and lexical features. In this respect, several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization are applied. The second level is responsible for applying fusion to the classification results obtained for consecutive sentences of the first level in order to make the final decision. The proposed method is evaluated on ART and DRI corpora as two well-known data sets. Results obtained for the accuracy of zone identification for these corpora are respectively 65.75% and 84.15%, which seem to be quite promising compared to those obtained by previous approaches. |
doi_str_mv | 10.1007/s11192-019-03060-9 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2210554336</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2210554336</sourcerecordid><originalsourceid>FETCH-LOGICAL-c372t-495b887bc03284b05f96b7e0e966e9122cb19eed3b92fd97fe593fcd10154b983</originalsourceid><addsrcrecordid>eNp9kEFLxDAUhIMouK7-AU8Fz9H3kqZNjsuirrDgRc-hSRPN4rY1aQX99Wat4M3Tg3nzzcAQcolwjQD1TUJExSigosChAqqOyAKFlJTJCo_JApBLqpDDKTlLaQcZ4iAXZLOaxn7fjMEWX33nitC6bgw-2Cz1XRG6ItnwKxVDM7iYio_QFH5Kh__o7GsX3ieXzsmJb96Su_i9S_J8d_u03tDt4_3DerWlltdspKUSRsraWOBMlgaEV5WpHThVVU4hY9agcq7lRjHfqto7obi3LQKK0ijJl-Rqzh1if-gd9a6fYpcrNWMIQpScV9nFZpeNfUrReT3EsG_ip0bQh8X0vJjOi-mfxbTKEJ-hlM3di4t_0f9Q34kwboA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2210554336</pqid></control><display><type>article</type><title>Automatic zone identification in scientific papers via fusion techniques</title><source>SpringerLink Journals - AutoHoldings</source><creator>Asadi, Nasrin ; Badie, Kambiz ; Mahmoudi, Maryam Tayefeh</creator><creatorcontrib>Asadi, Nasrin ; Badie, Kambiz ; Mahmoudi, Maryam Tayefeh</creatorcontrib><description>Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification within which the first level is in charge of classifying the sentences in a given paper based on some semantic and lexical features. In this respect, several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization are applied. The second level is responsible for applying fusion to the classification results obtained for consecutive sentences of the first level in order to make the final decision. The proposed method is evaluated on ART and DRI corpora as two well-known data sets. Results obtained for the accuracy of zone identification for these corpora are respectively 65.75% and 84.15%, which seem to be quite promising compared to those obtained by previous approaches.</description><identifier>ISSN: 0138-9130</identifier><identifier>EISSN: 1588-2861</identifier><identifier>DOI: 10.1007/s11192-019-03060-9</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Algorithms ; Classification ; Computer Science ; Data mining ; Decision making ; Identification ; Information retrieval ; Information Storage and Retrieval ; Learning algorithms ; Library Science ; Logistics ; Machine learning ; Optimization ; Scientific papers ; Sentences</subject><ispartof>Scientometrics, 2019-05, Vol.119 (2), p.845-862</ispartof><rights>Akadémiai Kiadó, Budapest, Hungary 2019</rights><rights>Copyright Springer Nature B.V. 2019</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c372t-495b887bc03284b05f96b7e0e966e9122cb19eed3b92fd97fe593fcd10154b983</citedby><cites>FETCH-LOGICAL-c372t-495b887bc03284b05f96b7e0e966e9122cb19eed3b92fd97fe593fcd10154b983</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11192-019-03060-9$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11192-019-03060-9$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids></links><search><creatorcontrib>Asadi, Nasrin</creatorcontrib><creatorcontrib>Badie, Kambiz</creatorcontrib><creatorcontrib>Mahmoudi, Maryam Tayefeh</creatorcontrib><title>Automatic zone identification in scientific papers via fusion techniques</title><title>Scientometrics</title><addtitle>Scientometrics</addtitle><description>Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification within which the first level is in charge of classifying the sentences in a given paper based on some semantic and lexical features. In this respect, several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization are applied. The second level is responsible for applying fusion to the classification results obtained for consecutive sentences of the first level in order to make the final decision. The proposed method is evaluated on ART and DRI corpora as two well-known data sets. Results obtained for the accuracy of zone identification for these corpora are respectively 65.75% and 84.15%, which seem to be quite promising compared to those obtained by previous approaches.</description><subject>Algorithms</subject><subject>Classification</subject><subject>Computer Science</subject><subject>Data mining</subject><subject>Decision making</subject><subject>Identification</subject><subject>Information retrieval</subject><subject>Information Storage and Retrieval</subject><subject>Learning algorithms</subject><subject>Library Science</subject><subject>Logistics</subject><subject>Machine learning</subject><subject>Optimization</subject><subject>Scientific papers</subject><subject>Sentences</subject><issn>0138-9130</issn><issn>1588-2861</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9kEFLxDAUhIMouK7-AU8Fz9H3kqZNjsuirrDgRc-hSRPN4rY1aQX99Wat4M3Tg3nzzcAQcolwjQD1TUJExSigosChAqqOyAKFlJTJCo_JApBLqpDDKTlLaQcZ4iAXZLOaxn7fjMEWX33nitC6bgw-2Cz1XRG6ItnwKxVDM7iYio_QFH5Kh__o7GsX3ieXzsmJb96Su_i9S_J8d_u03tDt4_3DerWlltdspKUSRsraWOBMlgaEV5WpHThVVU4hY9agcq7lRjHfqto7obi3LQKK0ijJl-Rqzh1if-gd9a6fYpcrNWMIQpScV9nFZpeNfUrReT3EsG_ip0bQh8X0vJjOi-mfxbTKEJ-hlM3di4t_0f9Q34kwboA</recordid><startdate>20190515</startdate><enddate>20190515</enddate><creator>Asadi, Nasrin</creator><creator>Badie, Kambiz</creator><creator>Mahmoudi, Maryam Tayefeh</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope></search><sort><creationdate>20190515</creationdate><title>Automatic zone identification in scientific papers via fusion techniques</title><author>Asadi, Nasrin ; Badie, Kambiz ; Mahmoudi, Maryam Tayefeh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c372t-495b887bc03284b05f96b7e0e966e9122cb19eed3b92fd97fe593fcd10154b983</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Classification</topic><topic>Computer Science</topic><topic>Data mining</topic><topic>Decision making</topic><topic>Identification</topic><topic>Information retrieval</topic><topic>Information Storage and Retrieval</topic><topic>Learning algorithms</topic><topic>Library Science</topic><topic>Logistics</topic><topic>Machine learning</topic><topic>Optimization</topic><topic>Scientific papers</topic><topic>Sentences</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Asadi, Nasrin</creatorcontrib><creatorcontrib>Badie, Kambiz</creatorcontrib><creatorcontrib>Mahmoudi, Maryam Tayefeh</creatorcontrib><collection>CrossRef</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><jtitle>Scientometrics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Asadi, Nasrin</au><au>Badie, Kambiz</au><au>Mahmoudi, Maryam Tayefeh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic zone identification in scientific papers via fusion techniques</atitle><jtitle>Scientometrics</jtitle><stitle>Scientometrics</stitle><date>2019-05-15</date><risdate>2019</risdate><volume>119</volume><issue>2</issue><spage>845</spage><epage>862</epage><pages>845-862</pages><issn>0138-9130</issn><eissn>1588-2861</eissn><abstract>Zone identification is a topic in the area of text mining which helps researchers be benefited by the content of scientific papers in a satisfactory manner. The major aim of zone identification is to classify the sentences of scientific texts into some predefined zone categories which can be useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification within which the first level is in charge of classifying the sentences in a given paper based on some semantic and lexical features. In this respect, several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization are applied. The second level is responsible for applying fusion to the classification results obtained for consecutive sentences of the first level in order to make the final decision. The proposed method is evaluated on ART and DRI corpora as two well-known data sets. Results obtained for the accuracy of zone identification for these corpora are respectively 65.75% and 84.15%, which seem to be quite promising compared to those obtained by previous approaches.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><doi>10.1007/s11192-019-03060-9</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0138-9130 |
ispartof | Scientometrics, 2019-05, Vol.119 (2), p.845-862 |
issn | 0138-9130 1588-2861 |
language | eng |
recordid | cdi_proquest_journals_2210554336 |
source | SpringerLink Journals - AutoHoldings |
subjects | Algorithms Classification Computer Science Data mining Decision making Identification Information retrieval Information Storage and Retrieval Learning algorithms Library Science Logistics Machine learning Optimization Scientific papers Sentences |
title | Automatic zone identification in scientific papers via fusion techniques |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T03%3A12%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20zone%20identification%20in%20scientific%20papers%20via%20fusion%20techniques&rft.jtitle=Scientometrics&rft.au=Asadi,%20Nasrin&rft.date=2019-05-15&rft.volume=119&rft.issue=2&rft.spage=845&rft.epage=862&rft.pages=845-862&rft.issn=0138-9130&rft.eissn=1588-2861&rft_id=info:doi/10.1007/s11192-019-03060-9&rft_dat=%3Cproquest_cross%3E2210554336%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2210554336&rft_id=info:pmid/&rfr_iscdi=true |