Probabilistic Approach Processing Scheme Based on BLAST for Improving Search Speed of Bioinformatics

As researchers on bioinformatics using heuristic algorithms have been increasingly studied, information management used in various bioinformatics fields (new drug development, medical diagnosis, agricultural product improvement, etc.) has been studied mainly on BLAST algorithm. However, many of the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Wireless personal communications 2019-03, Vol.105 (2), p.405-426
Hauptverfasser: Jeong, Yoon-Su, Shin, Seung-Soo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 426
container_issue 2
container_start_page 405
container_title Wireless personal communications
container_volume 105
creator Jeong, Yoon-Su
Shin, Seung-Soo
description As researchers on bioinformatics using heuristic algorithms have been increasingly studied, information management used in various bioinformatics fields (new drug development, medical diagnosis, agricultural product improvement, etc.) has been studied mainly on BLAST algorithm. However, many of the algorithms that are being used in the large genome database use a complete sorting procedure, which takes a lot of time to search the database for proteins or nucleic acid sequences, which causes many problems in processing large amounts of bio information. We propose a BLAST-based probabilistic access processing method that can manage, analyze and process a large amount of bio data distributed based on information communication infrastructure and IT technology. The proposed method aims to improve the accessibility of data by linking weighted bioinformatics information with probability factors to easily access large capacity bio data. In addition, the proposed scheme classifies the priority information allocated to the bioinformatics information by hierarchical grouping according to the degree of similarity, thereby ensuring high accuracy of the search results of the bioinformatics information, and at the same time, the goal is to obtain low processing time by classifying information (type, attribute, priority, etc.) into weights by property. Previous researchers have suggested clustering algorithms for fragmentation of genetic information to solve the problem of haplotype assembly in genetics, or proposed particle swarm optimization methods similar to existing genetic algorithms using heuristic clustering method based on MEC model. In the performance evaluation, the proposed method improved the accuracy by average 13.5% and the efficiency of the data retrieval by average 19.7% more than previous scheme. The overhead of Bioinformatics information processing was 8.8% lower and the processing time was average 13.5% lower.
doi_str_mv 10.1007/s11277-018-5955-3
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2188236800</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2188236800</sourcerecordid><originalsourceid>FETCH-LOGICAL-c316t-80bd6bbfeeaab078b01ab0ec75ee1e4d6788a0303953f015870831f1a883efdb3</originalsourceid><addsrcrecordid>eNp1kE1LAzEQhoMoWKs_wFvAc3SSdDfZY1v8KBQUWsFbSHYn7Zbuh0kV_PemruDJ08DwPO8MLyHXHG45gLqLnAulGHDNsiLLmDwhI54pwbScvJ2SERSiYLng4pxcxLgDSFYhRqR6CZ2zrt7X8VCXdNr3obPllqZ1iTHW7Yauyi02SGc2YkW7ls6W09Wa-i7QRZPozx8GbUjWqscj4-ms7uo2IY1NqfGSnHm7j3j1O8fk9eF-PX9iy-fHxXy6ZKXk-YFpcFXunEe01oHSDniaWKoMkeOkypXWFiTIIpMeeKYVaMk9t1pL9JWTY3Iz5Ka33j8wHsyu-whtOmkE11rIXAMkig9UGboYA3rTh7qx4ctwMMcyzVCmSWWaY5lGJkcMTkxsu8Hwl_y_9A1POXeT</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2188236800</pqid></control><display><type>article</type><title>Probabilistic Approach Processing Scheme Based on BLAST for Improving Search Speed of Bioinformatics</title><source>Springer Nature - Complete Springer Journals</source><creator>Jeong, Yoon-Su ; Shin, Seung-Soo</creator><creatorcontrib>Jeong, Yoon-Su ; Shin, Seung-Soo</creatorcontrib><description>As researchers on bioinformatics using heuristic algorithms have been increasingly studied, information management used in various bioinformatics fields (new drug development, medical diagnosis, agricultural product improvement, etc.) has been studied mainly on BLAST algorithm. However, many of the algorithms that are being used in the large genome database use a complete sorting procedure, which takes a lot of time to search the database for proteins or nucleic acid sequences, which causes many problems in processing large amounts of bio information. We propose a BLAST-based probabilistic access processing method that can manage, analyze and process a large amount of bio data distributed based on information communication infrastructure and IT technology. The proposed method aims to improve the accessibility of data by linking weighted bioinformatics information with probability factors to easily access large capacity bio data. In addition, the proposed scheme classifies the priority information allocated to the bioinformatics information by hierarchical grouping according to the degree of similarity, thereby ensuring high accuracy of the search results of the bioinformatics information, and at the same time, the goal is to obtain low processing time by classifying information (type, attribute, priority, etc.) into weights by property. Previous researchers have suggested clustering algorithms for fragmentation of genetic information to solve the problem of haplotype assembly in genetics, or proposed particle swarm optimization methods similar to existing genetic algorithms using heuristic clustering method based on MEC model. In the performance evaluation, the proposed method improved the accuracy by average 13.5% and the efficiency of the data retrieval by average 19.7% more than previous scheme. The overhead of Bioinformatics information processing was 8.8% lower and the processing time was average 13.5% lower.</description><identifier>ISSN: 0929-6212</identifier><identifier>EISSN: 1572-834X</identifier><identifier>DOI: 10.1007/s11277-018-5955-3</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Agricultural management ; Algorithms ; Bioinformatics ; Clustering ; Communications Engineering ; Computer Communication Networks ; Data processing ; Data retrieval ; Engineering ; Genetic algorithms ; Heuristic methods ; Information management ; Networks ; Particle swarm optimization ; Performance evaluation ; Probabilistic methods ; Proteins ; Researchers ; Searching ; Signal,Image and Speech Processing ; Statistical analysis</subject><ispartof>Wireless personal communications, 2019-03, Vol.105 (2), p.405-426</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2018</rights><rights>Copyright Springer Nature B.V. 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c316t-80bd6bbfeeaab078b01ab0ec75ee1e4d6788a0303953f015870831f1a883efdb3</citedby><cites>FETCH-LOGICAL-c316t-80bd6bbfeeaab078b01ab0ec75ee1e4d6788a0303953f015870831f1a883efdb3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11277-018-5955-3$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11277-018-5955-3$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Jeong, Yoon-Su</creatorcontrib><creatorcontrib>Shin, Seung-Soo</creatorcontrib><title>Probabilistic Approach Processing Scheme Based on BLAST for Improving Search Speed of Bioinformatics</title><title>Wireless personal communications</title><addtitle>Wireless Pers Commun</addtitle><description>As researchers on bioinformatics using heuristic algorithms have been increasingly studied, information management used in various bioinformatics fields (new drug development, medical diagnosis, agricultural product improvement, etc.) has been studied mainly on BLAST algorithm. However, many of the algorithms that are being used in the large genome database use a complete sorting procedure, which takes a lot of time to search the database for proteins or nucleic acid sequences, which causes many problems in processing large amounts of bio information. We propose a BLAST-based probabilistic access processing method that can manage, analyze and process a large amount of bio data distributed based on information communication infrastructure and IT technology. The proposed method aims to improve the accessibility of data by linking weighted bioinformatics information with probability factors to easily access large capacity bio data. In addition, the proposed scheme classifies the priority information allocated to the bioinformatics information by hierarchical grouping according to the degree of similarity, thereby ensuring high accuracy of the search results of the bioinformatics information, and at the same time, the goal is to obtain low processing time by classifying information (type, attribute, priority, etc.) into weights by property. Previous researchers have suggested clustering algorithms for fragmentation of genetic information to solve the problem of haplotype assembly in genetics, or proposed particle swarm optimization methods similar to existing genetic algorithms using heuristic clustering method based on MEC model. In the performance evaluation, the proposed method improved the accuracy by average 13.5% and the efficiency of the data retrieval by average 19.7% more than previous scheme. The overhead of Bioinformatics information processing was 8.8% lower and the processing time was average 13.5% lower.</description><subject>Agricultural management</subject><subject>Algorithms</subject><subject>Bioinformatics</subject><subject>Clustering</subject><subject>Communications Engineering</subject><subject>Computer Communication Networks</subject><subject>Data processing</subject><subject>Data retrieval</subject><subject>Engineering</subject><subject>Genetic algorithms</subject><subject>Heuristic methods</subject><subject>Information management</subject><subject>Networks</subject><subject>Particle swarm optimization</subject><subject>Performance evaluation</subject><subject>Probabilistic methods</subject><subject>Proteins</subject><subject>Researchers</subject><subject>Searching</subject><subject>Signal,Image and Speech Processing</subject><subject>Statistical analysis</subject><issn>0929-6212</issn><issn>1572-834X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp1kE1LAzEQhoMoWKs_wFvAc3SSdDfZY1v8KBQUWsFbSHYn7Zbuh0kV_PemruDJ08DwPO8MLyHXHG45gLqLnAulGHDNsiLLmDwhI54pwbScvJ2SERSiYLng4pxcxLgDSFYhRqR6CZ2zrt7X8VCXdNr3obPllqZ1iTHW7Yauyi02SGc2YkW7ls6W09Wa-i7QRZPozx8GbUjWqscj4-ms7uo2IY1NqfGSnHm7j3j1O8fk9eF-PX9iy-fHxXy6ZKXk-YFpcFXunEe01oHSDniaWKoMkeOkypXWFiTIIpMeeKYVaMk9t1pL9JWTY3Iz5Ka33j8wHsyu-whtOmkE11rIXAMkig9UGboYA3rTh7qx4ctwMMcyzVCmSWWaY5lGJkcMTkxsu8Hwl_y_9A1POXeT</recordid><startdate>20190330</startdate><enddate>20190330</enddate><creator>Jeong, Yoon-Su</creator><creator>Shin, Seung-Soo</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20190330</creationdate><title>Probabilistic Approach Processing Scheme Based on BLAST for Improving Search Speed of Bioinformatics</title><author>Jeong, Yoon-Su ; Shin, Seung-Soo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c316t-80bd6bbfeeaab078b01ab0ec75ee1e4d6788a0303953f015870831f1a883efdb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Agricultural management</topic><topic>Algorithms</topic><topic>Bioinformatics</topic><topic>Clustering</topic><topic>Communications Engineering</topic><topic>Computer Communication Networks</topic><topic>Data processing</topic><topic>Data retrieval</topic><topic>Engineering</topic><topic>Genetic algorithms</topic><topic>Heuristic methods</topic><topic>Information management</topic><topic>Networks</topic><topic>Particle swarm optimization</topic><topic>Performance evaluation</topic><topic>Probabilistic methods</topic><topic>Proteins</topic><topic>Researchers</topic><topic>Searching</topic><topic>Signal,Image and Speech Processing</topic><topic>Statistical analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jeong, Yoon-Su</creatorcontrib><creatorcontrib>Shin, Seung-Soo</creatorcontrib><collection>CrossRef</collection><jtitle>Wireless personal communications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jeong, Yoon-Su</au><au>Shin, Seung-Soo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Probabilistic Approach Processing Scheme Based on BLAST for Improving Search Speed of Bioinformatics</atitle><jtitle>Wireless personal communications</jtitle><stitle>Wireless Pers Commun</stitle><date>2019-03-30</date><risdate>2019</risdate><volume>105</volume><issue>2</issue><spage>405</spage><epage>426</epage><pages>405-426</pages><issn>0929-6212</issn><eissn>1572-834X</eissn><abstract>As researchers on bioinformatics using heuristic algorithms have been increasingly studied, information management used in various bioinformatics fields (new drug development, medical diagnosis, agricultural product improvement, etc.) has been studied mainly on BLAST algorithm. However, many of the algorithms that are being used in the large genome database use a complete sorting procedure, which takes a lot of time to search the database for proteins or nucleic acid sequences, which causes many problems in processing large amounts of bio information. We propose a BLAST-based probabilistic access processing method that can manage, analyze and process a large amount of bio data distributed based on information communication infrastructure and IT technology. The proposed method aims to improve the accessibility of data by linking weighted bioinformatics information with probability factors to easily access large capacity bio data. In addition, the proposed scheme classifies the priority information allocated to the bioinformatics information by hierarchical grouping according to the degree of similarity, thereby ensuring high accuracy of the search results of the bioinformatics information, and at the same time, the goal is to obtain low processing time by classifying information (type, attribute, priority, etc.) into weights by property. Previous researchers have suggested clustering algorithms for fragmentation of genetic information to solve the problem of haplotype assembly in genetics, or proposed particle swarm optimization methods similar to existing genetic algorithms using heuristic clustering method based on MEC model. In the performance evaluation, the proposed method improved the accuracy by average 13.5% and the efficiency of the data retrieval by average 19.7% more than previous scheme. The overhead of Bioinformatics information processing was 8.8% lower and the processing time was average 13.5% lower.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11277-018-5955-3</doi><tpages>22</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0929-6212
ispartof Wireless personal communications, 2019-03, Vol.105 (2), p.405-426
issn 0929-6212
1572-834X
language eng
recordid cdi_proquest_journals_2188236800
source Springer Nature - Complete Springer Journals
subjects Agricultural management
Algorithms
Bioinformatics
Clustering
Communications Engineering
Computer Communication Networks
Data processing
Data retrieval
Engineering
Genetic algorithms
Heuristic methods
Information management
Networks
Particle swarm optimization
Performance evaluation
Probabilistic methods
Proteins
Researchers
Searching
Signal,Image and Speech Processing
Statistical analysis
title Probabilistic Approach Processing Scheme Based on BLAST for Improving Search Speed of Bioinformatics
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T17%3A08%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Probabilistic%20Approach%20Processing%20Scheme%20Based%20on%20BLAST%20for%20Improving%20Search%20Speed%20of%20Bioinformatics&rft.jtitle=Wireless%20personal%20communications&rft.au=Jeong,%20Yoon-Su&rft.date=2019-03-30&rft.volume=105&rft.issue=2&rft.spage=405&rft.epage=426&rft.pages=405-426&rft.issn=0929-6212&rft.eissn=1572-834X&rft_id=info:doi/10.1007/s11277-018-5955-3&rft_dat=%3Cproquest_cross%3E2188236800%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2188236800&rft_id=info:pmid/&rfr_iscdi=true