Soft Fuzzy Set Approach for Mining Frequent Amino Acid Associations in Peptide Sequences of Dengue Virus
Association analysis of amino acids in molecular sequences can reveal crucial information and knowledge for understanding structure, function and interaction of proteins. The traditional methods of association rule mining like apriori, F-P Growth etc. fail to generate appropriate patterns due to inh...
Gespeichert in:
Veröffentlicht in: | Proceedings of the National Academy of Sciences, India, Section A, physical sciences India, Section A, physical sciences, 2018-12, Vol.88 (4), p.529-538 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 538 |
---|---|
container_issue | 4 |
container_start_page | 529 |
container_title | Proceedings of the National Academy of Sciences, India, Section A, physical sciences |
container_volume | 88 |
creator | Gour, Alekh Pardasani, K. R. |
description | Association analysis of amino acids in molecular sequences can reveal crucial information and knowledge for understanding structure, function and interaction of proteins. The traditional methods of association rule mining like apriori, F-P Growth etc. fail to generate appropriate patterns due to inherent uncertainty present in data. The uncertainty in sequence data caused by variation in the length of sequences and lack of parameterization lead to under prediction and over prediction of the results. In this paper an attempt has been made to develop a soft set based approach for mining fuzzy association patterns in peptide sequences of dengue virus. The fuzzy set approach is employed to incorporate the degree of relationships among amino acids due to variation in length of the sequences. The soft set approach is employed to incorporate the relationship of parameters with amino acid association patterns. The 12,581 sequences of dengue virus are downloaded from NCBI and screened for redundancy to obtain non redundant 6995 sequences. The amino acid associations are explored and analyzed using soft fuzzy approach. Also the results obtained by soft fuzzy approach are compared with the results obtained individually by ordinary, fuzzy and soft set approaches. The soft fuzzy approach is able to overcome the issue of under prediction and over prediction of the results obtained by other approaches. Also the interesting association rules have been generated to predict the structure and physico chemical properties of the peptide sequences. |
doi_str_mv | 10.1007/s40010-016-0336-3 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2136592758</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2136592758</sourcerecordid><originalsourceid>FETCH-LOGICAL-c268t-82895ff139352cd7a5ca4d49f3db4b750924ad2a9260e0bfd48f06d0373bc63a3</originalsourceid><addsrcrecordid>eNp1kDFPwzAQhS0EElXpD2CzxBw424mTjFGhgFQEUoHVchy7dUXtYidD--txFSQmvNzg77179xC6JnBLAMq7mAMQyIDwDBjjGTtDE0oLyEjJ6TmaAON1VlFgl2gW4xbSK8qa8nyCNitverwYjscDXukeN_t98FJtsPEBv1hn3Rovgv4etEufO-s8bpTtcBOjV1b21ruIrcNvet_bTiePE6p0xN7ge-3Wg8afNgzxCl0Y-RX17HdO0cfi4X3-lC1fH5_nzTJTlFd9SlnVhTGE1aygqitloWTe5bVhXZu3ZQE1zWVHZUoPGlrT5ZUB3gErWas4k2yKbkbfdEeKEnux9UNwaaWghPGipmVRJYqMlAo-xqCN2Ae7k-EgCIhTp2LsVKROxalTwZKGjpqYWLfW4c_5f9EPkLl4uA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2136592758</pqid></control><display><type>article</type><title>Soft Fuzzy Set Approach for Mining Frequent Amino Acid Associations in Peptide Sequences of Dengue Virus</title><source>Springer Nature - Complete Springer Journals</source><creator>Gour, Alekh ; Pardasani, K. R.</creator><creatorcontrib>Gour, Alekh ; Pardasani, K. R.</creatorcontrib><description>Association analysis of amino acids in molecular sequences can reveal crucial information and knowledge for understanding structure, function and interaction of proteins. The traditional methods of association rule mining like apriori, F-P Growth etc. fail to generate appropriate patterns due to inherent uncertainty present in data. The uncertainty in sequence data caused by variation in the length of sequences and lack of parameterization lead to under prediction and over prediction of the results. In this paper an attempt has been made to develop a soft set based approach for mining fuzzy association patterns in peptide sequences of dengue virus. The fuzzy set approach is employed to incorporate the degree of relationships among amino acids due to variation in length of the sequences. The soft set approach is employed to incorporate the relationship of parameters with amino acid association patterns. The 12,581 sequences of dengue virus are downloaded from NCBI and screened for redundancy to obtain non redundant 6995 sequences. The amino acid associations are explored and analyzed using soft fuzzy approach. Also the results obtained by soft fuzzy approach are compared with the results obtained individually by ordinary, fuzzy and soft set approaches. The soft fuzzy approach is able to overcome the issue of under prediction and over prediction of the results obtained by other approaches. Also the interesting association rules have been generated to predict the structure and physico chemical properties of the peptide sequences.</description><identifier>ISSN: 0369-8203</identifier><identifier>EISSN: 2250-1762</identifier><identifier>DOI: 10.1007/s40010-016-0336-3</identifier><language>eng</language><publisher>New Delhi: Springer India</publisher><subject>Amino acids ; Applied and Technical Physics ; Associations ; Atomic ; Chemical properties ; Data mining ; Dengue fever ; Fuzzy sets ; Molecular ; Optical and Plasma Physics ; Organic chemistry ; Parameterization ; Peptides ; Physics ; Physics and Astronomy ; Proteins ; Quantum Physics ; Redundancy ; Research Article ; Sequences ; Uncertainty ; Viral diseases ; Viruses</subject><ispartof>Proceedings of the National Academy of Sciences, India, Section A, physical sciences, 2018-12, Vol.88 (4), p.529-538</ispartof><rights>The National Academy of Sciences, India 2017</rights><rights>Copyright Springer Science & Business Media 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c268t-82895ff139352cd7a5ca4d49f3db4b750924ad2a9260e0bfd48f06d0373bc63a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s40010-016-0336-3$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s40010-016-0336-3$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,314,776,780,785,786,23909,23910,25118,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Gour, Alekh</creatorcontrib><creatorcontrib>Pardasani, K. R.</creatorcontrib><title>Soft Fuzzy Set Approach for Mining Frequent Amino Acid Associations in Peptide Sequences of Dengue Virus</title><title>Proceedings of the National Academy of Sciences, India, Section A, physical sciences</title><addtitle>Proc. Natl. Acad. Sci., India, Sect. A Phys. Sci</addtitle><description>Association analysis of amino acids in molecular sequences can reveal crucial information and knowledge for understanding structure, function and interaction of proteins. The traditional methods of association rule mining like apriori, F-P Growth etc. fail to generate appropriate patterns due to inherent uncertainty present in data. The uncertainty in sequence data caused by variation in the length of sequences and lack of parameterization lead to under prediction and over prediction of the results. In this paper an attempt has been made to develop a soft set based approach for mining fuzzy association patterns in peptide sequences of dengue virus. The fuzzy set approach is employed to incorporate the degree of relationships among amino acids due to variation in length of the sequences. The soft set approach is employed to incorporate the relationship of parameters with amino acid association patterns. The 12,581 sequences of dengue virus are downloaded from NCBI and screened for redundancy to obtain non redundant 6995 sequences. The amino acid associations are explored and analyzed using soft fuzzy approach. Also the results obtained by soft fuzzy approach are compared with the results obtained individually by ordinary, fuzzy and soft set approaches. The soft fuzzy approach is able to overcome the issue of under prediction and over prediction of the results obtained by other approaches. Also the interesting association rules have been generated to predict the structure and physico chemical properties of the peptide sequences.</description><subject>Amino acids</subject><subject>Applied and Technical Physics</subject><subject>Associations</subject><subject>Atomic</subject><subject>Chemical properties</subject><subject>Data mining</subject><subject>Dengue fever</subject><subject>Fuzzy sets</subject><subject>Molecular</subject><subject>Optical and Plasma Physics</subject><subject>Organic chemistry</subject><subject>Parameterization</subject><subject>Peptides</subject><subject>Physics</subject><subject>Physics and Astronomy</subject><subject>Proteins</subject><subject>Quantum Physics</subject><subject>Redundancy</subject><subject>Research Article</subject><subject>Sequences</subject><subject>Uncertainty</subject><subject>Viral diseases</subject><subject>Viruses</subject><issn>0369-8203</issn><issn>2250-1762</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp1kDFPwzAQhS0EElXpD2CzxBw424mTjFGhgFQEUoHVchy7dUXtYidD--txFSQmvNzg77179xC6JnBLAMq7mAMQyIDwDBjjGTtDE0oLyEjJ6TmaAON1VlFgl2gW4xbSK8qa8nyCNitverwYjscDXukeN_t98FJtsPEBv1hn3Rovgv4etEufO-s8bpTtcBOjV1b21ruIrcNvet_bTiePE6p0xN7ge-3Wg8afNgzxCl0Y-RX17HdO0cfi4X3-lC1fH5_nzTJTlFd9SlnVhTGE1aygqitloWTe5bVhXZu3ZQE1zWVHZUoPGlrT5ZUB3gErWas4k2yKbkbfdEeKEnux9UNwaaWghPGipmVRJYqMlAo-xqCN2Ae7k-EgCIhTp2LsVKROxalTwZKGjpqYWLfW4c_5f9EPkLl4uA</recordid><startdate>20181201</startdate><enddate>20181201</enddate><creator>Gour, Alekh</creator><creator>Pardasani, K. R.</creator><general>Springer India</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20181201</creationdate><title>Soft Fuzzy Set Approach for Mining Frequent Amino Acid Associations in Peptide Sequences of Dengue Virus</title><author>Gour, Alekh ; Pardasani, K. R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c268t-82895ff139352cd7a5ca4d49f3db4b750924ad2a9260e0bfd48f06d0373bc63a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Amino acids</topic><topic>Applied and Technical Physics</topic><topic>Associations</topic><topic>Atomic</topic><topic>Chemical properties</topic><topic>Data mining</topic><topic>Dengue fever</topic><topic>Fuzzy sets</topic><topic>Molecular</topic><topic>Optical and Plasma Physics</topic><topic>Organic chemistry</topic><topic>Parameterization</topic><topic>Peptides</topic><topic>Physics</topic><topic>Physics and Astronomy</topic><topic>Proteins</topic><topic>Quantum Physics</topic><topic>Redundancy</topic><topic>Research Article</topic><topic>Sequences</topic><topic>Uncertainty</topic><topic>Viral diseases</topic><topic>Viruses</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gour, Alekh</creatorcontrib><creatorcontrib>Pardasani, K. R.</creatorcontrib><collection>CrossRef</collection><jtitle>Proceedings of the National Academy of Sciences, India, Section A, physical sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gour, Alekh</au><au>Pardasani, K. R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Soft Fuzzy Set Approach for Mining Frequent Amino Acid Associations in Peptide Sequences of Dengue Virus</atitle><jtitle>Proceedings of the National Academy of Sciences, India, Section A, physical sciences</jtitle><stitle>Proc. Natl. Acad. Sci., India, Sect. A Phys. Sci</stitle><date>2018-12-01</date><risdate>2018</risdate><volume>88</volume><issue>4</issue><spage>529</spage><epage>538</epage><pages>529-538</pages><issn>0369-8203</issn><eissn>2250-1762</eissn><abstract>Association analysis of amino acids in molecular sequences can reveal crucial information and knowledge for understanding structure, function and interaction of proteins. The traditional methods of association rule mining like apriori, F-P Growth etc. fail to generate appropriate patterns due to inherent uncertainty present in data. The uncertainty in sequence data caused by variation in the length of sequences and lack of parameterization lead to under prediction and over prediction of the results. In this paper an attempt has been made to develop a soft set based approach for mining fuzzy association patterns in peptide sequences of dengue virus. The fuzzy set approach is employed to incorporate the degree of relationships among amino acids due to variation in length of the sequences. The soft set approach is employed to incorporate the relationship of parameters with amino acid association patterns. The 12,581 sequences of dengue virus are downloaded from NCBI and screened for redundancy to obtain non redundant 6995 sequences. The amino acid associations are explored and analyzed using soft fuzzy approach. Also the results obtained by soft fuzzy approach are compared with the results obtained individually by ordinary, fuzzy and soft set approaches. The soft fuzzy approach is able to overcome the issue of under prediction and over prediction of the results obtained by other approaches. Also the interesting association rules have been generated to predict the structure and physico chemical properties of the peptide sequences.</abstract><cop>New Delhi</cop><pub>Springer India</pub><doi>10.1007/s40010-016-0336-3</doi><tpages>10</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0369-8203 |
ispartof | Proceedings of the National Academy of Sciences, India, Section A, physical sciences, 2018-12, Vol.88 (4), p.529-538 |
issn | 0369-8203 2250-1762 |
language | eng |
recordid | cdi_proquest_journals_2136592758 |
source | Springer Nature - Complete Springer Journals |
subjects | Amino acids Applied and Technical Physics Associations Atomic Chemical properties Data mining Dengue fever Fuzzy sets Molecular Optical and Plasma Physics Organic chemistry Parameterization Peptides Physics Physics and Astronomy Proteins Quantum Physics Redundancy Research Article Sequences Uncertainty Viral diseases Viruses |
title | Soft Fuzzy Set Approach for Mining Frequent Amino Acid Associations in Peptide Sequences of Dengue Virus |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T14%3A21%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Soft%20Fuzzy%20Set%20Approach%20for%20Mining%20Frequent%20Amino%20Acid%20Associations%20in%20Peptide%20Sequences%20of%20Dengue%20Virus&rft.jtitle=Proceedings%20of%20the%20National%20Academy%20of%20Sciences,%20India,%20Section%20A,%20physical%20sciences&rft.au=Gour,%20Alekh&rft.date=2018-12-01&rft.volume=88&rft.issue=4&rft.spage=529&rft.epage=538&rft.pages=529-538&rft.issn=0369-8203&rft.eissn=2250-1762&rft_id=info:doi/10.1007/s40010-016-0336-3&rft_dat=%3Cproquest_cross%3E2136592758%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2136592758&rft_id=info:pmid/&rfr_iscdi=true |