Phylo: a citizen science approach for improving multiple sequence alignment
Comparative genomics, or the study of the relationships of genome structure and function across different species, offers a powerful tool for studying evolution, annotating genomes, and understanding the causes of various genetic disorders. However, aligning multiple sequences of DNA, an essential i...
Gespeichert in:
Veröffentlicht in: | PloS one 2012-03, Vol.7 (3), p.e31362-e31362 |
---|---|
Hauptverfasser: | , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | e31362 |
---|---|
container_issue | 3 |
container_start_page | e31362 |
container_title | PloS one |
container_volume | 7 |
creator | Kawrykow, Alexander Roumanis, Gary Kam, Alfred Kwak, Daniel Leung, Clarence Wu, Chu Zarour, Eleyine Sarmenta, Luis Blanchette, Mathieu Waldispühl, Jérôme |
description | Comparative genomics, or the study of the relationships of genome structure and function across different species, offers a powerful tool for studying evolution, annotating genomes, and understanding the causes of various genetic disorders. However, aligning multiple sequences of DNA, an essential intermediate step for most types of analyses, is a difficult computational task. In parallel, citizen science, an approach that takes advantage of the fact that the human brain is exquisitely tuned to solving specific types of problems, is becoming increasingly popular. There, instances of hard computational problems are dispatched to a crowd of non-expert human game players and solutions are sent back to a central server.
We introduce Phylo, a human-based computing framework applying "crowd sourcing" techniques to solve the Multiple Sequence Alignment (MSA) problem. The key idea of Phylo is to convert the MSA problem into a casual game that can be played by ordinary web users with a minimal prior knowledge of the biological context. We applied this strategy to improve the alignment of the promoters of disease-related genes from up to 44 vertebrate species. Since the launch in November 2010, we received more than 350,000 solutions submitted from more than 12,000 registered users. Our results show that solutions submitted contributed to improving the accuracy of up to 70% of the alignment blocks considered.
We demonstrate that, combined with classical algorithms, crowd computing techniques can be successfully used to help improving the accuracy of MSA. More importantly, we show that an NP-hard computational problem can be embedded in casual game that can be easily played by people without significant scientific training. This suggests that citizen science approaches can be used to exploit the billions of "human-brain peta-flops" of computation that are spent every day playing games. Phylo is available at: http://phylo.cs.mcgill.ca. |
doi_str_mv | 10.1371/journal.pone.0031362 |
format | Article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1323935991</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A477150076</galeid><doaj_id>oai_doaj_org_article_17b0fef978574b429f92d91ae61a9b6e</doaj_id><sourcerecordid>A477150076</sourcerecordid><originalsourceid>FETCH-LOGICAL-c691t-c5471f4b8b7d229b4443c90bbf7be448e3815c626ce6abcfa12545d3b0b29e993</originalsourceid><addsrcrecordid>eNqNkttu1DAQhiMEoqXwBggiIYG42MWnODEXSFXFYUWlIk63lu1MEq-cOI2TivL0eNm02qBeIF_YHn_zjz3-k-QpRmtMc_xm66ehU27d-w7WCFFMObmXHGNByYoTRO8frI-SRyFsEcpowfnD5IgQhklB2XHy-Utz7fzbVKXGjvY3dGkwFjoDqer7wSvTpJUfUtvGzZXt6rSd3Gh7B2mAy2kPOlt3LXTj4-RBpVyAJ_N8kvz48P772afV-cXHzdnp-cpwgceVyViOK6YLnZeECM0Yo0YgratcA2MF0AJnhhNugCttKoVJxrKSaqSJACHoSfJ8r9s7H-TchyAxJVTQTAgcic2eKL3ayn6wrRqupVdW_g34oZZqGK1xIHGuUQWVyIssZ5oRUQlSCqyAYyU0h6j1bq426RZKEx86KLcQXZ50tpG1v5KUCM4FiQKvZoHBx5aFUbY2GHBOdeCnIAXJhUCkyCL54h_y7sfNVK3i_W1X-VjW7DTlKctznCGU80it76DiKKG1JnqmsjG-SHi9SIjMCL_GWk0hyM23r__PXvxcsi8P2AaUG5vg3TRa34UlyPagGXwIA1S3PcZI7ix_0w25s7ycLR_Tnh3-z23SjcfpH0vp-3M</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1323935991</pqid></control><display><type>article</type><title>Phylo: a citizen science approach for improving multiple sequence alignment</title><source>Public Library of Science (PLoS) Journals Open Access</source><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Kawrykow, Alexander ; Roumanis, Gary ; Kam, Alfred ; Kwak, Daniel ; Leung, Clarence ; Wu, Chu ; Zarour, Eleyine ; Sarmenta, Luis ; Blanchette, Mathieu ; Waldispühl, Jérôme</creator><contributor>Michalak, Pawel</contributor><creatorcontrib>Kawrykow, Alexander ; Roumanis, Gary ; Kam, Alfred ; Kwak, Daniel ; Leung, Clarence ; Wu, Chu ; Zarour, Eleyine ; Sarmenta, Luis ; Blanchette, Mathieu ; Waldispühl, Jérôme ; Phylo players ; Michalak, Pawel</creatorcontrib><description>Comparative genomics, or the study of the relationships of genome structure and function across different species, offers a powerful tool for studying evolution, annotating genomes, and understanding the causes of various genetic disorders. However, aligning multiple sequences of DNA, an essential intermediate step for most types of analyses, is a difficult computational task. In parallel, citizen science, an approach that takes advantage of the fact that the human brain is exquisitely tuned to solving specific types of problems, is becoming increasingly popular. There, instances of hard computational problems are dispatched to a crowd of non-expert human game players and solutions are sent back to a central server.
We introduce Phylo, a human-based computing framework applying "crowd sourcing" techniques to solve the Multiple Sequence Alignment (MSA) problem. The key idea of Phylo is to convert the MSA problem into a casual game that can be played by ordinary web users with a minimal prior knowledge of the biological context. We applied this strategy to improve the alignment of the promoters of disease-related genes from up to 44 vertebrate species. Since the launch in November 2010, we received more than 350,000 solutions submitted from more than 12,000 registered users. Our results show that solutions submitted contributed to improving the accuracy of up to 70% of the alignment blocks considered.
We demonstrate that, combined with classical algorithms, crowd computing techniques can be successfully used to help improving the accuracy of MSA. More importantly, we show that an NP-hard computational problem can be embedded in casual game that can be easily played by people without significant scientific training. This suggests that citizen science approaches can be used to exploit the billions of "human-brain peta-flops" of computation that are spent every day playing games. Phylo is available at: http://phylo.cs.mcgill.ca.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0031362</identifier><identifier>PMID: 22412834</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Algorithms ; Alignment ; Analysis ; Animal behavior ; Animals ; Bioinformatics ; Biological evolution ; Biology ; Brain ; Computation ; Computational neuroscience ; Computer Science ; Computers ; Databases, Nucleic Acid ; Deoxyribonucleic acid ; DNA ; Evolution ; Games ; Gene sequencing ; Genetic disorders ; Genetic research ; Genomes ; Genomics ; Genomics - methods ; Humans ; Internet ; Medical research ; Nucleotide sequence ; Personal computers ; Phylogenetics ; Problems ; Sequence Alignment - methods ; Sequence Analysis, DNA - methods ; Software ; Structure-function relationships ; User interface</subject><ispartof>PloS one, 2012-03, Vol.7 (3), p.e31362-e31362</ispartof><rights>COPYRIGHT 2012 Public Library of Science</rights><rights>2012 Kawrykow et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License: https://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>Kawrykow et al. 2012</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c691t-c5471f4b8b7d229b4443c90bbf7be448e3815c626ce6abcfa12545d3b0b29e993</citedby><cites>FETCH-LOGICAL-c691t-c5471f4b8b7d229b4443c90bbf7be448e3815c626ce6abcfa12545d3b0b29e993</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3296692/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3296692/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,2096,2915,23845,27901,27902,53766,53768,79343,79344</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/22412834$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Michalak, Pawel</contributor><creatorcontrib>Kawrykow, Alexander</creatorcontrib><creatorcontrib>Roumanis, Gary</creatorcontrib><creatorcontrib>Kam, Alfred</creatorcontrib><creatorcontrib>Kwak, Daniel</creatorcontrib><creatorcontrib>Leung, Clarence</creatorcontrib><creatorcontrib>Wu, Chu</creatorcontrib><creatorcontrib>Zarour, Eleyine</creatorcontrib><creatorcontrib>Sarmenta, Luis</creatorcontrib><creatorcontrib>Blanchette, Mathieu</creatorcontrib><creatorcontrib>Waldispühl, Jérôme</creatorcontrib><creatorcontrib>Phylo players</creatorcontrib><title>Phylo: a citizen science approach for improving multiple sequence alignment</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>Comparative genomics, or the study of the relationships of genome structure and function across different species, offers a powerful tool for studying evolution, annotating genomes, and understanding the causes of various genetic disorders. However, aligning multiple sequences of DNA, an essential intermediate step for most types of analyses, is a difficult computational task. In parallel, citizen science, an approach that takes advantage of the fact that the human brain is exquisitely tuned to solving specific types of problems, is becoming increasingly popular. There, instances of hard computational problems are dispatched to a crowd of non-expert human game players and solutions are sent back to a central server.
We introduce Phylo, a human-based computing framework applying "crowd sourcing" techniques to solve the Multiple Sequence Alignment (MSA) problem. The key idea of Phylo is to convert the MSA problem into a casual game that can be played by ordinary web users with a minimal prior knowledge of the biological context. We applied this strategy to improve the alignment of the promoters of disease-related genes from up to 44 vertebrate species. Since the launch in November 2010, we received more than 350,000 solutions submitted from more than 12,000 registered users. Our results show that solutions submitted contributed to improving the accuracy of up to 70% of the alignment blocks considered.
We demonstrate that, combined with classical algorithms, crowd computing techniques can be successfully used to help improving the accuracy of MSA. More importantly, we show that an NP-hard computational problem can be embedded in casual game that can be easily played by people without significant scientific training. This suggests that citizen science approaches can be used to exploit the billions of "human-brain peta-flops" of computation that are spent every day playing games. Phylo is available at: http://phylo.cs.mcgill.ca.</description><subject>Algorithms</subject><subject>Alignment</subject><subject>Analysis</subject><subject>Animal behavior</subject><subject>Animals</subject><subject>Bioinformatics</subject><subject>Biological evolution</subject><subject>Biology</subject><subject>Brain</subject><subject>Computation</subject><subject>Computational neuroscience</subject><subject>Computer Science</subject><subject>Computers</subject><subject>Databases, Nucleic Acid</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>Evolution</subject><subject>Games</subject><subject>Gene sequencing</subject><subject>Genetic disorders</subject><subject>Genetic research</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Genomics - methods</subject><subject>Humans</subject><subject>Internet</subject><subject>Medical research</subject><subject>Nucleotide sequence</subject><subject>Personal computers</subject><subject>Phylogenetics</subject><subject>Problems</subject><subject>Sequence Alignment - methods</subject><subject>Sequence Analysis, DNA - methods</subject><subject>Software</subject><subject>Structure-function relationships</subject><subject>User interface</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><sourceid>DOA</sourceid><recordid>eNqNkttu1DAQhiMEoqXwBggiIYG42MWnODEXSFXFYUWlIk63lu1MEq-cOI2TivL0eNm02qBeIF_YHn_zjz3-k-QpRmtMc_xm66ehU27d-w7WCFFMObmXHGNByYoTRO8frI-SRyFsEcpowfnD5IgQhklB2XHy-Utz7fzbVKXGjvY3dGkwFjoDqer7wSvTpJUfUtvGzZXt6rSd3Gh7B2mAy2kPOlt3LXTj4-RBpVyAJ_N8kvz48P772afV-cXHzdnp-cpwgceVyViOK6YLnZeECM0Yo0YgratcA2MF0AJnhhNugCttKoVJxrKSaqSJACHoSfJ8r9s7H-TchyAxJVTQTAgcic2eKL3ayn6wrRqupVdW_g34oZZqGK1xIHGuUQWVyIssZ5oRUQlSCqyAYyU0h6j1bq426RZKEx86KLcQXZ50tpG1v5KUCM4FiQKvZoHBx5aFUbY2GHBOdeCnIAXJhUCkyCL54h_y7sfNVK3i_W1X-VjW7DTlKctznCGU80it76DiKKG1JnqmsjG-SHi9SIjMCL_GWk0hyM23r__PXvxcsi8P2AaUG5vg3TRa34UlyPagGXwIA1S3PcZI7ix_0w25s7ycLR_Tnh3-z23SjcfpH0vp-3M</recordid><startdate>20120307</startdate><enddate>20120307</enddate><creator>Kawrykow, Alexander</creator><creator>Roumanis, Gary</creator><creator>Kam, Alfred</creator><creator>Kwak, Daniel</creator><creator>Leung, Clarence</creator><creator>Wu, Chu</creator><creator>Zarour, Eleyine</creator><creator>Sarmenta, Luis</creator><creator>Blanchette, Mathieu</creator><creator>Waldispühl, Jérôme</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20120307</creationdate><title>Phylo: a citizen science approach for improving multiple sequence alignment</title><author>Kawrykow, Alexander ; Roumanis, Gary ; Kam, Alfred ; Kwak, Daniel ; Leung, Clarence ; Wu, Chu ; Zarour, Eleyine ; Sarmenta, Luis ; Blanchette, Mathieu ; Waldispühl, Jérôme</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c691t-c5471f4b8b7d229b4443c90bbf7be448e3815c626ce6abcfa12545d3b0b29e993</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Algorithms</topic><topic>Alignment</topic><topic>Analysis</topic><topic>Animal behavior</topic><topic>Animals</topic><topic>Bioinformatics</topic><topic>Biological evolution</topic><topic>Biology</topic><topic>Brain</topic><topic>Computation</topic><topic>Computational neuroscience</topic><topic>Computer Science</topic><topic>Computers</topic><topic>Databases, Nucleic Acid</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>Evolution</topic><topic>Games</topic><topic>Gene sequencing</topic><topic>Genetic disorders</topic><topic>Genetic research</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Genomics - methods</topic><topic>Humans</topic><topic>Internet</topic><topic>Medical research</topic><topic>Nucleotide sequence</topic><topic>Personal computers</topic><topic>Phylogenetics</topic><topic>Problems</topic><topic>Sequence Alignment - methods</topic><topic>Sequence Analysis, DNA - methods</topic><topic>Software</topic><topic>Structure-function relationships</topic><topic>User interface</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kawrykow, Alexander</creatorcontrib><creatorcontrib>Roumanis, Gary</creatorcontrib><creatorcontrib>Kam, Alfred</creatorcontrib><creatorcontrib>Kwak, Daniel</creatorcontrib><creatorcontrib>Leung, Clarence</creatorcontrib><creatorcontrib>Wu, Chu</creatorcontrib><creatorcontrib>Zarour, Eleyine</creatorcontrib><creatorcontrib>Sarmenta, Luis</creatorcontrib><creatorcontrib>Blanchette, Mathieu</creatorcontrib><creatorcontrib>Waldispühl, Jérôme</creatorcontrib><creatorcontrib>Phylo players</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Opposing Viewpoints in Context (Gale)</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Proquest Nursing & Allied Health Source</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>Agricultural & Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agricultural Science Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biological Science Database</collection><collection>Engineering Database</collection><collection>Nursing & Allied Health Premium</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials Science Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kawrykow, Alexander</au><au>Roumanis, Gary</au><au>Kam, Alfred</au><au>Kwak, Daniel</au><au>Leung, Clarence</au><au>Wu, Chu</au><au>Zarour, Eleyine</au><au>Sarmenta, Luis</au><au>Blanchette, Mathieu</au><au>Waldispühl, Jérôme</au><au>Michalak, Pawel</au><aucorp>Phylo players</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Phylo: a citizen science approach for improving multiple sequence alignment</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2012-03-07</date><risdate>2012</risdate><volume>7</volume><issue>3</issue><spage>e31362</spage><epage>e31362</epage><pages>e31362-e31362</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>Comparative genomics, or the study of the relationships of genome structure and function across different species, offers a powerful tool for studying evolution, annotating genomes, and understanding the causes of various genetic disorders. However, aligning multiple sequences of DNA, an essential intermediate step for most types of analyses, is a difficult computational task. In parallel, citizen science, an approach that takes advantage of the fact that the human brain is exquisitely tuned to solving specific types of problems, is becoming increasingly popular. There, instances of hard computational problems are dispatched to a crowd of non-expert human game players and solutions are sent back to a central server.
We introduce Phylo, a human-based computing framework applying "crowd sourcing" techniques to solve the Multiple Sequence Alignment (MSA) problem. The key idea of Phylo is to convert the MSA problem into a casual game that can be played by ordinary web users with a minimal prior knowledge of the biological context. We applied this strategy to improve the alignment of the promoters of disease-related genes from up to 44 vertebrate species. Since the launch in November 2010, we received more than 350,000 solutions submitted from more than 12,000 registered users. Our results show that solutions submitted contributed to improving the accuracy of up to 70% of the alignment blocks considered.
We demonstrate that, combined with classical algorithms, crowd computing techniques can be successfully used to help improving the accuracy of MSA. More importantly, we show that an NP-hard computational problem can be embedded in casual game that can be easily played by people without significant scientific training. This suggests that citizen science approaches can be used to exploit the billions of "human-brain peta-flops" of computation that are spent every day playing games. Phylo is available at: http://phylo.cs.mcgill.ca.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>22412834</pmid><doi>10.1371/journal.pone.0031362</doi><tpages>e31362</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1932-6203 |
ispartof | PloS one, 2012-03, Vol.7 (3), p.e31362-e31362 |
issn | 1932-6203 1932-6203 |
language | eng |
recordid | cdi_plos_journals_1323935991 |
source | Public Library of Science (PLoS) Journals Open Access; MEDLINE; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Free Full-Text Journals in Chemistry |
subjects | Algorithms Alignment Analysis Animal behavior Animals Bioinformatics Biological evolution Biology Brain Computation Computational neuroscience Computer Science Computers Databases, Nucleic Acid Deoxyribonucleic acid DNA Evolution Games Gene sequencing Genetic disorders Genetic research Genomes Genomics Genomics - methods Humans Internet Medical research Nucleotide sequence Personal computers Phylogenetics Problems Sequence Alignment - methods Sequence Analysis, DNA - methods Software Structure-function relationships User interface |
title | Phylo: a citizen science approach for improving multiple sequence alignment |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T08%3A40%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Phylo:%20a%20citizen%20science%20approach%20for%20improving%20multiple%20sequence%20alignment&rft.jtitle=PloS%20one&rft.au=Kawrykow,%20Alexander&rft.aucorp=Phylo%20players&rft.date=2012-03-07&rft.volume=7&rft.issue=3&rft.spage=e31362&rft.epage=e31362&rft.pages=e31362-e31362&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0031362&rft_dat=%3Cgale_plos_%3EA477150076%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1323935991&rft_id=info:pmid/22412834&rft_galeid=A477150076&rft_doaj_id=oai_doaj_org_article_17b0fef978574b429f92d91ae61a9b6e&rfr_iscdi=true |