The Merck Gene Index browser : an extensible data integration system for gene finding, gene characterization and EST data mining

To make effective use of the vast amounts of expressed sequence tag (EST) sequence data generated by the Merck-sponsored EST project and other similar efforts, sequences must be organized into gene classes, and scientists must be able to 'mine' the gene class data in the context of related...

Detailed description

Bibliographic details
Published in: Bioinformatics (Oxford, England), 1998, Vol.14 (1), p.2-13
Main authors: ECKMAN, B. A, AARONSON, J. S, BORKOWSKI, J. A, BAILEY, W. J, ELLISTON, K. O, WILLIAMSON, A. R, BLEVINS, R. A
Format: Article
Language: eng
Online access: Full text
container_end_page 13
container_issue 1
container_start_page 2
container_title Bioinformatics (Oxford, England)
container_volume 14
creator ECKMAN, B. A
AARONSON, J. S
BORKOWSKI, J. A
BAILEY, W. J
ELLISTON, K. O
WILLIAMSON, A. R
BLEVINS, R. A
description To make effective use of the vast amounts of expressed sequence tag (EST) sequence data generated by the Merck-sponsored EST project and other similar efforts, sequences must be organized into gene classes, and scientists must be able to 'mine' the gene class data in the context of related genomic data. This paper presents the Merck Gene Index browser, an easily extensible, World Wide Web-based system for mining the Merck Gene Index (MGI) and related genomic data. The MGI is a non-redundant set of clones and sequences, each representing a distinct gene, constructed from all high-quality 3' EST sequences generated by the Merck-sponsored EST project. The MGI browser integrates data from a variety of sources and storage formats, both local and remote, using an eclectic integration strategy, including a federation of relational databases, a local data warehouse and simple hypertext links. Data currently integrated include: LENS cDNA clone and EST data, dbEST protein and non-EST nucleic acid similarity data, WashU sequence chromatograms, Entrez sequence and Medline entries, and UniGene gene clusters. Flatfile sequence data are accessed using the Bioapps server, an internally developed client-server system that supports generic sequence analysis applications. Browser data are retrieved and formatted by means of the Bioinformatics Data Integration Toolkit (B-DIT), a new suite of Perl routines.
doi_str_mv 10.1093/bioinformatics/14.1.2
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_79993386</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>79993386</sourcerecordid><originalsourceid>FETCH-LOGICAL-c417t-c19269ab201ff39700c105d7e8bfb72466f2e3f93d6a5ce78844d88fc473dc5a3</originalsourceid><addsrcrecordid>eNpVkElLBDEQhYMo7j9ByEE8OWO27k68yeAGigfHc0gnlTHandakB5eTP92WHgY8VRX1vXrFQ-iIkiklip_VoQvRd6k1fbD5jIopnbINtEt5WU2EpHRz3RO-g_ZyfiGEFKQot9G2KhgRqtxFP_NnwPeQ7Cu-hgj4Njr4xHXqPjIkfI5NxPDZQ8yhbgA70xscYg-LNLh2Eeev3EOLhzfw4k_uQ3QhLk7HyT6bZGwPKXyPuIkOXz7OxzttiAN6gLa8aTIcruo-erq6nM9uJncP17ezi7uJFbTqJ5YqVipTM0K956oixFJSuApk7euKibL0DLhX3JWmsFBJKYST0ltRcWcLw_fRyXj3LXXvS8i9bkO20DQmQrfMulJKcS7LASxG0KYu5wRev6XQmvSlKdF_yev_yWsqNNVs0B2tDJZ1C26tWkU97I9Xe5OtaXwy0Ya8xhiTQkrJfwH4wJIM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>79993386</pqid></control><display><type>article</type><title>The Merck Gene Index browser : an extensible data integration system for gene finding, gene characterization and EST data mining</title><source>MEDLINE</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Alma/SFX Local Collection</source><creator>ECKMAN, B. A ; AARONSON, J. S ; BORKOWSKI, J. A ; BAILEY, W. J ; ELLISTON, K. O ; WILLIAMSON, A. R ; BLEVINS, R. A</creator><creatorcontrib>ECKMAN, B. A ; AARONSON, J. S ; BORKOWSKI, J. A ; BAILEY, W. J ; ELLISTON, K. O ; WILLIAMSON, A. R ; BLEVINS, R. A</creatorcontrib><description>To make effective use of the vast amounts of expressed sequence tag (EST) sequence data generated by the Merck-sponsored EST project and other similar efforts, sequences must be organized into gene classes, and scientists must be able to 'mine' the gene class data in the context of related genomic data. 
This paper presents the Merck Gene Index browser, an easily extensible, World Wide Web-based system for mining the Merck Gene Index (MGI) and related genomic data. The MGI is a non-redundant set of clones and sequences, each representing a distinct gene, constructed from all high-quality 3' EST sequences generated by the Merck-sponsored EST project. The MGI browser integrates data from a variety of sources and storage formats, both local and remote, using an eclectic integration strategy, including a federation of relational databases, a local data warehouse and simple hypertext links. Data currently integrated include: LENS cDNA clone and EST data, dbEST protein and non-EST nucleic acid similarity data, WashU sequence chromatograms. Entrez sequence and Medline entries, and UniGene gene clusters. Flatfile sequence data are accessed using the Bioapps server, an internally developed client-server system that supports generic sequence analysis applications. Browser data are retrieved and formatted by means of the Bioinformatics Data Integration Toolkit (B-DIT), a new suite of Perl routines.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/14.1.2</identifier><identifier>PMID: 9520496</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Abstracting and Indexing as Topic ; Algorithms ; Biological and medical sciences ; Computer Communication Networks ; Computer Systems ; Database Management Systems ; DNA, Complementary ; Fundamental and applied biological sciences. Psychology ; Gene Expression Regulation ; General aspects ; Genes ; Humans ; Mathematics in biology. Statistical analysis. Models. Metrology. 
Data processing in biology (general aspects) ; Sequence Homology, Amino Acid ; Sequence Homology, Nucleic Acid ; Software</subject><ispartof>Bioinformatics (Oxford, England), 1998, Vol.14 (1), p.2-13</ispartof><rights>1998 INIST-CNRS</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c417t-c19269ab201ff39700c105d7e8bfb72466f2e3f93d6a5ce78844d88fc473dc5a3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4022,27922,27923,27924</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=2284888$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/9520496$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>ECKMAN, B. A</creatorcontrib><creatorcontrib>AARONSON, J. S</creatorcontrib><creatorcontrib>BORKOWSKI, J. A</creatorcontrib><creatorcontrib>BAILEY, W. J</creatorcontrib><creatorcontrib>ELLISTON, K. O</creatorcontrib><creatorcontrib>WILLIAMSON, A. R</creatorcontrib><creatorcontrib>BLEVINS, R. A</creatorcontrib><title>The Merck Gene Index browser : an extensible data integration system for gene finding, gene characterization and EST data mining</title><title>Bioinformatics (Oxford, England)</title><addtitle>Bioinformatics</addtitle><description>To make effective use of the vast amounts of expressed sequence tag (EST) sequence data generated by the Merck-sponsored EST project and other similar efforts, sequences must be organized into gene classes, and scientists must be able to 'mine' the gene class data in the context of related genomic data. 
This paper presents the Merck Gene Index browser, an easily extensible, World Wide Web-based system for mining the Merck Gene Index (MGI) and related genomic data. The MGI is a non-redundant set of clones and sequences, each representing a distinct gene, constructed from all high-quality 3' EST sequences generated by the Merck-sponsored EST project. The MGI browser integrates data from a variety of sources and storage formats, both local and remote, using an eclectic integration strategy, including a federation of relational databases, a local data warehouse and simple hypertext links. Data currently integrated include: LENS cDNA clone and EST data, dbEST protein and non-EST nucleic acid similarity data, WashU sequence chromatograms. Entrez sequence and Medline entries, and UniGene gene clusters. Flatfile sequence data are accessed using the Bioapps server, an internally developed client-server system that supports generic sequence analysis applications. Browser data are retrieved and formatted by means of the Bioinformatics Data Integration Toolkit (B-DIT), a new suite of Perl routines.</description><subject>Abstracting and Indexing as Topic</subject><subject>Algorithms</subject><subject>Biological and medical sciences</subject><subject>Computer Communication Networks</subject><subject>Computer Systems</subject><subject>Database Management Systems</subject><subject>DNA, Complementary</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>Gene Expression Regulation</subject><subject>General aspects</subject><subject>Genes</subject><subject>Humans</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. 
Data processing in biology (general aspects)</subject><subject>Sequence Homology, Amino Acid</subject><subject>Sequence Homology, Nucleic Acid</subject><subject>Software</subject><issn>1367-4803</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1998</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkElLBDEQhYMo7j9ByEE8OWO27k68yeAGigfHc0gnlTHandakB5eTP92WHgY8VRX1vXrFQ-iIkiklip_VoQvRd6k1fbD5jIopnbINtEt5WU2EpHRz3RO-g_ZyfiGEFKQot9G2KhgRqtxFP_NnwPeQ7Cu-hgj4Njr4xHXqPjIkfI5NxPDZQ8yhbgA70xscYg-LNLh2Eeev3EOLhzfw4k_uQ3QhLk7HyT6bZGwPKXyPuIkOXz7OxzttiAN6gLa8aTIcruo-erq6nM9uJncP17ezi7uJFbTqJ5YqVipTM0K956oixFJSuApk7euKibL0DLhX3JWmsFBJKYST0ltRcWcLw_fRyXj3LXXvS8i9bkO20DQmQrfMulJKcS7LASxG0KYu5wRev6XQmvSlKdF_yev_yWsqNNVs0B2tDJZ1C26tWkU97I9Xe5OtaXwy0Ya8xhiTQkrJfwH4wJIM</recordid><startdate>1998</startdate><enddate>1998</enddate><creator>ECKMAN, B. A</creator><creator>AARONSON, J. S</creator><creator>BORKOWSKI, J. A</creator><creator>BAILEY, W. J</creator><creator>ELLISTON, K. O</creator><creator>WILLIAMSON, A. R</creator><creator>BLEVINS, R. A</creator><general>Oxford University Press</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>1998</creationdate><title>The Merck Gene Index browser : an extensible data integration system for gene finding, gene characterization and EST data mining</title><author>ECKMAN, B. A ; AARONSON, J. S ; BORKOWSKI, J. A ; BAILEY, W. J ; ELLISTON, K. O ; WILLIAMSON, A. R ; BLEVINS, R. 
A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c417t-c19269ab201ff39700c105d7e8bfb72466f2e3f93d6a5ce78844d88fc473dc5a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1998</creationdate><topic>Abstracting and Indexing as Topic</topic><topic>Algorithms</topic><topic>Biological and medical sciences</topic><topic>Computer Communication Networks</topic><topic>Computer Systems</topic><topic>Database Management Systems</topic><topic>DNA, Complementary</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>Gene Expression Regulation</topic><topic>General aspects</topic><topic>Genes</topic><topic>Humans</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Sequence Homology, Amino Acid</topic><topic>Sequence Homology, Nucleic Acid</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>ECKMAN, B. A</creatorcontrib><creatorcontrib>AARONSON, J. S</creatorcontrib><creatorcontrib>BORKOWSKI, J. A</creatorcontrib><creatorcontrib>BAILEY, W. J</creatorcontrib><creatorcontrib>ELLISTON, K. O</creatorcontrib><creatorcontrib>WILLIAMSON, A. R</creatorcontrib><creatorcontrib>BLEVINS, R. A</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Bioinformatics (Oxford, England)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>ECKMAN, B. A</au><au>AARONSON, J. S</au><au>BORKOWSKI, J. A</au><au>BAILEY, W. J</au><au>ELLISTON, K. O</au><au>WILLIAMSON, A. R</au><au>BLEVINS, R. 
A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Merck Gene Index browser : an extensible data integration system for gene finding, gene characterization and EST data mining</atitle><jtitle>Bioinformatics (Oxford, England)</jtitle><addtitle>Bioinformatics</addtitle><date>1998</date><risdate>1998</risdate><volume>14</volume><issue>1</issue><spage>2</spage><epage>13</epage><pages>2-13</pages><issn>1367-4803</issn><eissn>1367-4811</eissn><abstract>To make effective use of the vast amounts of expressed sequence tag (EST) sequence data generated by the Merck-sponsored EST project and other similar efforts, sequences must be organized into gene classes, and scientists must be able to 'mine' the gene class data in the context of related genomic data. This paper presents the Merck Gene Index browser, an easily extensible, World Wide Web-based system for mining the Merck Gene Index (MGI) and related genomic data. The MGI is a non-redundant set of clones and sequences, each representing a distinct gene, constructed from all high-quality 3' EST sequences generated by the Merck-sponsored EST project. The MGI browser integrates data from a variety of sources and storage formats, both local and remote, using an eclectic integration strategy, including a federation of relational databases, a local data warehouse and simple hypertext links. Data currently integrated include: LENS cDNA clone and EST data, dbEST protein and non-EST nucleic acid similarity data, WashU sequence chromatograms. Entrez sequence and Medline entries, and UniGene gene clusters. Flatfile sequence data are accessed using the Bioapps server, an internally developed client-server system that supports generic sequence analysis applications. 
Browser data are retrieved and formatted by means of the Bioinformatics Data Integration Toolkit (B-DIT), a new suite of Perl routines.</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>9520496</pmid><doi>10.1093/bioinformatics/14.1.2</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1367-4803
ispartof Bioinformatics (Oxford, England), 1998, Vol.14 (1), p.2-13
issn 1367-4803
1367-4811
language eng
recordid cdi_proquest_miscellaneous_79993386
source MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Alma/SFX Local Collection
subjects Abstracting and Indexing as Topic
Algorithms
Biological and medical sciences
Computer Communication Networks
Computer Systems
Database Management Systems
DNA, Complementary
Fundamental and applied biological sciences. Psychology
Gene Expression Regulation
General aspects
Genes
Humans
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Sequence Homology, Amino Acid
Sequence Homology, Nucleic Acid
Software
title The Merck Gene Index browser : an extensible data integration system for gene finding, gene characterization and EST data mining
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T08%3A36%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Merck%20Gene%20Index%20browser%20:%20an%20extensible%20data%20integration%20system%20for%20gene%20finding,%20gene%20characterization%20and%20EST%20data%20mining&rft.jtitle=Bioinformatics%20(Oxford,%20England)&rft.au=ECKMAN,%20B.%20A&rft.date=1998&rft.volume=14&rft.issue=1&rft.spage=2&rft.epage=13&rft.pages=2-13&rft.issn=1367-4803&rft.eissn=1367-4811&rft_id=info:doi/10.1093/bioinformatics/14.1.2&rft_dat=%3Cproquest_cross%3E79993386%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=79993386&rft_id=info:pmid/9520496&rfr_iscdi=true
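The url field above is an OpenURL in key/encoded-value form, where the `rft.*` keys carry the citation (journal, volume, pages, ISSN) and repeated `rft_id` keys carry identifiers such as the DOI and PMID. As a rough illustration only, the following Python sketch shows how those fields could be read back out of such a link with the standard library. The helper name `parse_openurl` is hypothetical, and `EXAMPLE` is an abbreviated copy of the link above that keeps only some of its parameters.

```python
from urllib.parse import urlsplit, parse_qs


def parse_openurl(url: str) -> dict:
    """Return the rft.* (referent) fields of an OpenURL as a plain dict.

    Hypothetical helper for illustration; not part of any resolver's API.
    """
    query = urlsplit(url).query
    # parse_qs percent-decodes values and collects repeated keys into lists.
    fields = parse_qs(query, keep_blank_values=True)
    # Keep the first value of every 'rft.' key, dropping the prefix.
    referent = {
        key[len("rft."):]: values[0]
        for key, values in fields.items()
        if key.startswith("rft.")
    }
    # rft_id may repeat (DOI, PMID, ...), so keep all of its values.
    referent["ids"] = fields.get("rft_id", [])
    return referent


# Abbreviated copy of the record's link (assumption: trimmed for readability).
EXAMPLE = (
    "https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004"
    "&rft.genre=article"
    "&rft.jtitle=Bioinformatics%20(Oxford,%20England)"
    "&rft.au=ECKMAN,%20B.%20A"
    "&rft.date=1998&rft.volume=14&rft.issue=1&rft.spage=2&rft.epage=13"
    "&rft.issn=1367-4803&rft.eissn=1367-4811"
    "&rft_id=info:doi/10.1093/bioinformatics/14.1.2"
    "&rft_id=info:pmid/9520496"
)

if __name__ == "__main__":
    record = parse_openurl(EXAMPLE)
    print(record["jtitle"], record["volume"], record["spage"], record["epage"])
    print(record["ids"])
```

Keeping `rft_id` as a list rather than a single value is a deliberate choice in this sketch, since the link carries more than one identifier for the same article.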