IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES

The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based...

Bibliographic details
Main authors: KOZLOV, ANDREY PETROVICH; LOBASHEV, ANDREY VLADIMIROVICH; BARANOVA, A. V; YANKOVSKY, NIKOLAY KAZIMIROVICH; KRUKOVSKAYA, LARISA, LEONIDOVNA
Format: Patent
Language: eng ; fre ; ger
creator KOZLOV, ANDREY PETROVICH
LOBASHEV, ANDREY VLADIMIROVICH
BARANOVA, A. V
YANKOVSKY, NIKOLAY KAZIMIROVICH
KRUKOVSKAYA, LARISA, LEONIDOVNA
description The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest, which comprise: (a) providing a database of expressed sequence tag sequences (EST's) from the species; (b) placing said EST's in groups termed clusters based on homology of EST's within each cluster; (c) determining for each cluster the total number of EST's within said cluster; (d) ordering said clusters sequentially based on the number of EST's in each cluster; (e) dividing said ordered clusters into subranges based on the number of EST's per cluster; (f) determining for each cluster subrange obtained from step (e) the number of EST's within said cluster which are expressed in said predetermined cell type of interest; (g) calculating according to a normal distribution the number of clusters in each subrange expected to contain a predetermined threshold percentage of EST's expressed in said cell type of interest, wherein said threshold percentage is a percentage from about 10% to about 100%; (h) determining the number of clusters in each subrange observed to contain said predetermined threshold percentage of EST's expressed in said predetermined cell type; and (i) identifying subranges having an observed number of clusters that meet said predetermined threshold percentage greater than the number of clusters expected to meet said predetermined threshold percentage for the subrange according to normal distribution; wherein if the percentage of EST's expressed in said cell type of interest in a cluster identified is equal to or greater than said predetermined threshold percentage, the cluster contains a nucleic acid that is a marker for the cell type of interest.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP1446757A2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP1446757A2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP1446757A23</originalsourceid><addsrcrecordid>eNrjZHDw9FMI9vTxdPZXCHYOcnX18_RzV3DzD1II8HD18w-JDHDVdQwO9nf2dAxxdVFwjQgIcg0OBrKCXQNDXf2cXYN5GFjTEnOKU3mhNDeDgptriLOHbmpBfnxqcUFicmpeakm8a4ChiYmZuam5o5ExEUoAdr4qnA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES</title><source>esp@cenet</source><creator>KOZLOV, ANDREY PETROVICH ; LOBASHEV, ANDREY VLADIMIROVICH ; BARANOVA, A. V ; YANKOVSKY, NIKOLAY KAZIMIROVICH ; KRUKOVSKAYA, LARISA, LEONIDOVNA</creator><creatorcontrib>KOZLOV, ANDREY PETROVICH ; LOBASHEV, ANDREY VLADIMIROVICH ; BARANOVA, A. V ; YANKOVSKY, NIKOLAY KAZIMIROVICH ; KRUKOVSKAYA, LARISA, LEONIDOVNA</creatorcontrib><description>The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based on homology of EST's within each cluster; determining for each cluster the total number of EST's within said cluster; ordering said clusters sequentially based on the number of EST's in each cluster; dividing said ordered clusters into subranges based on the number of EST's per cluster; determining for each cluster subrange obtained from step (e) the number EST's within said cluster which are expressed in said predetermined cell type of interest; calculating according to a normal distribution the number of clusters in each subrange expected to contain a predetermined threshold percentage of EST's expressed in said cell type of 
interest, wherein said threshold percentage is a percentage from about 10% to about 100%; determining the number of clusters in each subrange observed to contain said predetermined threshold percentage of EST's expressed in said predetermined cell type; and identifying subranges having an observed number of clusters that meet said predetermined threshold percentage greater than the number of clusters expected to meet said predetermined threshold percentage for the subrange according to normal distribution; wherein if the percentage of EST's expressed in said cell type of interest in a cluster identified is equal to or greater than said predetermined threshold percentage, the cluster contains a nucleic acid that is a marker for the cell type of interest.</description><edition>7</edition><language>eng ; fre ; ger</language><subject>BEER ; BIOCHEMISTRY ; CALCULATING ; CHEMISTRY ; COMPOSITIONS OR TEST PAPERS THEREFOR ; COMPUTING ; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL ORENZYMOLOGICAL PROCESSES ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; ENZYMOLOGY ; INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTEDFOR SPECIFIC APPLICATION FIELDS ; INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIRCHEMICAL OR PHYSICAL PROPERTIES ; MEASURING ; MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEICACIDS OR MICROORGANISMS ; METALLURGY ; MICROBIOLOGY ; MUTATION OR GENETIC ENGINEERING ; PHYSICS ; PROCESSES OF PREPARING SUCH COMPOSITIONS ; SPIRITS ; TESTING ; VINEGAR ; 
WINE</subject><creationdate>2004</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20040818&amp;DB=EPODOC&amp;CC=EP&amp;NR=1446757A2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76516</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20040818&amp;DB=EPODOC&amp;CC=EP&amp;NR=1446757A2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>KOZLOV, ANDREY PETROVICH</creatorcontrib><creatorcontrib>LOBASHEV, ANDREY VLADIMIROVICH</creatorcontrib><creatorcontrib>BARANOVA, A. V</creatorcontrib><creatorcontrib>YANKOVSKY, NIKOLAY KAZIMIROVICH</creatorcontrib><creatorcontrib>KRUKOVSKAYA, LARISA, LEONIDOVNA</creatorcontrib><title>IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES</title><description>The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based on homology of EST's within each cluster; determining for each cluster the total number of EST's within said cluster; ordering said clusters sequentially based on the number of EST's in each cluster; dividing said ordered clusters into subranges based on the number of EST's per cluster; determining for each cluster subrange obtained from step (e) the number EST's within said cluster which are expressed in said predetermined cell type of interest; calculating according to a normal distribution the number of clusters in 
each subrange expected to contain a predetermined threshold percentage of EST's expressed in said cell type of interest, wherein said threshold percentage is a percentage from about 10% to about 100%; determining the number of clusters in each subrange observed to contain said predetermined threshold percentage of EST's expressed in said predetermined cell type; and identifying subranges having an observed number of clusters that meet said predetermined threshold percentage greater than the number of clusters expected to meet said predetermined threshold percentage for the subrange according to normal distribution; wherein if the percentage of EST's expressed in said cell type of interest in a cluster identified is equal to or greater than said predetermined threshold percentage, the cluster contains a nucleic acid that is a marker for the cell type of interest.</description><subject>BEER</subject><subject>BIOCHEMISTRY</subject><subject>CALCULATING</subject><subject>CHEMISTRY</subject><subject>COMPOSITIONS OR TEST PAPERS THEREFOR</subject><subject>COMPUTING</subject><subject>CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL ORENZYMOLOGICAL PROCESSES</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>ENZYMOLOGY</subject><subject>INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTEDFOR SPECIFIC APPLICATION FIELDS</subject><subject>INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIRCHEMICAL OR PHYSICAL PROPERTIES</subject><subject>MEASURING</subject><subject>MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEICACIDS OR MICROORGANISMS</subject><subject>METALLURGY</subject><subject>MICROBIOLOGY</subject><subject>MUTATION OR GENETIC ENGINEERING</subject><subject>PHYSICS</subject><subject>PROCESSES OF PREPARING SUCH 
COMPOSITIONS</subject><subject>SPIRITS</subject><subject>TESTING</subject><subject>VINEGAR</subject><subject>WINE</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2004</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHDw9FMI9vTxdPZXCHYOcnX18_RzV3DzD1II8HD18w-JDHDVdQwO9nf2dAxxdVFwjQgIcg0OBrKCXQNDXf2cXYN5GFjTEnOKU3mhNDeDgptriLOHbmpBfnxqcUFicmpeakm8a4ChiYmZuam5o5ExEUoAdr4qnA</recordid><startdate>20040818</startdate><enddate>20040818</enddate><creator>KOZLOV, ANDREY PETROVICH</creator><creator>LOBASHEV, ANDREY VLADIMIROVICH</creator><creator>BARANOVA, A. V</creator><creator>YANKOVSKY, NIKOLAY KAZIMIROVICH</creator><creator>KRUKOVSKAYA, LARISA, LEONIDOVNA</creator><scope>EVB</scope></search><sort><creationdate>20040818</creationdate><title>IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES</title><author>KOZLOV, ANDREY PETROVICH ; LOBASHEV, ANDREY VLADIMIROVICH ; BARANOVA, A. V ; YANKOVSKY, NIKOLAY KAZIMIROVICH ; KRUKOVSKAYA, LARISA, LEONIDOVNA</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP1446757A23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2004</creationdate><topic>BEER</topic><topic>BIOCHEMISTRY</topic><topic>CALCULATING</topic><topic>CHEMISTRY</topic><topic>COMPOSITIONS OR TEST PAPERS THEREFOR</topic><topic>COMPUTING</topic><topic>CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL ORENZYMOLOGICAL PROCESSES</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>ENZYMOLOGY</topic><topic>INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTEDFOR SPECIFIC APPLICATION FIELDS</topic><topic>INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIRCHEMICAL OR PHYSICAL PROPERTIES</topic><topic>MEASURING</topic><topic>MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEICACIDS OR 
MICROORGANISMS</topic><topic>METALLURGY</topic><topic>MICROBIOLOGY</topic><topic>MUTATION OR GENETIC ENGINEERING</topic><topic>PHYSICS</topic><topic>PROCESSES OF PREPARING SUCH COMPOSITIONS</topic><topic>SPIRITS</topic><topic>TESTING</topic><topic>VINEGAR</topic><topic>WINE</topic><toplevel>online_resources</toplevel><creatorcontrib>KOZLOV, ANDREY PETROVICH</creatorcontrib><creatorcontrib>LOBASHEV, ANDREY VLADIMIROVICH</creatorcontrib><creatorcontrib>BARANOVA, A. V</creatorcontrib><creatorcontrib>YANKOVSKY, NIKOLAY KAZIMIROVICH</creatorcontrib><creatorcontrib>KRUKOVSKAYA, LARISA, LEONIDOVNA</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>KOZLOV, ANDREY PETROVICH</au><au>LOBASHEV, ANDREY VLADIMIROVICH</au><au>BARANOVA, A. V</au><au>YANKOVSKY, NIKOLAY KAZIMIROVICH</au><au>KRUKOVSKAYA, LARISA, LEONIDOVNA</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES</title><date>2004-08-18</date><risdate>2004</risdate><abstract>The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based on homology of EST's within each cluster; determining for each cluster the total number of EST's within said cluster; ordering said clusters sequentially based on the number of EST's in each cluster; dividing said ordered clusters into subranges based on the number of EST's per cluster; determining for each cluster subrange obtained from step (e) the number EST's within said cluster which are expressed in said predetermined cell type of interest; calculating according to a normal distribution the number of clusters in each subrange expected to contain 
a predetermined threshold percentage of EST's expressed in said cell type of interest, wherein said threshold percentage is a percentage from about 10% to about 100%; determining the number of clusters in each subrange observed to contain said predetermined threshold percentage of EST's expressed in said predetermined cell type; and identifying subranges having an observed number of clusters that meet said predetermined threshold percentage greater than the number of clusters expected to meet said predetermined threshold percentage for the subrange according to normal distribution; wherein if the percentage of EST's expressed in said cell type of interest in a cluster identified is equal to or greater than said predetermined threshold percentage, the cluster contains a nucleic acid that is a marker for the cell type of interest.</abstract><edition>7</edition><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
language eng ; fre ; ger
recordid cdi_epo_espacenet_EP1446757A2
source esp@cenet
subjects BEER
BIOCHEMISTRY
CALCULATING
CHEMISTRY
COMPOSITIONS OR TEST PAPERS THEREFOR
COMPUTING
CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
ENZYMOLOGY
INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
MEASURING
MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS
METALLURGY
MICROBIOLOGY
MUTATION OR GENETIC ENGINEERING
PHYSICS
PROCESSES OF PREPARING SUCH COMPOSITIONS
SPIRITS
TESTING
VINEGAR
WINE
title IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T18%3A27%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=KOZLOV,%20ANDREY%20PETROVICH&rft.date=2004-08-18&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP1446757A2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
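
The screening procedure summarised in the description field can be illustrated with a short Python sketch. This is not the patented implementation: the record does not say how the normal-distribution expectation is computed, so the sketch assumes a normal approximation to a binomial model in which every EST comes from the cell type of interest with the database-wide background probability. The names `Cluster`, `screen`, and `bounds`, and the choice of size limits for the subranges, are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass
from math import sqrt
from statistics import NormalDist


@dataclass
class Cluster:
    name: str
    total: int    # total number of ESTs in the cluster
    in_type: int  # ESTs in the cluster that come from the cell type of interest


def screen(clusters, threshold, bounds):
    """Return names of clusters flagged as candidate markers.

    threshold: required fraction of cell-type ESTs (0.1 to 1.0, per the abstract).
    bounds: ascending upper size limits of the subranges (hypothetical binning;
            clusters larger than the last limit are ignored).
    """
    if not 0.1 <= threshold <= 1.0:
        raise ValueError("threshold must lie between 0.1 and 1.0")

    # ASSUMPTION: background probability that any EST stems from the cell type.
    p = sum(c.in_type for c in clusters) / sum(c.total for c in clusters)
    if not 0.0 < p < 1.0:
        raise ValueError("background fraction must be strictly between 0 and 1")

    ordered = sorted(clusters, key=lambda c: c.total)  # order clusters by size
    markers = []
    lower = 0
    for upper in bounds:
        # One subrange: clusters whose size falls in (lower, upper].
        sub = [c for c in ordered if lower < c.total <= upper]
        lower = upper

        # Expected number of clusters meeting the threshold, using the normal
        # approximation N(n*p, n*p*(1-p)) to the count of cell-type ESTs.
        expected = sum(
            1.0 - NormalDist(c.total * p, sqrt(c.total * p * (1 - p))).cdf(threshold * c.total)
            for c in sub
        )

        # Observed clusters meeting the threshold.
        hits = [c for c in sub if c.in_type / c.total >= threshold]

        # Keep the subrange only if observation exceeds expectation.
        if len(hits) > expected:
            markers.extend(c.name for c in hits)
    return markers


if __name__ == "__main__":
    demo = [
        Cluster("A", 10, 9),
        Cluster("B", 10, 1),
        Cluster("C", 12, 0),
        Cluster("D", 100, 10),
        Cluster("E", 120, 12),
    ]
    print(screen(demo, threshold=0.8, bounds=[20, 200]))
```

The sketch omits a continuity correction and treats clusters as independent. A real analysis would need to justify both choices, as well as the way subrange limits are chosen.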