IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES
The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Patent |
Sprache: | eng ; fre ; ger |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | KOZLOV, ANDREY PETROVICH LOBASHEV, ANDREY VLADIMIROVICH BARANOVA, A. V YANKOVSKY, NIKOLAY KAZIMIROVICH KRUKOVSKAYA, LARISA, LEONIDOVNA |
description | The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based on homology of EST's within each cluster; determining for each cluster the total number of EST's within said cluster; ordering said clusters sequentially based on the number of EST's in each cluster; dividing said ordered clusters into subranges based on the number of EST's per cluster; determining for each cluster subrange obtained from step (e) the number EST's within said cluster which are expressed in said predetermined cell type of interest; calculating according to a normal distribution the number of clusters in each subrange expected to contain a predetermined threshold percentage of EST's expressed in said cell type of interest, wherein said threshold percentage is a percentage from about 10% to about 100%; determining the number of clusters in each subrange observed to contain said predetermined threshold percentage of EST's expressed in said predetermined cell type; and identifying subranges having an observed number of clusters that meet said predetermined threshold percentage greater than the number of clusters expected to meet said predetermined threshold percentage for the subrange according to normal distribution; wherein if the percentage of EST's expressed in said cell type of interest in a cluster identified is equal to or greater than said predetermined threshold percentage, the cluster contains a nucleic acid that is a marker for the cell type of interest. |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP1446757A2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP1446757A2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP1446757A23</originalsourceid><addsrcrecordid>eNrjZHDw9FMI9vTxdPZXCHYOcnX18_RzV3DzD1II8HD18w-JDHDVdQwO9nf2dAxxdVFwjQgIcg0OBrKCXQNDXf2cXYN5GFjTEnOKU3mhNDeDgptriLOHbmpBfnxqcUFicmpeakm8a4ChiYmZuam5o5ExEUoAdr4qnA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES</title><source>esp@cenet</source><creator>KOZLOV, ANDREY PETROVICH ; LOBASHEV, ANDREY VLADIMIROVICH ; BARANOVA, A. V ; YANKOVSKY, NIKOLAY KAZIMIROVICH ; KRUKOVSKAYA, LARISA, LEONIDOVNA</creator><creatorcontrib>KOZLOV, ANDREY PETROVICH ; LOBASHEV, ANDREY VLADIMIROVICH ; BARANOVA, A. V ; YANKOVSKY, NIKOLAY KAZIMIROVICH ; KRUKOVSKAYA, LARISA, LEONIDOVNA</creatorcontrib><description>The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based on homology of EST's within each cluster; determining for each cluster the total number of EST's within said cluster; ordering said clusters sequentially based on the number of EST's in each cluster; dividing said ordered clusters into subranges based on the number of EST's per cluster; determining for each cluster subrange obtained from step (e) the number EST's within said cluster which are expressed in said predetermined cell type of interest; calculating according to a normal distribution the number of clusters in each subrange expected to contain a predetermined threshold percentage of EST's expressed in said cell type of interest, wherein said threshold percentage is a percentage from about 10% to about 100%; determining the number of clusters in each subrange observed to contain said predetermined threshold percentage of EST's expressed in said predetermined cell type; and identifying subranges having an observed number of clusters that meet said predetermined threshold percentage greater than the number of clusters expected to meet said predetermined threshold percentage for the subrange according to normal distribution; wherein if the percentage of EST's expressed in said cell type of interest in a cluster identified is equal to or greater than said predetermined threshold percentage, the cluster contains a nucleic acid that is a marker for the cell type of interest.</description><edition>7</edition><language>eng ; fre ; ger</language><subject>BEER ; BIOCHEMISTRY ; CALCULATING ; CHEMISTRY ; COMPOSITIONS OR TEST PAPERS THEREFOR ; COMPUTING ; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL ORENZYMOLOGICAL PROCESSES ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; ENZYMOLOGY ; INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTEDFOR SPECIFIC APPLICATION FIELDS ; INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIRCHEMICAL OR PHYSICAL PROPERTIES ; MEASURING ; MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEICACIDS OR MICROORGANISMS ; METALLURGY ; MICROBIOLOGY ; MUTATION OR GENETIC ENGINEERING ; PHYSICS ; PROCESSES OF PREPARING SUCH COMPOSITIONS ; SPIRITS ; TESTING ; VINEGAR ; WINE</subject><creationdate>2004</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20040818&DB=EPODOC&CC=EP&NR=1446757A2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76516</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20040818&DB=EPODOC&CC=EP&NR=1446757A2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>KOZLOV, ANDREY PETROVICH</creatorcontrib><creatorcontrib>LOBASHEV, ANDREY VLADIMIROVICH</creatorcontrib><creatorcontrib>BARANOVA, A. V</creatorcontrib><creatorcontrib>YANKOVSKY, NIKOLAY KAZIMIROVICH</creatorcontrib><creatorcontrib>KRUKOVSKAYA, LARISA, LEONIDOVNA</creatorcontrib><title>IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES</title><description>The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based on homology of EST's within each cluster; determining for each cluster the total number of EST's within said cluster; ordering said clusters sequentially based on the number of EST's in each cluster; dividing said ordered clusters into subranges based on the number of EST's per cluster; determining for each cluster subrange obtained from step (e) the number EST's within said cluster which are expressed in said predetermined cell type of interest; calculating according to a normal distribution the number of clusters in each subrange expected to contain a predetermined threshold percentage of EST's expressed in said cell type of interest, wherein said threshold percentage is a percentage from about 10% to about 100%; determining the number of clusters in each subrange observed to contain said predetermined threshold percentage of EST's expressed in said predetermined cell type; and identifying subranges having an observed number of clusters that meet said predetermined threshold percentage greater than the number of clusters expected to meet said predetermined threshold percentage for the subrange according to normal distribution; wherein if the percentage of EST's expressed in said cell type of interest in a cluster identified is equal to or greater than said predetermined threshold percentage, the cluster contains a nucleic acid that is a marker for the cell type of interest.</description><subject>BEER</subject><subject>BIOCHEMISTRY</subject><subject>CALCULATING</subject><subject>CHEMISTRY</subject><subject>COMPOSITIONS OR TEST PAPERS THEREFOR</subject><subject>COMPUTING</subject><subject>CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL ORENZYMOLOGICAL PROCESSES</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>ENZYMOLOGY</subject><subject>INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTEDFOR SPECIFIC APPLICATION FIELDS</subject><subject>INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIRCHEMICAL OR PHYSICAL PROPERTIES</subject><subject>MEASURING</subject><subject>MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEICACIDS OR MICROORGANISMS</subject><subject>METALLURGY</subject><subject>MICROBIOLOGY</subject><subject>MUTATION OR GENETIC ENGINEERING</subject><subject>PHYSICS</subject><subject>PROCESSES OF PREPARING SUCH COMPOSITIONS</subject><subject>SPIRITS</subject><subject>TESTING</subject><subject>VINEGAR</subject><subject>WINE</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2004</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHDw9FMI9vTxdPZXCHYOcnX18_RzV3DzD1II8HD18w-JDHDVdQwO9nf2dAxxdVFwjQgIcg0OBrKCXQNDXf2cXYN5GFjTEnOKU3mhNDeDgptriLOHbmpBfnxqcUFicmpeakm8a4ChiYmZuam5o5ExEUoAdr4qnA</recordid><startdate>20040818</startdate><enddate>20040818</enddate><creator>KOZLOV, ANDREY PETROVICH</creator><creator>LOBASHEV, ANDREY VLADIMIROVICH</creator><creator>BARANOVA, A. V</creator><creator>YANKOVSKY, NIKOLAY KAZIMIROVICH</creator><creator>KRUKOVSKAYA, LARISA, LEONIDOVNA</creator><scope>EVB</scope></search><sort><creationdate>20040818</creationdate><title>IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES</title><author>KOZLOV, ANDREY PETROVICH ; LOBASHEV, ANDREY VLADIMIROVICH ; BARANOVA, A. V ; YANKOVSKY, NIKOLAY KAZIMIROVICH ; KRUKOVSKAYA, LARISA, LEONIDOVNA</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP1446757A23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2004</creationdate><topic>BEER</topic><topic>BIOCHEMISTRY</topic><topic>CALCULATING</topic><topic>CHEMISTRY</topic><topic>COMPOSITIONS OR TEST PAPERS THEREFOR</topic><topic>COMPUTING</topic><topic>CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL ORENZYMOLOGICAL PROCESSES</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>ENZYMOLOGY</topic><topic>INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTEDFOR SPECIFIC APPLICATION FIELDS</topic><topic>INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIRCHEMICAL OR PHYSICAL PROPERTIES</topic><topic>MEASURING</topic><topic>MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEICACIDS OR MICROORGANISMS</topic><topic>METALLURGY</topic><topic>MICROBIOLOGY</topic><topic>MUTATION OR GENETIC ENGINEERING</topic><topic>PHYSICS</topic><topic>PROCESSES OF PREPARING SUCH COMPOSITIONS</topic><topic>SPIRITS</topic><topic>TESTING</topic><topic>VINEGAR</topic><topic>WINE</topic><toplevel>online_resources</toplevel><creatorcontrib>KOZLOV, ANDREY PETROVICH</creatorcontrib><creatorcontrib>LOBASHEV, ANDREY VLADIMIROVICH</creatorcontrib><creatorcontrib>BARANOVA, A. V</creatorcontrib><creatorcontrib>YANKOVSKY, NIKOLAY KAZIMIROVICH</creatorcontrib><creatorcontrib>KRUKOVSKAYA, LARISA, LEONIDOVNA</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>KOZLOV, ANDREY PETROVICH</au><au>LOBASHEV, ANDREY VLADIMIROVICH</au><au>BARANOVA, A. V</au><au>YANKOVSKY, NIKOLAY KAZIMIROVICH</au><au>KRUKOVSKAYA, LARISA, LEONIDOVNA</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES</title><date>2004-08-18</date><risdate>2004</risdate><abstract>The present invention provides methods for determining whether a nucleic acid sequence is a marker for a phenotype or cell type of interest which comprises providing a database of expressed sequence tag sequences (EST's) from the species; placing said EST's in groups termed clusters based on homology of EST's within each cluster; determining for each cluster the total number of EST's within said cluster; ordering said clusters sequentially based on the number of EST's in each cluster; dividing said ordered clusters into subranges based on the number of EST's per cluster; determining for each cluster subrange obtained from step (e) the number EST's within said cluster which are expressed in said predetermined cell type of interest; calculating according to a normal distribution the number of clusters in each subrange expected to contain a predetermined threshold percentage of EST's expressed in said cell type of interest, wherein said threshold percentage is a percentage from about 10% to about 100%; determining the number of clusters in each subrange observed to contain said predetermined threshold percentage of EST's expressed in said predetermined cell type; and identifying subranges having an observed number of clusters that meet said predetermined threshold percentage greater than the number of clusters expected to meet said predetermined threshold percentage for the subrange according to normal distribution; wherein if the percentage of EST's expressed in said cell type of interest in a cluster identified is equal to or greater than said predetermined threshold percentage, the cluster contains a nucleic acid that is a marker for the cell type of interest.</abstract><edition>7</edition><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | eng ; fre ; ger |
recordid | cdi_epo_espacenet_EP1446757A2 |
source | esp@cenet |
subjects | BEER BIOCHEMISTRY CALCULATING CHEMISTRY COMPOSITIONS OR TEST PAPERS THEREFOR COMPUTING CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL ORENZYMOLOGICAL PROCESSES COUNTING ELECTRIC DIGITAL DATA PROCESSING ENZYMOLOGY INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTEDFOR SPECIFIC APPLICATION FIELDS INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIRCHEMICAL OR PHYSICAL PROPERTIES MEASURING MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEICACIDS OR MICROORGANISMS METALLURGY MICROBIOLOGY MUTATION OR GENETIC ENGINEERING PHYSICS PROCESSES OF PREPARING SUCH COMPOSITIONS SPIRITS TESTING VINEGAR WINE |
title | IN SILICO SCREENING FOR PHENOTYPE-ASSOCIATED EXPRESSED SEQUENCES |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T18%3A27%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=KOZLOV,%20ANDREY%20PETROVICH&rft.date=2004-08-18&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP1446757A2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |