Formulas of High MW Unknown Compounds from Accurate Mass Differences and Ranking of Best Candidates from First Principles
The number of possible candidate formulas for high molecular weight unknown compounds (e.g., 7000–8000 Da for common 20-mer oligonucleotides) by high-resolution mass spectrometry is in the order of several hundred thousand even at the highest level of experimental accuracy. In demanding analytical a...
Gespeichert in:
Veröffentlicht in: | Analytical chemistry (Washington) 2024-07, Vol.96 (28), p.11216-11225 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 11225 |
---|---|
container_issue | 28 |
container_start_page | 11216 |
container_title | Analytical chemistry (Washington) |
container_volume | 96 |
creator | Roussis, Stilianos G. |
description | The number of possible candidate formulas for high molecular weight unknown compounds (e.g., 7000–8000 Da for common 20-mer oligonucleotides) by high-resolution mass spectrometry is in the order of several hundred thousand even at the highest level of experimental accuracy. In demanding analytical applications involving new chemistries and synthetic routes where little is known about the chemical nature or mechanisms of formation of the unknown compounds (e.g., impurities), the generation of a short list of the most plausible formulas would be highly desirable. Such an approach has been developed in the current work. The concept of mass difference from a reference compound is introduced to simplify the approach and greatly reduce the number of possible formulas. The approach allows for the generation of candidate formulas by both the addition and subtraction of atoms to account for all possible molecular changes from the parent compound. A reduction of 3 orders of magnitude in the number of possible formulas has been achieved by the approach. Ranking of the formulas by the product of the sums of the absolute changes in the total number of all atoms and all heteroatoms in the proposed difference formula successfully ranked the correct formula within the top 10 from a list of 200–250 best candidate formulas. There is a tendency for the impurities to be formed involving the least change in the number of atoms and heteroatoms. Δf H o and Δf G′ o values can be used as a complementary ranking system of the top candidates. The approach is applicable to unknowns in any other systems of high MW compounds. |
doi_str_mv | 10.1021/acs.analchem.4c00621 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3074138430</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3074138430</sourcerecordid><originalsourceid>FETCH-LOGICAL-a288t-a751a8b327c50c5e37317f861ad8e91f497e64110855d350e9257473e06423793</originalsourceid><addsrcrecordid>eNqFkc1OGzEURq2qFaSBN6gqS910M-H6b-xZ0rQpSKBWqKjLkfHcAcOMndoZVbx9HSWwYEFXlq7O99nXh5APDBYMODuxLi9ssIO7w3EhHUDN2RsyY4pDVRvD35IZAIiKa4BD8j7newDGgNUH5FCYRjZK8xl5XMU0ToPNNPb0zN_e0cvf9Do8hPg30GUc13EKXaZ9iiM9dW5KdoP00uZMv_q-x4TBYaY2dPTKhgcfbrc9XzBv6LIMfVfwfXrlU5n-TD44vx4wH5F3vR0yHu_PObleffu1PKsufnw_X55eVJYbs6msVsyaG8G1U-AUCi2Y7k3NbGewYb1sNNayLGaU6oQCbLjSUguEWnKhGzEnn3e96xT_TOVl7eizw2GwAeOUW8GUqHX5Jf1_FLRkwkgBBf30Ar2PUyo6tpSRUjNT0DmRO8qlmHPCvl0nP9r02DJotxbbYrF9stjuLZbYx335dDNi9xx60lYA2AHb-PPFr3b-AxvhqYE</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3084471841</pqid></control><display><type>article</type><title>Formulas of High MW Unknown Compounds from Accurate Mass Differences and Ranking of Best Candidates from First Principles</title><source>ACS Publications</source><creator>Roussis, Stilianos G.</creator><creatorcontrib>Roussis, Stilianos G.</creatorcontrib><description>The number of possible candidate formulas for high molecular weight unknown compounds (e.g., 7000–8000 Da for common 20-mer oligonucleotides) by high-resolution mass spectrometry is in the order of several hundred thousand even at the highest level of experimental accuracy. In demanding analytical applications involving new chemistries and synthetic routes where little is known about the chemical nature or mechanisms of formation of the unknown compounds (e.g., impurities), the generation of a short list of the most plausible formulas would be highly desirable. Such an approach has been developed in the current work. The concept of mass difference from a reference compound is introduced to simplify the approach and greatly reduce the number of possible formulas. The approach allows for the generation of candidate formulas by both the addition and subtraction of atoms to account for all possible molecular changes from the parent compound. A reduction of 3 orders of magnitude in the number of possible formulas has been achieved by the approach. Ranking of the formulas by the product of the sums of the absolute changes in the total number of all atoms and all heteroatoms in the proposed difference formula successfully ranked the correct formula within the top 10 from a list of 200–250 best candidate formulas. There is a tendency for the impurities to be formed involving the least change in the number of atoms and heteroatoms. Δf H o and Δf G′ o values can be used as a complementary ranking system of the top candidates. The approach is applicable to unknowns in any other systems of high MW compounds.</description><identifier>ISSN: 0003-2700</identifier><identifier>ISSN: 1520-6882</identifier><identifier>EISSN: 1520-6882</identifier><identifier>DOI: 10.1021/acs.analchem.4c00621</identifier><identifier>PMID: 38949572</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>analytical chemistry ; Atomic properties ; Chemical compounds ; First principles ; Impurities ; Mass spectrometry ; Mass spectroscopy ; Molecular weight ; Oligonucleotides ; Ranking ; Subtraction</subject><ispartof>Analytical chemistry (Washington), 2024-07, Vol.96 (28), p.11216-11225</ispartof><rights>2024 American Chemical Society</rights><rights>Copyright American Chemical Society Jul 16, 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a288t-a751a8b327c50c5e37317f861ad8e91f497e64110855d350e9257473e06423793</cites><orcidid>0000-0002-3704-6403</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/acs.analchem.4c00621$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/acs.analchem.4c00621$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,777,781,2752,27057,27905,27906,56719,56769</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38949572$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Roussis, Stilianos G.</creatorcontrib><title>Formulas of High MW Unknown Compounds from Accurate Mass Differences and Ranking of Best Candidates from First Principles</title><title>Analytical chemistry (Washington)</title><addtitle>Anal. Chem</addtitle><description>The number of possible candidate formulas for high molecular weight unknown compounds (e.g., 7000–8000 Da for common 20-mer oligonucleotides) by high-resolution mass spectrometry is in the order of several hundred thousand even at the highest level of experimental accuracy. In demanding analytical applications involving new chemistries and synthetic routes where little is known about the chemical nature or mechanisms of formation of the unknown compounds (e.g., impurities), the generation of a short list of the most plausible formulas would be highly desirable. Such an approach has been developed in the current work. The concept of mass difference from a reference compound is introduced to simplify the approach and greatly reduce the number of possible formulas. The approach allows for the generation of candidate formulas by both the addition and subtraction of atoms to account for all possible molecular changes from the parent compound. A reduction of 3 orders of magnitude in the number of possible formulas has been achieved by the approach. Ranking of the formulas by the product of the sums of the absolute changes in the total number of all atoms and all heteroatoms in the proposed difference formula successfully ranked the correct formula within the top 10 from a list of 200–250 best candidate formulas. There is a tendency for the impurities to be formed involving the least change in the number of atoms and heteroatoms. Δf H o and Δf G′ o values can be used as a complementary ranking system of the top candidates. The approach is applicable to unknowns in any other systems of high MW compounds.</description><subject>analytical chemistry</subject><subject>Atomic properties</subject><subject>Chemical compounds</subject><subject>First principles</subject><subject>Impurities</subject><subject>Mass spectrometry</subject><subject>Mass spectroscopy</subject><subject>Molecular weight</subject><subject>Oligonucleotides</subject><subject>Ranking</subject><subject>Subtraction</subject><issn>0003-2700</issn><issn>1520-6882</issn><issn>1520-6882</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNqFkc1OGzEURq2qFaSBN6gqS910M-H6b-xZ0rQpSKBWqKjLkfHcAcOMndoZVbx9HSWwYEFXlq7O99nXh5APDBYMODuxLi9ssIO7w3EhHUDN2RsyY4pDVRvD35IZAIiKa4BD8j7newDGgNUH5FCYRjZK8xl5XMU0ToPNNPb0zN_e0cvf9Do8hPg30GUc13EKXaZ9iiM9dW5KdoP00uZMv_q-x4TBYaY2dPTKhgcfbrc9XzBv6LIMfVfwfXrlU5n-TD44vx4wH5F3vR0yHu_PObleffu1PKsufnw_X55eVJYbs6msVsyaG8G1U-AUCi2Y7k3NbGewYb1sNNayLGaU6oQCbLjSUguEWnKhGzEnn3e96xT_TOVl7eizw2GwAeOUW8GUqHX5Jf1_FLRkwkgBBf30Ar2PUyo6tpSRUjNT0DmRO8qlmHPCvl0nP9r02DJotxbbYrF9stjuLZbYx335dDNi9xx60lYA2AHb-PPFr3b-AxvhqYE</recordid><startdate>20240716</startdate><enddate>20240716</enddate><creator>Roussis, Stilianos G.</creator><general>American Chemical Society</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7SC</scope><scope>7SE</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7TM</scope><scope>7U5</scope><scope>7U7</scope><scope>7U9</scope><scope>8BQ</scope><scope>8FD</scope><scope>C1K</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>H94</scope><scope>JG9</scope><scope>JQ2</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P64</scope><scope>7X8</scope><scope>7S9</scope><scope>L.6</scope><orcidid>https://orcid.org/0000-0002-3704-6403</orcidid></search><sort><creationdate>20240716</creationdate><title>Formulas of High MW Unknown Compounds from Accurate Mass Differences and Ranking of Best Candidates from First Principles</title><author>Roussis, Stilianos G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a288t-a751a8b327c50c5e37317f861ad8e91f497e64110855d350e9257473e06423793</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>analytical chemistry</topic><topic>Atomic properties</topic><topic>Chemical compounds</topic><topic>First principles</topic><topic>Impurities</topic><topic>Mass spectrometry</topic><topic>Mass spectroscopy</topic><topic>Molecular weight</topic><topic>Oligonucleotides</topic><topic>Ranking</topic><topic>Subtraction</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Roussis, Stilianos G.</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><collection>AGRICOLA</collection><collection>AGRICOLA - Academic</collection><jtitle>Analytical chemistry (Washington)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Roussis, Stilianos G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Formulas of High MW Unknown Compounds from Accurate Mass Differences and Ranking of Best Candidates from First Principles</atitle><jtitle>Analytical chemistry (Washington)</jtitle><addtitle>Anal. Chem</addtitle><date>2024-07-16</date><risdate>2024</risdate><volume>96</volume><issue>28</issue><spage>11216</spage><epage>11225</epage><pages>11216-11225</pages><issn>0003-2700</issn><issn>1520-6882</issn><eissn>1520-6882</eissn><abstract>The number of possible candidate formulas for high molecular weight unknown compounds (e.g., 7000–8000 Da for common 20-mer oligonucleotides) by high-resolution mass spectrometry is in the order of several hundred thousand even at the highest level of experimental accuracy. In demanding analytical applications involving new chemistries and synthetic routes where little is known about the chemical nature or mechanisms of formation of the unknown compounds (e.g., impurities), the generation of a short list of the most plausible formulas would be highly desirable. Such an approach has been developed in the current work. The concept of mass difference from a reference compound is introduced to simplify the approach and greatly reduce the number of possible formulas. The approach allows for the generation of candidate formulas by both the addition and subtraction of atoms to account for all possible molecular changes from the parent compound. A reduction of 3 orders of magnitude in the number of possible formulas has been achieved by the approach. Ranking of the formulas by the product of the sums of the absolute changes in the total number of all atoms and all heteroatoms in the proposed difference formula successfully ranked the correct formula within the top 10 from a list of 200–250 best candidate formulas. There is a tendency for the impurities to be formed involving the least change in the number of atoms and heteroatoms. Δf H o and Δf G′ o values can be used as a complementary ranking system of the top candidates. The approach is applicable to unknowns in any other systems of high MW compounds.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>38949572</pmid><doi>10.1021/acs.analchem.4c00621</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0002-3704-6403</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0003-2700 |
ispartof | Analytical chemistry (Washington), 2024-07, Vol.96 (28), p.11216-11225 |
issn | 0003-2700 1520-6882 1520-6882 |
language | eng |
recordid | cdi_proquest_miscellaneous_3074138430 |
source | ACS Publications |
subjects | analytical chemistry Atomic properties Chemical compounds First principles Impurities Mass spectrometry Mass spectroscopy Molecular weight Oligonucleotides Ranking Subtraction |
title | Formulas of High MW Unknown Compounds from Accurate Mass Differences and Ranking of Best Candidates from First Principles |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T05%3A37%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Formulas%20of%20High%20MW%20Unknown%20Compounds%20from%20Accurate%20Mass%20Differences%20and%20Ranking%20of%20Best%20Candidates%20from%20First%20Principles&rft.jtitle=Analytical%20chemistry%20(Washington)&rft.au=Roussis,%20Stilianos%20G.&rft.date=2024-07-16&rft.volume=96&rft.issue=28&rft.spage=11216&rft.epage=11225&rft.pages=11216-11225&rft.issn=0003-2700&rft.eissn=1520-6882&rft_id=info:doi/10.1021/acs.analchem.4c00621&rft_dat=%3Cproquest_cross%3E3074138430%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3084471841&rft_id=info:pmid/38949572&rfr_iscdi=true |