Formulas of High MW Unknown Compounds from Accurate Mass Differences and Ranking of Best Candidates from First Principles

The number of possible candidate formulas for high molecular weight unknown compounds (e.g., 7000–8000 Da for common 20-mer oligonucleotides) by high-resolution mass spectrometry is in the order of several hundred thousand even at the highest level of experimental accuracy. In demanding analytical a...


Bibliographic Details
Published in: Analytical chemistry (Washington) 2024-07, Vol.96 (28), p.11216-11225
Main Author: Roussis, Stilianos G.
Format: Article
Language: English
Subjects:
Online Access: Full text
container_end_page 11225
container_issue 28
container_start_page 11216
container_title Analytical chemistry (Washington)
container_volume 96
creator Roussis, Stilianos G.
description The number of possible candidate formulas for high molecular weight unknown compounds (e.g., 7000–8000 Da for common 20-mer oligonucleotides) by high-resolution mass spectrometry is in the order of several hundred thousand even at the highest level of experimental accuracy. In demanding analytical applications involving new chemistries and synthetic routes where little is known about the chemical nature or mechanisms of formation of the unknown compounds (e.g., impurities), the generation of a short list of the most plausible formulas would be highly desirable. Such an approach has been developed in the current work. The concept of mass difference from a reference compound is introduced to simplify the approach and greatly reduce the number of possible formulas. The approach allows for the generation of candidate formulas by both the addition and subtraction of atoms to account for all possible molecular changes from the parent compound. A reduction of 3 orders of magnitude in the number of possible formulas has been achieved by the approach. Ranking of the formulas by the product of the sums of the absolute changes in the total number of all atoms and all heteroatoms in the proposed difference formula successfully ranked the correct formula within the top 10 from a list of 200–250 best candidate formulas. There is a tendency for the impurities to be formed involving the least change in the number of atoms and heteroatoms. ΔfH° and ΔfG′° values can be used as a complementary ranking system of the top candidates. The approach is applicable to unknowns in any other systems of high MW compounds.
doi_str_mv 10.1021/acs.analchem.4c00621
format Article
fulltext fulltext
identifier ISSN: 0003-2700
ispartof Analytical chemistry (Washington), 2024-07, Vol.96 (28), p.11216-11225
issn 0003-2700
1520-6882
language eng
recordid cdi_proquest_miscellaneous_3074138430
source ACS Publications
subjects analytical chemistry
Atomic properties
Chemical compounds
First principles
Impurities
Mass spectrometry
Mass spectroscopy
Molecular weight
Oligonucleotides
Ranking
Subtraction
title Formulas of High MW Unknown Compounds from Accurate Mass Differences and Ranking of Best Candidates from First Principles
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T05%3A37%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Formulas%20of%20High%20MW%20Unknown%20Compounds%20from%20Accurate%20Mass%20Differences%20and%20Ranking%20of%20Best%20Candidates%20from%20First%20Principles&rft.jtitle=Analytical%20chemistry%20(Washington)&rft.au=Roussis,%20Stilianos%20G.&rft.date=2024-07-16&rft.volume=96&rft.issue=28&rft.spage=11216&rft.epage=11225&rft.pages=11216-11225&rft.issn=0003-2700&rft.eissn=1520-6882&rft_id=info:doi/10.1021/acs.analchem.4c00621&rft_dat=%3Cproquest_cross%3E3074138430%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3084471841&rft_id=info:pmid/38949572&rfr_iscdi=true
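
Illustrative sketch (not part of the catalog record or of the article's published code): the abstract describes generating difference formulas by adding and subtracting atoms relative to a reference compound, then ranking them by the product of two sums, the absolute changes over all atoms and over heteroatoms only. The Python below is a minimal reading of that description under stated assumptions. The function names, the per-element search window, the absolute tolerance in Da, and the treatment of every element other than C and H as a heteroatom are all assumptions made for illustration; the article itself may define these differently.

```python
from itertools import product

# Monoisotopic masses (Da) of the most abundant isotopes, rounded to 6 decimals.
MONO_MASS = {
    "C": 12.000000,
    "H": 1.007825,
    "N": 14.003074,
    "O": 15.994915,
    "P": 30.973762,
    "S": 31.972071,
}


def difference_mass(delta):
    """Mass of a difference formula; negative counts mean atoms removed."""
    return sum(MONO_MASS[el] * n for el, n in delta.items())


def candidate_differences(target_diff, max_change, tol_da):
    """Brute-force every difference formula inside the search window.

    max_change maps element -> largest absolute change allowed (assumed
    window); each element count runs from -max to +max, so both additions
    and subtractions of atoms are covered.
    """
    elements = sorted(max_change)
    windows = [range(-max_change[el], max_change[el] + 1) for el in elements]
    hits = []
    for counts in product(*windows):
        delta = {el: n for el, n in zip(elements, counts) if n != 0}
        if abs(difference_mass(delta) - target_diff) <= tol_da:
            hits.append(delta)
    return hits


def change_score(delta):
    """(sum of |change| over all atoms) * (sum of |change| over heteroatoms).

    Heteroatoms are taken here as everything except C and H (assumption).
    """
    total = sum(abs(n) for n in delta.values())
    hetero = sum(abs(n) for el, n in delta.items() if el not in ("C", "H"))
    return total * hetero


def rank_candidates(candidates):
    """Smallest change first, following the tendency noted in the abstract."""
    return sorted(candidates, key=change_score)


# Hypothetical example: an impurity about 15.9772 Da lighter than the parent.
window = {"C": 2, "H": 4, "N": 2, "O": 2, "P": 1, "S": 1}
hits = candidate_differences(-15.9772, window, tol_da=0.005)
for delta in rank_candidates(hits):
    print(change_score(delta), delta)
```

Because the score is a product, any difference formula that changes only C and H scores zero, so ties are likely near the top of the list; the abstract's suggestion of using ΔfH° and ΔfG′° values as a complementary ranking would be one way to separate them, and is not attempted in this sketch.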