A new twist on a very old binary similarity coefficient
Pairwise similarity coefficients are downward biased when samples only record presences and sampling is partial. A simple but forgotten index proposed by Stephen Forbes in 1907 can help solve this problem. His original equation requires knowing the number of species absent in both samples that could...
Gespeichert in:
Veröffentlicht in: | Ecology (Durham) 2015-02, Vol.96 (2), p.575-586 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 586 |
---|---|
container_issue | 2 |
container_start_page | 575 |
container_title | Ecology (Durham) |
container_volume | 96 |
creator | Alroy, John |
description | Pairwise similarity coefficients are downward biased when samples only record presences and sampling is partial. A simple but forgotten index proposed by Stephen Forbes in 1907 can help solve this problem. His original equation requires knowing the number of species absent in both samples that could have been present. It is proposed that this count should simply be ignored and that the coefficient should be adjusted using a simple heuristic correction. Four analyses show that the corrected equation outperforms the Dice and Simpson indices, which are highly correlated with many others. In two-sample simulations, similarity is almost always closer to the assumed value when the species pool size and sampling intensity are varied, regardless of whether the underlying abundance distribution is uniform, log-normal, or geometric. The index is also much more robust when sampling is unequal. An analysis of bat samples from peninsular Malaysia buttresses these conclusions. The corrected coefficient also indicates that local assemblages of North American mammals are random subsamples of larger species pools by returning similarity of values of around 1, and it suggests a more consistent relationship between biome-scale comparisons and local-scale comparisons. Finally, it yields a better-dispersed pattern when the biome-scale inventories are ordinated. If these results are generalizable, then the new and old equation should see wide application, potentially taking the place of the two most commonly used alternatives (the interrelated Dice and Jaccard indices) whenever sampling is incomplete. |
doi_str_mv | 10.1890/14-0471.1 |
format | Article |
fullrecord | <record><control><sourceid>jstor_fao_a</sourceid><recordid>TN_cdi_fao_agris_US201600196974</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>43495096</jstor_id><sourcerecordid>43495096</sourcerecordid><originalsourceid>FETCH-LOGICAL-a5595-7a085baeb5c126efecc10e3d46ecb5a075cdbdf7205e73452bb5a9e7070cd8cc3</originalsourceid><addsrcrecordid>eNqFkU9v1DAQxS0EokvhwAcAInGBQ8rY8Z_4WK3aUqkSB-iBk-U4E-RVNl5sb5f99niVskhVEb7YmvnN07xnQl5TOKOthk-U18AVPaNPyILqRteaKnhKFgCU1VqK9oS8SGkF5VDePicnTDIOrVILos6rCXdV3vmUqzBVtrrDuK_C2Fedn2x5Jr_2o40-7ysXcBi88zjll-TZYMeEr-7vU3J7efFt-bm--XJ1vTy_qa0QWtTKQis6i51wlEkc0DkK2PRcouuEBSVc3_WDYiBQNVywrlQ1KlDg-ta55pR8mHU3MfzcYspm7ZPDcbQThm0yxWiJgCsm_o9K1TRaUsEK-v4BugrbOBUjhZLANdftgfo4Uy6GlCIOZhP9umRiKJhD8IZycwje0MK-vVfcdmvsj-SfpAvAZ2DnR9z_W8lcLL8zoEJLJtTB1Jt5bJVyiMcx3nAtQMvSfzf3BxuM_RF9Mrdfy7wsX62lVvyvV5v3mzAZTPbR_R-hjjtt-sHkX7n5DVAOs7w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1660494982</pqid></control><display><type>article</type><title>A new twist on a very old binary similarity coefficient</title><source>Jstor Complete Legacy</source><source>Wiley Online Library - AutoHoldings Journals</source><source>MEDLINE</source><creator>Alroy, John</creator><creatorcontrib>Alroy, John</creatorcontrib><description>Pairwise similarity coefficients are downward biased when samples only record presences and sampling is partial. A simple but forgotten index proposed by Stephen Forbes in 1907 can help solve this problem. His original equation requires knowing the number of species absent in both samples that could have been present. It is proposed that this count should simply be ignored and that the coefficient should be adjusted using a simple heuristic correction. Four analyses show that the corrected equation outperforms the Dice and Simpson indices, which are highly correlated with many others. In two-sample simulations, similarity is almost always closer to the assumed value when the species pool size and sampling intensity are varied, regardless of whether the underlying abundance distribution is uniform, log-normal, or geometric. The index is also much more robust when sampling is unequal. An analysis of bat samples from peninsular Malaysia buttresses these conclusions. The corrected coefficient also indicates that local assemblages of North American mammals are random subsamples of larger species pools by returning similarity of values of around 1, and it suggests a more consistent relationship between biome-scale comparisons and local-scale comparisons. Finally, it yields a better-dispersed pattern when the biome-scale inventories are ordinated. If these results are generalizable, then the new and old equation should see wide application, potentially taking the place of the two most commonly used alternatives (the interrelated Dice and Jaccard indices) whenever sampling is incomplete.</description><identifier>ISSN: 0012-9658</identifier><identifier>EISSN: 1939-9170</identifier><identifier>DOI: 10.1890/14-0471.1</identifier><identifier>PMID: 26240877</identifier><identifier>CODEN: ECGYAQ</identifier><language>eng</language><publisher>United States: Ecological Society of America</publisher><subject>abundance distributions ; Animal behavior ; Animal Distribution ; Animals ; Bats ; Biodiversity ; biogeography ; Biomes ; Chiroptera ; Coefficients ; community ecology ; Correlation analysis ; Ecology ; Ecosystems ; equations ; Forbes index ; inventories ; Mammals ; Mammals - physiology ; Marine ecology ; Models, Biological ; North America ; North American mammals ; principal coordinates analysis ; Sampling bias ; similarity coefficients ; Similarity theorem ; Simulation ; Species ; Synecology</subject><ispartof>Ecology (Durham), 2015-02, Vol.96 (2), p.575-586</ispartof><rights>Copyright © 2015 Ecological Society of America</rights><rights>2015 by the Ecological Society of America</rights><rights>Copyright Ecological Society of America Feb 2015</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a5595-7a085baeb5c126efecc10e3d46ecb5a075cdbdf7205e73452bb5a9e7070cd8cc3</citedby><cites>FETCH-LOGICAL-a5595-7a085baeb5c126efecc10e3d46ecb5a075cdbdf7205e73452bb5a9e7070cd8cc3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/43495096$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/43495096$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>314,776,780,799,1411,27901,27902,45550,45551,57992,58225</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26240877$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Alroy, John</creatorcontrib><title>A new twist on a very old binary similarity coefficient</title><title>Ecology (Durham)</title><addtitle>Ecology</addtitle><description>Pairwise similarity coefficients are downward biased when samples only record presences and sampling is partial. A simple but forgotten index proposed by Stephen Forbes in 1907 can help solve this problem. His original equation requires knowing the number of species absent in both samples that could have been present. It is proposed that this count should simply be ignored and that the coefficient should be adjusted using a simple heuristic correction. Four analyses show that the corrected equation outperforms the Dice and Simpson indices, which are highly correlated with many others. In two-sample simulations, similarity is almost always closer to the assumed value when the species pool size and sampling intensity are varied, regardless of whether the underlying abundance distribution is uniform, log-normal, or geometric. The index is also much more robust when sampling is unequal. An analysis of bat samples from peninsular Malaysia buttresses these conclusions. The corrected coefficient also indicates that local assemblages of North American mammals are random subsamples of larger species pools by returning similarity of values of around 1, and it suggests a more consistent relationship between biome-scale comparisons and local-scale comparisons. Finally, it yields a better-dispersed pattern when the biome-scale inventories are ordinated. If these results are generalizable, then the new and old equation should see wide application, potentially taking the place of the two most commonly used alternatives (the interrelated Dice and Jaccard indices) whenever sampling is incomplete.</description><subject>abundance distributions</subject><subject>Animal behavior</subject><subject>Animal Distribution</subject><subject>Animals</subject><subject>Bats</subject><subject>Biodiversity</subject><subject>biogeography</subject><subject>Biomes</subject><subject>Chiroptera</subject><subject>Coefficients</subject><subject>community ecology</subject><subject>Correlation analysis</subject><subject>Ecology</subject><subject>Ecosystems</subject><subject>equations</subject><subject>Forbes index</subject><subject>inventories</subject><subject>Mammals</subject><subject>Mammals - physiology</subject><subject>Marine ecology</subject><subject>Models, Biological</subject><subject>North America</subject><subject>North American mammals</subject><subject>principal coordinates analysis</subject><subject>Sampling bias</subject><subject>similarity coefficients</subject><subject>Similarity theorem</subject><subject>Simulation</subject><subject>Species</subject><subject>Synecology</subject><issn>0012-9658</issn><issn>1939-9170</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkU9v1DAQxS0EokvhwAcAInGBQ8rY8Z_4WK3aUqkSB-iBk-U4E-RVNl5sb5f99niVskhVEb7YmvnN07xnQl5TOKOthk-U18AVPaNPyILqRteaKnhKFgCU1VqK9oS8SGkF5VDePicnTDIOrVILos6rCXdV3vmUqzBVtrrDuK_C2Fedn2x5Jr_2o40-7ysXcBi88zjll-TZYMeEr-7vU3J7efFt-bm--XJ1vTy_qa0QWtTKQis6i51wlEkc0DkK2PRcouuEBSVc3_WDYiBQNVywrlQ1KlDg-ta55pR8mHU3MfzcYspm7ZPDcbQThm0yxWiJgCsm_o9K1TRaUsEK-v4BugrbOBUjhZLANdftgfo4Uy6GlCIOZhP9umRiKJhD8IZycwje0MK-vVfcdmvsj-SfpAvAZ2DnR9z_W8lcLL8zoEJLJtTB1Jt5bJVyiMcx3nAtQMvSfzf3BxuM_RF9Mrdfy7wsX62lVvyvV5v3mzAZTPbR_R-hjjtt-sHkX7n5DVAOs7w</recordid><startdate>201502</startdate><enddate>201502</enddate><creator>Alroy, John</creator><general>Ecological Society of America</general><scope>FBQ</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QG</scope><scope>7SN</scope><scope>7SS</scope><scope>7ST</scope><scope>7T7</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>K9.</scope><scope>P64</scope><scope>RC3</scope><scope>SOI</scope><scope>7TG</scope><scope>KL.</scope><scope>7X8</scope></search><sort><creationdate>201502</creationdate><title>A new twist on a very old binary similarity coefficient</title><author>Alroy, John</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a5595-7a085baeb5c126efecc10e3d46ecb5a075cdbdf7205e73452bb5a9e7070cd8cc3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>abundance distributions</topic><topic>Animal behavior</topic><topic>Animal Distribution</topic><topic>Animals</topic><topic>Bats</topic><topic>Biodiversity</topic><topic>biogeography</topic><topic>Biomes</topic><topic>Chiroptera</topic><topic>Coefficients</topic><topic>community ecology</topic><topic>Correlation analysis</topic><topic>Ecology</topic><topic>Ecosystems</topic><topic>equations</topic><topic>Forbes index</topic><topic>inventories</topic><topic>Mammals</topic><topic>Mammals - physiology</topic><topic>Marine ecology</topic><topic>Models, Biological</topic><topic>North America</topic><topic>North American mammals</topic><topic>principal coordinates analysis</topic><topic>Sampling bias</topic><topic>similarity coefficients</topic><topic>Similarity theorem</topic><topic>Simulation</topic><topic>Species</topic><topic>Synecology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Alroy, John</creatorcontrib><collection>AGRIS</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Animal Behavior Abstracts</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Environment Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>Environment Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>MEDLINE - Academic</collection><jtitle>Ecology (Durham)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Alroy, John</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A new twist on a very old binary similarity coefficient</atitle><jtitle>Ecology (Durham)</jtitle><addtitle>Ecology</addtitle><date>2015-02</date><risdate>2015</risdate><volume>96</volume><issue>2</issue><spage>575</spage><epage>586</epage><pages>575-586</pages><issn>0012-9658</issn><eissn>1939-9170</eissn><coden>ECGYAQ</coden><abstract>Pairwise similarity coefficients are downward biased when samples only record presences and sampling is partial. A simple but forgotten index proposed by Stephen Forbes in 1907 can help solve this problem. His original equation requires knowing the number of species absent in both samples that could have been present. It is proposed that this count should simply be ignored and that the coefficient should be adjusted using a simple heuristic correction. Four analyses show that the corrected equation outperforms the Dice and Simpson indices, which are highly correlated with many others. In two-sample simulations, similarity is almost always closer to the assumed value when the species pool size and sampling intensity are varied, regardless of whether the underlying abundance distribution is uniform, log-normal, or geometric. The index is also much more robust when sampling is unequal. An analysis of bat samples from peninsular Malaysia buttresses these conclusions. The corrected coefficient also indicates that local assemblages of North American mammals are random subsamples of larger species pools by returning similarity of values of around 1, and it suggests a more consistent relationship between biome-scale comparisons and local-scale comparisons. Finally, it yields a better-dispersed pattern when the biome-scale inventories are ordinated. If these results are generalizable, then the new and old equation should see wide application, potentially taking the place of the two most commonly used alternatives (the interrelated Dice and Jaccard indices) whenever sampling is incomplete.</abstract><cop>United States</cop><pub>Ecological Society of America</pub><pmid>26240877</pmid><doi>10.1890/14-0471.1</doi><tpages>12</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0012-9658 |
ispartof | Ecology (Durham), 2015-02, Vol.96 (2), p.575-586 |
issn | 0012-9658 1939-9170 |
language | eng |
recordid | cdi_fao_agris_US201600196974 |
source | Jstor Complete Legacy; Wiley Online Library - AutoHoldings Journals; MEDLINE |
subjects | abundance distributions Animal behavior Animal Distribution Animals Bats Biodiversity biogeography Biomes Chiroptera Coefficients community ecology Correlation analysis Ecology Ecosystems equations Forbes index inventories Mammals Mammals - physiology Marine ecology Models, Biological North America North American mammals principal coordinates analysis Sampling bias similarity coefficients Similarity theorem Simulation Species Synecology |
title | A new twist on a very old binary similarity coefficient |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T02%3A19%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_fao_a&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20new%20twist%20on%20a%20very%20old%20binary%20similarity%20coefficient&rft.jtitle=Ecology%20(Durham)&rft.au=Alroy,%20John&rft.date=2015-02&rft.volume=96&rft.issue=2&rft.spage=575&rft.epage=586&rft.pages=575-586&rft.issn=0012-9658&rft.eissn=1939-9170&rft.coden=ECGYAQ&rft_id=info:doi/10.1890/14-0471.1&rft_dat=%3Cjstor_fao_a%3E43495096%3C/jstor_fao_a%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1660494982&rft_id=info:pmid/26240877&rft_jstor_id=43495096&rfr_iscdi=true |