Tackling Bias in Pre-trained Language Models: Current Trends and Under-represented Societies

Bibliographic details
Published in: arXiv.org 2023-12
Main authors: Yogarajan, Vithya; Dobbie, Gillian; Te Taka Keegan; Neuwirth, Rostam J
Format: Article
Language: eng
Subjects: Bias, Trends
Online access: Full text
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Yogarajan, Vithya
Dobbie, Gillian
Te Taka Keegan
Neuwirth, Rostam J
description The benefits and capabilities of pre-trained language models (LLMs) in current and future innovations are vital to any society. However, introducing and using LLMs brings biases and discrimination, raising concerns about equality, diversity and fairness that must be addressed. While understanding and acknowledging bias in LLMs and developing mitigation strategies are crucial, generalised assumptions about societal needs can disadvantage under-represented societies and indigenous populations. Furthermore, ongoing changes to regulations and laws worldwide, both enacted and proposed, also affect research capabilities in tackling the bias problem. This research presents a comprehensive survey synthesising the current trends and limitations in techniques for identifying and mitigating bias in LLMs, with the methods grouped into metrics, benchmark datasets, and mitigation strategies. The importance and novelty of this survey lie in exploring the perspective of under-represented societies. We argue that current practices for tackling the bias problem cannot simply be 'plugged in' to address the needs of under-represented societies. We use examples from New Zealand to present requirements for adopting existing techniques for under-represented societies.
format Article
date 2023-12-03
oa free_for_read
publisher Ithaca: Cornell University Library, arXiv.org
rights 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-12
issn 2331-8422
language eng
recordid cdi_proquest_journals_2898163347
source Free E- Journals
subjects Bias
Trends
title Tackling Bias in Pre-trained Language Models: Current Trends and Under-represented Societies
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T18%3A40%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Tackling%20Bias%20in%20Pre-trained%20Language%20Models:%20Current%20Trends%20and%20Under-represented%20Societies&rft.jtitle=arXiv.org&rft.au=Yogarajan,%20Vithya&rft.date=2023-12-03&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2898163347%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2898163347&rft_id=info:pmid/&rfr_iscdi=true