Who Writes the Review, Human or AI?
With the increasing use of Artificial Intelligence in Natural Language Processing, concerns have been raised regarding the detection of AI-generated text in various domains. This study aims to investigate this issue by proposing a methodology to accurately distinguish AI-generated and human-written...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Theocharopoulos, Panagiotis C Georgakopoulos, Spiros V Tasoulis, Sotiris K Plagianakos, Vassilis P |
description | With the increasing use of Artificial Intelligence in Natural Language
Processing, concerns have been raised regarding the detection of AI-generated
text in various domains. This study aims to investigate this issue by proposing
a methodology to accurately distinguish AI-generated and human-written book
reviews. Our approach utilizes transfer learning, enabling the model to
identify generated text across different topics while improving its ability to
detect variations in writing style and vocabulary. To evaluate the
effectiveness of the proposed methodology, we developed a dataset consisting of
real book reviews and AI-generated reviews using the recently proposed Vicuna
open-source language model. The experimental results demonstrate that it is
feasible to detect the original source of text, achieving an accuracy rate of
96.86%. Our efforts are oriented toward the exploration of the capabilities and
limitations of Large Language Models in the context of text identification.
Expanding our knowledge in these aspects will be valuable for effectively
navigating similar models in the future and ensuring the integrity and
authenticity of human-generated content. |
doi_str_mv | 10.48550/arxiv.2405.20285 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2405_20285</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2405_20285</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-1e545f097d3c7a8569890be1e3927d34d577fda9d01d1917391115e911df22dc3</originalsourceid><addsrcrecordid>eNotzjsLwjAUBeAsDqL-ACcDrrbmJr2mmUSKj0JBEMGxRHOLBV_EWvXf-1zOgTMcPsa6IMIoRhRD6x9lHcpIYCiFjLHJ-pv9mW98WdGVV3viK6pLug_44na0J372fJKO26xR2MOVOv9usfVsuk4WQbacp8kkC-xIYwCEERbCaKd22sY4MrERWwJSRr63yKHWhbPGCXBgQCsDAEjvdIWUbqdarPe7_Srziy-P1j_zjzb_atULdis25A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Who Writes the Review, Human or AI?</title><source>arXiv.org</source><creator>Theocharopoulos, Panagiotis C ; Georgakopoulos, Spiros V ; Tasoulis, Sotiris K ; Plagianakos, Vassilis P</creator><creatorcontrib>Theocharopoulos, Panagiotis C ; Georgakopoulos, Spiros V ; Tasoulis, Sotiris K ; Plagianakos, Vassilis P</creatorcontrib><description>With the increasing use of Artificial Intelligence in Natural Language
Processing, concerns have been raised regarding the detection of AI-generated
text in various domains. This study aims to investigate this issue by proposing
a methodology to accurately distinguish AI-generated and human-written book
reviews. Our approach utilizes transfer learning, enabling the model to
identify generated text across different topics while improving its ability to
detect variations in writing style and vocabulary. To evaluate the
effectiveness of the proposed methodology, we developed a dataset consisting of
real book reviews and AI-generated reviews using the recently proposed Vicuna
open-source language model. The experimental results demonstrate that it is
feasible to detect the original source of text, achieving an accuracy rate of
96.86%. Our efforts are oriented toward the exploration of the capabilities and
limitations of Large Language Models in the context of text identification.
Expanding our knowledge in these aspects will be valuable for effectively
navigating similar models in the future and ensuring the integrity and
authenticity of human-generated content.</description><identifier>DOI: 10.48550/arxiv.2405.20285</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2024-05</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,886</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2405.20285$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2405.20285$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Theocharopoulos, Panagiotis C</creatorcontrib><creatorcontrib>Georgakopoulos, Spiros V</creatorcontrib><creatorcontrib>Tasoulis, Sotiris K</creatorcontrib><creatorcontrib>Plagianakos, Vassilis P</creatorcontrib><title>Who Writes the Review, Human or AI?</title><description>With the increasing use of Artificial Intelligence in Natural Language
Processing, concerns have been raised regarding the detection of AI-generated
text in various domains. This study aims to investigate this issue by proposing
a methodology to accurately distinguish AI-generated and human-written book
reviews. Our approach utilizes transfer learning, enabling the model to
identify generated text across different topics while improving its ability to
detect variations in writing style and vocabulary. To evaluate the
effectiveness of the proposed methodology, we developed a dataset consisting of
real book reviews and AI-generated reviews using the recently proposed Vicuna
open-source language model. The experimental results demonstrate that it is
feasible to detect the original source of text, achieving an accuracy rate of
96.86%. Our efforts are oriented toward the exploration of the capabilities and
limitations of Large Language Models in the context of text identification.
Expanding our knowledge in these aspects will be valuable for effectively
navigating similar models in the future and ensuring the integrity and
authenticity of human-generated content.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzjsLwjAUBeAsDqL-ACcDrrbmJr2mmUSKj0JBEMGxRHOLBV_EWvXf-1zOgTMcPsa6IMIoRhRD6x9lHcpIYCiFjLHJ-pv9mW98WdGVV3viK6pLug_44na0J372fJKO26xR2MOVOv9usfVsuk4WQbacp8kkC-xIYwCEERbCaKd22sY4MrERWwJSRr63yKHWhbPGCXBgQCsDAEjvdIWUbqdarPe7_Srziy-P1j_zjzb_atULdis25A</recordid><startdate>20240530</startdate><enddate>20240530</enddate><creator>Theocharopoulos, Panagiotis C</creator><creator>Georgakopoulos, Spiros V</creator><creator>Tasoulis, Sotiris K</creator><creator>Plagianakos, Vassilis P</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240530</creationdate><title>Who Writes the Review, Human or AI?</title><author>Theocharopoulos, Panagiotis C ; Georgakopoulos, Spiros V ; Tasoulis, Sotiris K ; Plagianakos, Vassilis P</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-1e545f097d3c7a8569890be1e3927d34d577fda9d01d1917391115e911df22dc3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Theocharopoulos, Panagiotis C</creatorcontrib><creatorcontrib>Georgakopoulos, Spiros V</creatorcontrib><creatorcontrib>Tasoulis, Sotiris K</creatorcontrib><creatorcontrib>Plagianakos, Vassilis P</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Theocharopoulos, Panagiotis C</au><au>Georgakopoulos, Spiros V</au><au>Tasoulis, Sotiris K</au><au>Plagianakos, Vassilis P</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Who Writes the Review, Human or AI?</atitle><date>2024-05-30</date><risdate>2024</risdate><abstract>With the increasing use of Artificial Intelligence in Natural Language
Processing, concerns have been raised regarding the detection of AI-generated
text in various domains. This study aims to investigate this issue by proposing
a methodology to accurately distinguish AI-generated and human-written book
reviews. Our approach utilizes transfer learning, enabling the model to
identify generated text across different topics while improving its ability to
detect variations in writing style and vocabulary. To evaluate the
effectiveness of the proposed methodology, we developed a dataset consisting of
real book reviews and AI-generated reviews using the recently proposed Vicuna
open-source language model. The experimental results demonstrate that it is
feasible to detect the original source of text, achieving an accuracy rate of
96.86%. Our efforts are oriented toward the exploration of the capabilities and
limitations of Large Language Models in the context of text identification.
Expanding our knowledge in these aspects will be valuable for effectively
navigating similar models in the future and ensuring the integrity and
authenticity of human-generated content.</abstract><doi>10.48550/arxiv.2405.20285</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2405.20285 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2405_20285 |
source | arXiv.org |
subjects | Computer Science - Computation and Language |
title | Who Writes the Review, Human or AI? |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T14%3A55%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Who%20Writes%20the%20Review,%20Human%20or%20AI?&rft.au=Theocharopoulos,%20Panagiotis%20C&rft.date=2024-05-30&rft_id=info:doi/10.48550/arxiv.2405.20285&rft_dat=%3Carxiv_GOX%3E2405_20285%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |