Frame-Based Representation for Event Detection on Twitter

Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEICE Transactions on Information and Systems 2018/04/01, Vol.E101.D(4), pp.1180-1188
Hauptverfasser: QIN, Yanxia, ZHANG, Yue, ZHANG, Min, ZHENG, Dequan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1188
container_issue 4
container_start_page 1180
container_title IEICE Transactions on Information and Systems
container_volume E101.D
creator QIN, Yanxia
ZHANG, Yue
ZHANG, Min
ZHENG, Dequan
description Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words/segments, tweets are sparser and therefore makes clustering less effective. This article proposes to represent events with triple structures called frames, which are as efficient as, yet can be easier to understand than words/segments. Frames are extracted based on shallow syntactic information of tweets with an unsupervised open information extraction method, which is introduced for domain-independent relation extraction in a single pass over web scale data. This is then followed by bursty frame element extraction functions as feature selection by filtering frame elements with bursty frequency pattern via a probabilistic model. After being clustered and ranked, high-quality events are yielded and then reported by linking frame elements back to frames. Experimental results show that frame-based event detection leads to improved precision over a state-of-the-art baseline segment-based event detection method. Superior readability of frame-based events as compared with segment-based events is demonstrated in some example outputs.
doi_str_mv 10.1587/transinf.2017EDP7311
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2022118531</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2022118531</sourcerecordid><originalsourceid>FETCH-LOGICAL-c567t-9e29a2b56752f3004b8a0b4328da32a6cf8f56668957d1061da14e8598ded47b3</originalsourceid><addsrcrecordid>eNpNUF1Lw0AQPETBWv0HPgR8jt7eVy6P2qYqFJRSn49LsqcpbVLvror_3mhtLSzs7DIzyw4hl0CvQersJnrbhqZ114xCVoyfMw5wRAaQCZkCV3BMBjQHlWrJ2Sk5C2FBKWgGckDyibcrTO9swDqZ4dpjwDba2HRt4jqfFB_9mIwxYvW762v-2cSI_pycOLsMePHXh-RlUsxHD-n06f5xdDtNK6mymObIcsvKHkvmOKWi1JaWgjNdW86sqpx2Uimlc5nVQBXUFgRqmesaa5GVfEiutr5r371vMESz6Da-7U8aRhkD6L-CniW2rMp3IXh0Zu2blfVfBqj5CcnsQjIHIfWy2Va2CNG-4l5kfWyqJf6LCqBgxkbswIHJnly9WW-w5d8wP3ho</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2022118531</pqid></control><display><type>article</type><title>Frame-Based Representation for Event Detection on Twitter</title><source>J-STAGE Free</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>QIN, Yanxia ; ZHANG, Yue ; ZHANG, Min ; ZHENG, Dequan</creator><creatorcontrib>QIN, Yanxia ; ZHANG, Yue ; ZHANG, Min ; ZHENG, Dequan</creatorcontrib><description>Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words/segments, tweets are sparser and therefore makes clustering less effective. This article proposes to represent events with triple structures called frames, which are as efficient as, yet can be easier to understand than words/segments. Frames are extracted based on shallow syntactic information of tweets with an unsupervised open information extraction method, which is introduced for domain-independent relation extraction in a single pass over web scale data. This is then followed by bursty frame element extraction functions as feature selection by filtering frame elements with bursty frequency pattern via a probabilistic model. After being clustered and ranked, high-quality events are yielded and then reported by linking frame elements back to frames. Experimental results show that frame-based event detection leads to improved precision over a state-of-the-art baseline segment-based event detection method. Superior readability of frame-based events as compared with segment-based events is demonstrated in some example outputs.</description><identifier>ISSN: 0916-8532</identifier><identifier>EISSN: 1745-1361</identifier><identifier>DOI: 10.1587/transinf.2017EDP7311</identifier><language>eng</language><publisher>Tokyo: The Institute of Electronics, Information and Communication Engineers</publisher><subject>bursty ; Clustering ; event detection ; event representation ; Feature extraction ; Filtration ; frame ; Frames ; Information retrieval ; Probabilistic models ; Segments ; Social networks ; tweet ; z-score</subject><ispartof>IEICE Transactions on Information and Systems, 2018/04/01, Vol.E101.D(4), pp.1180-1188</ispartof><rights>2018 The Institute of Electronics, Information and Communication Engineers</rights><rights>Copyright Japan Science and Technology Agency 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c567t-9e29a2b56752f3004b8a0b4328da32a6cf8f56668957d1061da14e8598ded47b3</citedby><cites>FETCH-LOGICAL-c567t-9e29a2b56752f3004b8a0b4328da32a6cf8f56668957d1061da14e8598ded47b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,781,785,1884,27929,27930</link.rule.ids></links><search><creatorcontrib>QIN, Yanxia</creatorcontrib><creatorcontrib>ZHANG, Yue</creatorcontrib><creatorcontrib>ZHANG, Min</creatorcontrib><creatorcontrib>ZHENG, Dequan</creatorcontrib><title>Frame-Based Representation for Event Detection on Twitter</title><title>IEICE Transactions on Information and Systems</title><addtitle>IEICE Trans. Inf. &amp; Syst.</addtitle><description>Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words/segments, tweets are sparser and therefore makes clustering less effective. This article proposes to represent events with triple structures called frames, which are as efficient as, yet can be easier to understand than words/segments. Frames are extracted based on shallow syntactic information of tweets with an unsupervised open information extraction method, which is introduced for domain-independent relation extraction in a single pass over web scale data. This is then followed by bursty frame element extraction functions as feature selection by filtering frame elements with bursty frequency pattern via a probabilistic model. After being clustered and ranked, high-quality events are yielded and then reported by linking frame elements back to frames. Experimental results show that frame-based event detection leads to improved precision over a state-of-the-art baseline segment-based event detection method. Superior readability of frame-based events as compared with segment-based events is demonstrated in some example outputs.</description><subject>bursty</subject><subject>Clustering</subject><subject>event detection</subject><subject>event representation</subject><subject>Feature extraction</subject><subject>Filtration</subject><subject>frame</subject><subject>Frames</subject><subject>Information retrieval</subject><subject>Probabilistic models</subject><subject>Segments</subject><subject>Social networks</subject><subject>tweet</subject><subject>z-score</subject><issn>0916-8532</issn><issn>1745-1361</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNpNUF1Lw0AQPETBWv0HPgR8jt7eVy6P2qYqFJRSn49LsqcpbVLvror_3mhtLSzs7DIzyw4hl0CvQersJnrbhqZ114xCVoyfMw5wRAaQCZkCV3BMBjQHlWrJ2Sk5C2FBKWgGckDyibcrTO9swDqZ4dpjwDba2HRt4jqfFB_9mIwxYvW762v-2cSI_pycOLsMePHXh-RlUsxHD-n06f5xdDtNK6mymObIcsvKHkvmOKWi1JaWgjNdW86sqpx2Uimlc5nVQBXUFgRqmesaa5GVfEiutr5r371vMESz6Da-7U8aRhkD6L-CniW2rMp3IXh0Zu2blfVfBqj5CcnsQjIHIfWy2Va2CNG-4l5kfWyqJf6LCqBgxkbswIHJnly9WW-w5d8wP3ho</recordid><startdate>20180101</startdate><enddate>20180101</enddate><creator>QIN, Yanxia</creator><creator>ZHANG, Yue</creator><creator>ZHANG, Min</creator><creator>ZHENG, Dequan</creator><general>The Institute of Electronics, Information and Communication Engineers</general><general>Japan Science and Technology Agency</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20180101</creationdate><title>Frame-Based Representation for Event Detection on Twitter</title><author>QIN, Yanxia ; ZHANG, Yue ; ZHANG, Min ; ZHENG, Dequan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c567t-9e29a2b56752f3004b8a0b4328da32a6cf8f56668957d1061da14e8598ded47b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>bursty</topic><topic>Clustering</topic><topic>event detection</topic><topic>event representation</topic><topic>Feature extraction</topic><topic>Filtration</topic><topic>frame</topic><topic>Frames</topic><topic>Information retrieval</topic><topic>Probabilistic models</topic><topic>Segments</topic><topic>Social networks</topic><topic>tweet</topic><topic>z-score</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>QIN, Yanxia</creatorcontrib><creatorcontrib>ZHANG, Yue</creatorcontrib><creatorcontrib>ZHANG, Min</creatorcontrib><creatorcontrib>ZHENG, Dequan</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEICE Transactions on Information and Systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>QIN, Yanxia</au><au>ZHANG, Yue</au><au>ZHANG, Min</au><au>ZHENG, Dequan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Frame-Based Representation for Event Detection on Twitter</atitle><jtitle>IEICE Transactions on Information and Systems</jtitle><addtitle>IEICE Trans. Inf. &amp; Syst.</addtitle><date>2018-01-01</date><risdate>2018</risdate><volume>E101.D</volume><issue>4</issue><spage>1180</spage><epage>1188</epage><pages>1180-1188</pages><issn>0916-8532</issn><eissn>1745-1361</eissn><abstract>Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words/segments, tweets are sparser and therefore makes clustering less effective. This article proposes to represent events with triple structures called frames, which are as efficient as, yet can be easier to understand than words/segments. Frames are extracted based on shallow syntactic information of tweets with an unsupervised open information extraction method, which is introduced for domain-independent relation extraction in a single pass over web scale data. This is then followed by bursty frame element extraction functions as feature selection by filtering frame elements with bursty frequency pattern via a probabilistic model. After being clustered and ranked, high-quality events are yielded and then reported by linking frame elements back to frames. Experimental results show that frame-based event detection leads to improved precision over a state-of-the-art baseline segment-based event detection method. Superior readability of frame-based events as compared with segment-based events is demonstrated in some example outputs.</abstract><cop>Tokyo</cop><pub>The Institute of Electronics, Information and Communication Engineers</pub><doi>10.1587/transinf.2017EDP7311</doi><tpages>9</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0916-8532
ispartof IEICE Transactions on Information and Systems, 2018/04/01, Vol.E101.D(4), pp.1180-1188
issn 0916-8532
1745-1361
language eng
recordid cdi_proquest_journals_2022118531
source J-STAGE Free; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects bursty
Clustering
event detection
event representation
Feature extraction
Filtration
frame
Frames
Information retrieval
Probabilistic models
Segments
Social networks
tweet
z-score
title Frame-Based Representation for Event Detection on Twitter
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-12T16%3A38%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Frame-Based%20Representation%20for%20Event%20Detection%20on%20Twitter&rft.jtitle=IEICE%20Transactions%20on%20Information%20and%20Systems&rft.au=QIN,%20Yanxia&rft.date=2018-01-01&rft.volume=E101.D&rft.issue=4&rft.spage=1180&rft.epage=1188&rft.pages=1180-1188&rft.issn=0916-8532&rft.eissn=1745-1361&rft_id=info:doi/10.1587/transinf.2017EDP7311&rft_dat=%3Cproquest_cross%3E2022118531%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2022118531&rft_id=info:pmid/&rfr_iscdi=true