Frame-Based Representation for Event Detection on Twitter
Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words...
Gespeichert in:
Veröffentlicht in: | IEICE Transactions on Information and Systems 2018/04/01, Vol.E101.D(4), pp.1180-1188 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1188 |
---|---|
container_issue | 4 |
container_start_page | 1180 |
container_title | IEICE Transactions on Information and Systems |
container_volume | E101.D |
creator | QIN, Yanxia ZHANG, Yue ZHANG, Min ZHENG, Dequan |
description | Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words/segments, tweets are sparser and therefore makes clustering less effective. This article proposes to represent events with triple structures called frames, which are as efficient as, yet can be easier to understand than words/segments. Frames are extracted based on shallow syntactic information of tweets with an unsupervised open information extraction method, which is introduced for domain-independent relation extraction in a single pass over web scale data. This is then followed by bursty frame element extraction functions as feature selection by filtering frame elements with bursty frequency pattern via a probabilistic model. After being clustered and ranked, high-quality events are yielded and then reported by linking frame elements back to frames. Experimental results show that frame-based event detection leads to improved precision over a state-of-the-art baseline segment-based event detection method. Superior readability of frame-based events as compared with segment-based events is demonstrated in some example outputs. |
doi_str_mv | 10.1587/transinf.2017EDP7311 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2022118531</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2022118531</sourcerecordid><originalsourceid>FETCH-LOGICAL-c567t-9e29a2b56752f3004b8a0b4328da32a6cf8f56668957d1061da14e8598ded47b3</originalsourceid><addsrcrecordid>eNpNUF1Lw0AQPETBWv0HPgR8jt7eVy6P2qYqFJRSn49LsqcpbVLvror_3mhtLSzs7DIzyw4hl0CvQersJnrbhqZ114xCVoyfMw5wRAaQCZkCV3BMBjQHlWrJ2Sk5C2FBKWgGckDyibcrTO9swDqZ4dpjwDba2HRt4jqfFB_9mIwxYvW762v-2cSI_pycOLsMePHXh-RlUsxHD-n06f5xdDtNK6mymObIcsvKHkvmOKWi1JaWgjNdW86sqpx2Uimlc5nVQBXUFgRqmesaa5GVfEiutr5r371vMESz6Da-7U8aRhkD6L-CniW2rMp3IXh0Zu2blfVfBqj5CcnsQjIHIfWy2Va2CNG-4l5kfWyqJf6LCqBgxkbswIHJnly9WW-w5d8wP3ho</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2022118531</pqid></control><display><type>article</type><title>Frame-Based Representation for Event Detection on Twitter</title><source>J-STAGE Free</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>QIN, Yanxia ; ZHANG, Yue ; ZHANG, Min ; ZHENG, Dequan</creator><creatorcontrib>QIN, Yanxia ; ZHANG, Yue ; ZHANG, Min ; ZHENG, Dequan</creatorcontrib><description>Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words/segments, tweets are sparser and therefore makes clustering less effective. This article proposes to represent events with triple structures called frames, which are as efficient as, yet can be easier to understand than words/segments. Frames are extracted based on shallow syntactic information of tweets with an unsupervised open information extraction method, which is introduced for domain-independent relation extraction in a single pass over web scale data. This is then followed by bursty frame element extraction functions as feature selection by filtering frame elements with bursty frequency pattern via a probabilistic model. After being clustered and ranked, high-quality events are yielded and then reported by linking frame elements back to frames. Experimental results show that frame-based event detection leads to improved precision over a state-of-the-art baseline segment-based event detection method. Superior readability of frame-based events as compared with segment-based events is demonstrated in some example outputs.</description><identifier>ISSN: 0916-8532</identifier><identifier>EISSN: 1745-1361</identifier><identifier>DOI: 10.1587/transinf.2017EDP7311</identifier><language>eng</language><publisher>Tokyo: The Institute of Electronics, Information and Communication Engineers</publisher><subject>bursty ; Clustering ; event detection ; event representation ; Feature extraction ; Filtration ; frame ; Frames ; Information retrieval ; Probabilistic models ; Segments ; Social networks ; tweet ; z-score</subject><ispartof>IEICE Transactions on Information and Systems, 2018/04/01, Vol.E101.D(4), pp.1180-1188</ispartof><rights>2018 The Institute of Electronics, Information and Communication Engineers</rights><rights>Copyright Japan Science and Technology Agency 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c567t-9e29a2b56752f3004b8a0b4328da32a6cf8f56668957d1061da14e8598ded47b3</citedby><cites>FETCH-LOGICAL-c567t-9e29a2b56752f3004b8a0b4328da32a6cf8f56668957d1061da14e8598ded47b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,781,785,1884,27929,27930</link.rule.ids></links><search><creatorcontrib>QIN, Yanxia</creatorcontrib><creatorcontrib>ZHANG, Yue</creatorcontrib><creatorcontrib>ZHANG, Min</creatorcontrib><creatorcontrib>ZHENG, Dequan</creatorcontrib><title>Frame-Based Representation for Event Detection on Twitter</title><title>IEICE Transactions on Information and Systems</title><addtitle>IEICE Trans. Inf. & Syst.</addtitle><description>Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words/segments, tweets are sparser and therefore makes clustering less effective. This article proposes to represent events with triple structures called frames, which are as efficient as, yet can be easier to understand than words/segments. Frames are extracted based on shallow syntactic information of tweets with an unsupervised open information extraction method, which is introduced for domain-independent relation extraction in a single pass over web scale data. This is then followed by bursty frame element extraction functions as feature selection by filtering frame elements with bursty frequency pattern via a probabilistic model. After being clustered and ranked, high-quality events are yielded and then reported by linking frame elements back to frames. Experimental results show that frame-based event detection leads to improved precision over a state-of-the-art baseline segment-based event detection method. Superior readability of frame-based events as compared with segment-based events is demonstrated in some example outputs.</description><subject>bursty</subject><subject>Clustering</subject><subject>event detection</subject><subject>event representation</subject><subject>Feature extraction</subject><subject>Filtration</subject><subject>frame</subject><subject>Frames</subject><subject>Information retrieval</subject><subject>Probabilistic models</subject><subject>Segments</subject><subject>Social networks</subject><subject>tweet</subject><subject>z-score</subject><issn>0916-8532</issn><issn>1745-1361</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNpNUF1Lw0AQPETBWv0HPgR8jt7eVy6P2qYqFJRSn49LsqcpbVLvror_3mhtLSzs7DIzyw4hl0CvQersJnrbhqZ114xCVoyfMw5wRAaQCZkCV3BMBjQHlWrJ2Sk5C2FBKWgGckDyibcrTO9swDqZ4dpjwDba2HRt4jqfFB_9mIwxYvW762v-2cSI_pycOLsMePHXh-RlUsxHD-n06f5xdDtNK6mymObIcsvKHkvmOKWi1JaWgjNdW86sqpx2Uimlc5nVQBXUFgRqmesaa5GVfEiutr5r371vMESz6Da-7U8aRhkD6L-CniW2rMp3IXh0Zu2blfVfBqj5CcnsQjIHIfWy2Va2CNG-4l5kfWyqJf6LCqBgxkbswIHJnly9WW-w5d8wP3ho</recordid><startdate>20180101</startdate><enddate>20180101</enddate><creator>QIN, Yanxia</creator><creator>ZHANG, Yue</creator><creator>ZHANG, Min</creator><creator>ZHENG, Dequan</creator><general>The Institute of Electronics, Information and Communication Engineers</general><general>Japan Science and Technology Agency</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20180101</creationdate><title>Frame-Based Representation for Event Detection on Twitter</title><author>QIN, Yanxia ; ZHANG, Yue ; ZHANG, Min ; ZHENG, Dequan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c567t-9e29a2b56752f3004b8a0b4328da32a6cf8f56668957d1061da14e8598ded47b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>bursty</topic><topic>Clustering</topic><topic>event detection</topic><topic>event representation</topic><topic>Feature extraction</topic><topic>Filtration</topic><topic>frame</topic><topic>Frames</topic><topic>Information retrieval</topic><topic>Probabilistic models</topic><topic>Segments</topic><topic>Social networks</topic><topic>tweet</topic><topic>z-score</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>QIN, Yanxia</creatorcontrib><creatorcontrib>ZHANG, Yue</creatorcontrib><creatorcontrib>ZHANG, Min</creatorcontrib><creatorcontrib>ZHENG, Dequan</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEICE Transactions on Information and Systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>QIN, Yanxia</au><au>ZHANG, Yue</au><au>ZHANG, Min</au><au>ZHENG, Dequan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Frame-Based Representation for Event Detection on Twitter</atitle><jtitle>IEICE Transactions on Information and Systems</jtitle><addtitle>IEICE Trans. Inf. & Syst.</addtitle><date>2018-01-01</date><risdate>2018</risdate><volume>E101.D</volume><issue>4</issue><spage>1180</spage><epage>1188</epage><pages>1180-1188</pages><issn>0916-8532</issn><eissn>1745-1361</eissn><abstract>Large scale first-hand tweets motivate automatic event detection on Twitter. Previous approaches model events by clustering tweets, words or segments. On the other hand, event clusters represented by tweets are easier to understand than those represented by words/segments. However, compared to words/segments, tweets are sparser and therefore makes clustering less effective. This article proposes to represent events with triple structures called frames, which are as efficient as, yet can be easier to understand than words/segments. Frames are extracted based on shallow syntactic information of tweets with an unsupervised open information extraction method, which is introduced for domain-independent relation extraction in a single pass over web scale data. This is then followed by bursty frame element extraction functions as feature selection by filtering frame elements with bursty frequency pattern via a probabilistic model. After being clustered and ranked, high-quality events are yielded and then reported by linking frame elements back to frames. Experimental results show that frame-based event detection leads to improved precision over a state-of-the-art baseline segment-based event detection method. Superior readability of frame-based events as compared with segment-based events is demonstrated in some example outputs.</abstract><cop>Tokyo</cop><pub>The Institute of Electronics, Information and Communication Engineers</pub><doi>10.1587/transinf.2017EDP7311</doi><tpages>9</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0916-8532 |
ispartof | IEICE Transactions on Information and Systems, 2018/04/01, Vol.E101.D(4), pp.1180-1188 |
issn | 0916-8532 1745-1361 |
language | eng |
recordid | cdi_proquest_journals_2022118531 |
source | J-STAGE Free; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals |
subjects | bursty Clustering event detection event representation Feature extraction Filtration frame Frames Information retrieval Probabilistic models Segments Social networks tweet z-score |
title | Frame-Based Representation for Event Detection on Twitter |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-12T16%3A38%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Frame-Based%20Representation%20for%20Event%20Detection%20on%20Twitter&rft.jtitle=IEICE%20Transactions%20on%20Information%20and%20Systems&rft.au=QIN,%20Yanxia&rft.date=2018-01-01&rft.volume=E101.D&rft.issue=4&rft.spage=1180&rft.epage=1188&rft.pages=1180-1188&rft.issn=0916-8532&rft.eissn=1745-1361&rft_id=info:doi/10.1587/transinf.2017EDP7311&rft_dat=%3Cproquest_cross%3E2022118531%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2022118531&rft_id=info:pmid/&rfr_iscdi=true |