User community detection via embedding of social network structure and temporal content
Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods emplo...
Gespeichert in:
Veröffentlicht in: | Information processing & management 2020-03, Vol.57 (2), p.102056, Article 102056 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 2 |
container_start_page | 102056 |
container_title | Information processing & management |
container_volume | 57 |
creator | Fani, Hossein Jiang, Eric Bagheri, Ebrahim Al-Obeidat, Feras Du, Weichang Kargar, Mehdi |
description | Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods employ either explicit links between users (link analysis), or users’ topics of interest in posted content (content analysis), or in tandem. Little work has considered temporal evolution when identifying user communities in a way to group together those users who share not only similar topical interests but also similar temporal behavior towards their topics of interest. In this paper, we identify user communities through multimodal feature learning (embeddings). Our core contributions can be enumerated as (a) we propose a new method for learning neural embeddings for users based on their temporal content similarity; (b) we learn user embeddings based on their social network connections (links) through neural graph embeddings; (c) we systematically interpolate temporal content-based embeddings and social link-based embeddings to capture both social network connections and temporal content evolution for representing users, and (d) we systematically evaluate the quality of each embedding type in isolation and also when interpolated together and demonstrate their performance on a Twitter dataset under two different application scenarios, namely news recommendation and user prediction. We find that (1) content-based methods produce higher quality communities compared to link-based methods; (2) methods that consider temporal evolution of content, our proposed method in particular, show better performance compared to their non-temporal counter-parts; (3) communities that are produced when time is explicitly incorporated in user vector representations have higher quality than the ones produced when time is incorporated into a generative process, and finally (4) while link-based methods are weaker than content-based methods, their interpolation with content-based methods leads to improved quality of the identified communities. |
doi_str_mv | 10.1016/j.ipm.2019.102056 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2354809182</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S030645731830997X</els_id><sourcerecordid>2354809182</sourcerecordid><originalsourceid>FETCH-LOGICAL-c325t-eee6c7f110a4b8615769568c9cdd7f63dd612adf0302bf646e502a795fee804d3</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOI7-AHcB1x3zaNIWVzL4ggE3Di5DJ7mV1GlSk3TEf2-GunZ1Odxz7j18CF1TsqKEytt-ZcdhxQhtsmZEyBO0oHXFC8EreooWhBNZlKLi5-gixp4QUgrKFuh9GyFg7Ydhcjb9YAMJdLLe4YNtMQw7MMa6D-w7HL227R47SN8-fOKYwqTTFAC3zuAEw-hDXmvvErh0ic66dh_h6m8u0fbx4W39XGxen17W95tCcyZSAQBSVx2lpC13taSiko2QtW60MVUnuTGSstZ0uT7bdbKUIAhrq0Z0ADUpDV-im_nuGPzXBDGp3k_B5ZeKcVHWpKE1yy46u3TwMQbo1Bjs0IYfRYk68lO9yvzUkZ-a-eXM3ZyBXP9gIaioLTgNxoaMSBlv_0n_AhrNeUc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2354809182</pqid></control><display><type>article</type><title>User community detection via embedding of social network structure and temporal content</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Fani, Hossein ; Jiang, Eric ; Bagheri, Ebrahim ; Al-Obeidat, Feras ; Du, Weichang ; Kargar, Mehdi</creator><creatorcontrib>Fani, Hossein ; Jiang, Eric ; Bagheri, Ebrahim ; Al-Obeidat, Feras ; Du, Weichang ; Kargar, Mehdi</creatorcontrib><description>Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods employ either explicit links between users (link analysis), or users’ topics of interest in posted content (content analysis), or in tandem. Little work has considered temporal evolution when identifying user communities in a way to group together those users who share not only similar topical interests but also similar temporal behavior towards their topics of interest. In this paper, we identify user communities through multimodal feature learning (embeddings). Our core contributions can be enumerated as (a) we propose a new method for learning neural embeddings for users based on their temporal content similarity; (b) we learn user embeddings based on their social network connections (links) through neural graph embeddings; (c) we systematically interpolate temporal content-based embeddings and social link-based embeddings to capture both social network connections and temporal content evolution for representing users, and (d) we systematically evaluate the quality of each embedding type in isolation and also when interpolated together and demonstrate their performance on a Twitter dataset under two different application scenarios, namely news recommendation and user prediction. We find that (1) content-based methods produce higher quality communities compared to link-based methods; (2) methods that consider temporal evolution of content, our proposed method in particular, show better performance compared to their non-temporal counter-parts; (3) communities that are produced when time is explicitly incorporated in user vector representations have higher quality than the ones produced when time is incorporated into a generative process, and finally (4) while link-based methods are weaker than content-based methods, their interpolation with content-based methods leads to improved quality of the identified communities.</description><identifier>ISSN: 0306-4573</identifier><identifier>EISSN: 1873-5371</identifier><identifier>DOI: 10.1016/j.ipm.2019.102056</identifier><language>eng</language><publisher>Oxford: Elsevier Ltd</publisher><subject>Content analysis ; Datasets ; Embedded systems ; Embedding ; Evolution ; Identification methods ; Information processing ; Interpolation ; Learning ; Links ; Neural networks ; Production methods ; Social network analysis ; Social networks ; Twitter ; User community detection</subject><ispartof>Information processing & management, 2020-03, Vol.57 (2), p.102056, Article 102056</ispartof><rights>2019</rights><rights>Copyright Pergamon Press Inc. Mar 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c325t-eee6c7f110a4b8615769568c9cdd7f63dd612adf0302bf646e502a795fee804d3</citedby><cites>FETCH-LOGICAL-c325t-eee6c7f110a4b8615769568c9cdd7f63dd612adf0302bf646e502a795fee804d3</cites><orcidid>0000-0002-6033-6564 ; 0000-0002-5148-6237</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.ipm.2019.102056$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27911,27912,45982</link.rule.ids></links><search><creatorcontrib>Fani, Hossein</creatorcontrib><creatorcontrib>Jiang, Eric</creatorcontrib><creatorcontrib>Bagheri, Ebrahim</creatorcontrib><creatorcontrib>Al-Obeidat, Feras</creatorcontrib><creatorcontrib>Du, Weichang</creatorcontrib><creatorcontrib>Kargar, Mehdi</creatorcontrib><title>User community detection via embedding of social network structure and temporal content</title><title>Information processing & management</title><description>Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods employ either explicit links between users (link analysis), or users’ topics of interest in posted content (content analysis), or in tandem. Little work has considered temporal evolution when identifying user communities in a way to group together those users who share not only similar topical interests but also similar temporal behavior towards their topics of interest. In this paper, we identify user communities through multimodal feature learning (embeddings). Our core contributions can be enumerated as (a) we propose a new method for learning neural embeddings for users based on their temporal content similarity; (b) we learn user embeddings based on their social network connections (links) through neural graph embeddings; (c) we systematically interpolate temporal content-based embeddings and social link-based embeddings to capture both social network connections and temporal content evolution for representing users, and (d) we systematically evaluate the quality of each embedding type in isolation and also when interpolated together and demonstrate their performance on a Twitter dataset under two different application scenarios, namely news recommendation and user prediction. We find that (1) content-based methods produce higher quality communities compared to link-based methods; (2) methods that consider temporal evolution of content, our proposed method in particular, show better performance compared to their non-temporal counter-parts; (3) communities that are produced when time is explicitly incorporated in user vector representations have higher quality than the ones produced when time is incorporated into a generative process, and finally (4) while link-based methods are weaker than content-based methods, their interpolation with content-based methods leads to improved quality of the identified communities.</description><subject>Content analysis</subject><subject>Datasets</subject><subject>Embedded systems</subject><subject>Embedding</subject><subject>Evolution</subject><subject>Identification methods</subject><subject>Information processing</subject><subject>Interpolation</subject><subject>Learning</subject><subject>Links</subject><subject>Neural networks</subject><subject>Production methods</subject><subject>Social network analysis</subject><subject>Social networks</subject><subject>Twitter</subject><subject>User community detection</subject><issn>0306-4573</issn><issn>1873-5371</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOI7-AHcB1x3zaNIWVzL4ggE3Di5DJ7mV1GlSk3TEf2-GunZ1Odxz7j18CF1TsqKEytt-ZcdhxQhtsmZEyBO0oHXFC8EreooWhBNZlKLi5-gixp4QUgrKFuh9GyFg7Ydhcjb9YAMJdLLe4YNtMQw7MMa6D-w7HL227R47SN8-fOKYwqTTFAC3zuAEw-hDXmvvErh0ic66dh_h6m8u0fbx4W39XGxen17W95tCcyZSAQBSVx2lpC13taSiko2QtW60MVUnuTGSstZ0uT7bdbKUIAhrq0Z0ADUpDV-im_nuGPzXBDGp3k_B5ZeKcVHWpKE1yy46u3TwMQbo1Bjs0IYfRYk68lO9yvzUkZ-a-eXM3ZyBXP9gIaioLTgNxoaMSBlv_0n_AhrNeUc</recordid><startdate>202003</startdate><enddate>202003</enddate><creator>Fani, Hossein</creator><creator>Jiang, Eric</creator><creator>Bagheri, Ebrahim</creator><creator>Al-Obeidat, Feras</creator><creator>Du, Weichang</creator><creator>Kargar, Mehdi</creator><general>Elsevier Ltd</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope><orcidid>https://orcid.org/0000-0002-6033-6564</orcidid><orcidid>https://orcid.org/0000-0002-5148-6237</orcidid></search><sort><creationdate>202003</creationdate><title>User community detection via embedding of social network structure and temporal content</title><author>Fani, Hossein ; Jiang, Eric ; Bagheri, Ebrahim ; Al-Obeidat, Feras ; Du, Weichang ; Kargar, Mehdi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c325t-eee6c7f110a4b8615769568c9cdd7f63dd612adf0302bf646e502a795fee804d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Content analysis</topic><topic>Datasets</topic><topic>Embedded systems</topic><topic>Embedding</topic><topic>Evolution</topic><topic>Identification methods</topic><topic>Information processing</topic><topic>Interpolation</topic><topic>Learning</topic><topic>Links</topic><topic>Neural networks</topic><topic>Production methods</topic><topic>Social network analysis</topic><topic>Social networks</topic><topic>Twitter</topic><topic>User community detection</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fani, Hossein</creatorcontrib><creatorcontrib>Jiang, Eric</creatorcontrib><creatorcontrib>Bagheri, Ebrahim</creatorcontrib><creatorcontrib>Al-Obeidat, Feras</creatorcontrib><creatorcontrib>Du, Weichang</creatorcontrib><creatorcontrib>Kargar, Mehdi</creatorcontrib><collection>CrossRef</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><jtitle>Information processing & management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fani, Hossein</au><au>Jiang, Eric</au><au>Bagheri, Ebrahim</au><au>Al-Obeidat, Feras</au><au>Du, Weichang</au><au>Kargar, Mehdi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>User community detection via embedding of social network structure and temporal content</atitle><jtitle>Information processing & management</jtitle><date>2020-03</date><risdate>2020</risdate><volume>57</volume><issue>2</issue><spage>102056</spage><pages>102056-</pages><artnum>102056</artnum><issn>0306-4573</issn><eissn>1873-5371</eissn><abstract>Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods employ either explicit links between users (link analysis), or users’ topics of interest in posted content (content analysis), or in tandem. Little work has considered temporal evolution when identifying user communities in a way to group together those users who share not only similar topical interests but also similar temporal behavior towards their topics of interest. In this paper, we identify user communities through multimodal feature learning (embeddings). Our core contributions can be enumerated as (a) we propose a new method for learning neural embeddings for users based on their temporal content similarity; (b) we learn user embeddings based on their social network connections (links) through neural graph embeddings; (c) we systematically interpolate temporal content-based embeddings and social link-based embeddings to capture both social network connections and temporal content evolution for representing users, and (d) we systematically evaluate the quality of each embedding type in isolation and also when interpolated together and demonstrate their performance on a Twitter dataset under two different application scenarios, namely news recommendation and user prediction. We find that (1) content-based methods produce higher quality communities compared to link-based methods; (2) methods that consider temporal evolution of content, our proposed method in particular, show better performance compared to their non-temporal counter-parts; (3) communities that are produced when time is explicitly incorporated in user vector representations have higher quality than the ones produced when time is incorporated into a generative process, and finally (4) while link-based methods are weaker than content-based methods, their interpolation with content-based methods leads to improved quality of the identified communities.</abstract><cop>Oxford</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.ipm.2019.102056</doi><orcidid>https://orcid.org/0000-0002-6033-6564</orcidid><orcidid>https://orcid.org/0000-0002-5148-6237</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0306-4573 |
ispartof | Information processing & management, 2020-03, Vol.57 (2), p.102056, Article 102056 |
issn | 0306-4573 1873-5371 |
language | eng |
recordid | cdi_proquest_journals_2354809182 |
source | ScienceDirect Journals (5 years ago - present) |
subjects | Content analysis Datasets Embedded systems Embedding Evolution Identification methods Information processing Interpolation Learning Links Neural networks Production methods Social network analysis Social networks User community detection |
title | User community detection via embedding of social network structure and temporal content |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T21%3A00%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=User%20community%20detection%20via%20embedding%20of%20social%20network%20structure%20and%20temporal%20content&rft.jtitle=Information%20processing%20&%20management&rft.au=Fani,%20Hossein&rft.date=2020-03&rft.volume=57&rft.issue=2&rft.spage=102056&rft.pages=102056-&rft.artnum=102056&rft.issn=0306-4573&rft.eissn=1873-5371&rft_id=info:doi/10.1016/j.ipm.2019.102056&rft_dat=%3Cproquest_cross%3E2354809182%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2354809182&rft_id=info:pmid/&rft_els_id=S030645731830997X&rfr_iscdi=true |