User community detection via embedding of social network structure and temporal content

Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods emplo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information processing & management 2020-03, Vol.57 (2), p.102056, Article 102056
Hauptverfasser: Fani, Hossein, Jiang, Eric, Bagheri, Ebrahim, Al-Obeidat, Feras, Du, Weichang, Kargar, Mehdi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 2
container_start_page 102056
container_title Information processing & management
container_volume 57
creator Fani, Hossein
Jiang, Eric
Bagheri, Ebrahim
Al-Obeidat, Feras
Du, Weichang
Kargar, Mehdi
description Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods employ either explicit links between users (link analysis), or users’ topics of interest in posted content (content analysis), or in tandem. Little work has considered temporal evolution when identifying user communities in a way to group together those users who share not only similar topical interests but also similar temporal behavior towards their topics of interest. In this paper, we identify user communities through multimodal feature learning (embeddings). Our core contributions can be enumerated as (a) we propose a new method for learning neural embeddings for users based on their temporal content similarity; (b) we learn user embeddings based on their social network connections (links) through neural graph embeddings; (c) we systematically interpolate temporal content-based embeddings and social link-based embeddings to capture both social network connections and temporal content evolution for representing users, and (d) we systematically evaluate the quality of each embedding type in isolation and also when interpolated together and demonstrate their performance on a Twitter dataset under two different application scenarios, namely news recommendation and user prediction. We find that (1) content-based methods produce higher quality communities compared to link-based methods; (2) methods that consider temporal evolution of content, our proposed method in particular, show better performance compared to their non-temporal counter-parts; (3) communities that are produced when time is explicitly incorporated in user vector representations have higher quality than the ones produced when time is incorporated into a generative process, and finally (4) while link-based methods are weaker than content-based methods, their interpolation with content-based methods leads to improved quality of the identified communities.
doi_str_mv 10.1016/j.ipm.2019.102056
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2354809182</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S030645731830997X</els_id><sourcerecordid>2354809182</sourcerecordid><originalsourceid>FETCH-LOGICAL-c325t-eee6c7f110a4b8615769568c9cdd7f63dd612adf0302bf646e502a795fee804d3</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOI7-AHcB1x3zaNIWVzL4ggE3Di5DJ7mV1GlSk3TEf2-GunZ1Odxz7j18CF1TsqKEytt-ZcdhxQhtsmZEyBO0oHXFC8EreooWhBNZlKLi5-gixp4QUgrKFuh9GyFg7Ydhcjb9YAMJdLLe4YNtMQw7MMa6D-w7HL227R47SN8-fOKYwqTTFAC3zuAEw-hDXmvvErh0ic66dh_h6m8u0fbx4W39XGxen17W95tCcyZSAQBSVx2lpC13taSiko2QtW60MVUnuTGSstZ0uT7bdbKUIAhrq0Z0ADUpDV-im_nuGPzXBDGp3k_B5ZeKcVHWpKE1yy46u3TwMQbo1Bjs0IYfRYk68lO9yvzUkZ-a-eXM3ZyBXP9gIaioLTgNxoaMSBlv_0n_AhrNeUc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2354809182</pqid></control><display><type>article</type><title>User community detection via embedding of social network structure and temporal content</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Fani, Hossein ; Jiang, Eric ; Bagheri, Ebrahim ; Al-Obeidat, Feras ; Du, Weichang ; Kargar, Mehdi</creator><creatorcontrib>Fani, Hossein ; Jiang, Eric ; Bagheri, Ebrahim ; Al-Obeidat, Feras ; Du, Weichang ; Kargar, Mehdi</creatorcontrib><description>Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods employ either explicit links between users (link analysis), or users’ topics of interest in posted content (content analysis), or in tandem. Little work has considered temporal evolution when identifying user communities in a way to group together those users who share not only similar topical interests but also similar temporal behavior towards their topics of interest. In this paper, we identify user communities through multimodal feature learning (embeddings). Our core contributions can be enumerated as (a) we propose a new method for learning neural embeddings for users based on their temporal content similarity; (b) we learn user embeddings based on their social network connections (links) through neural graph embeddings; (c) we systematically interpolate temporal content-based embeddings and social link-based embeddings to capture both social network connections and temporal content evolution for representing users, and (d) we systematically evaluate the quality of each embedding type in isolation and also when interpolated together and demonstrate their performance on a Twitter dataset under two different application scenarios, namely news recommendation and user prediction. We find that (1) content-based methods produce higher quality communities compared to link-based methods; (2) methods that consider temporal evolution of content, our proposed method in particular, show better performance compared to their non-temporal counter-parts; (3) communities that are produced when time is explicitly incorporated in user vector representations have higher quality than the ones produced when time is incorporated into a generative process, and finally (4) while link-based methods are weaker than content-based methods, their interpolation with content-based methods leads to improved quality of the identified communities.</description><identifier>ISSN: 0306-4573</identifier><identifier>EISSN: 1873-5371</identifier><identifier>DOI: 10.1016/j.ipm.2019.102056</identifier><language>eng</language><publisher>Oxford: Elsevier Ltd</publisher><subject>Content analysis ; Datasets ; Embedded systems ; Embedding ; Evolution ; Identification methods ; Information processing ; Interpolation ; Learning ; Links ; Neural networks ; Production methods ; Social network analysis ; Social networks ; Twitter ; User community detection</subject><ispartof>Information processing &amp; management, 2020-03, Vol.57 (2), p.102056, Article 102056</ispartof><rights>2019</rights><rights>Copyright Pergamon Press Inc. Mar 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c325t-eee6c7f110a4b8615769568c9cdd7f63dd612adf0302bf646e502a795fee804d3</citedby><cites>FETCH-LOGICAL-c325t-eee6c7f110a4b8615769568c9cdd7f63dd612adf0302bf646e502a795fee804d3</cites><orcidid>0000-0002-6033-6564 ; 0000-0002-5148-6237</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.ipm.2019.102056$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27911,27912,45982</link.rule.ids></links><search><creatorcontrib>Fani, Hossein</creatorcontrib><creatorcontrib>Jiang, Eric</creatorcontrib><creatorcontrib>Bagheri, Ebrahim</creatorcontrib><creatorcontrib>Al-Obeidat, Feras</creatorcontrib><creatorcontrib>Du, Weichang</creatorcontrib><creatorcontrib>Kargar, Mehdi</creatorcontrib><title>User community detection via embedding of social network structure and temporal content</title><title>Information processing &amp; management</title><description>Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods employ either explicit links between users (link analysis), or users’ topics of interest in posted content (content analysis), or in tandem. Little work has considered temporal evolution when identifying user communities in a way to group together those users who share not only similar topical interests but also similar temporal behavior towards their topics of interest. In this paper, we identify user communities through multimodal feature learning (embeddings). Our core contributions can be enumerated as (a) we propose a new method for learning neural embeddings for users based on their temporal content similarity; (b) we learn user embeddings based on their social network connections (links) through neural graph embeddings; (c) we systematically interpolate temporal content-based embeddings and social link-based embeddings to capture both social network connections and temporal content evolution for representing users, and (d) we systematically evaluate the quality of each embedding type in isolation and also when interpolated together and demonstrate their performance on a Twitter dataset under two different application scenarios, namely news recommendation and user prediction. We find that (1) content-based methods produce higher quality communities compared to link-based methods; (2) methods that consider temporal evolution of content, our proposed method in particular, show better performance compared to their non-temporal counter-parts; (3) communities that are produced when time is explicitly incorporated in user vector representations have higher quality than the ones produced when time is incorporated into a generative process, and finally (4) while link-based methods are weaker than content-based methods, their interpolation with content-based methods leads to improved quality of the identified communities.</description><subject>Content analysis</subject><subject>Datasets</subject><subject>Embedded systems</subject><subject>Embedding</subject><subject>Evolution</subject><subject>Identification methods</subject><subject>Information processing</subject><subject>Interpolation</subject><subject>Learning</subject><subject>Links</subject><subject>Neural networks</subject><subject>Production methods</subject><subject>Social network analysis</subject><subject>Social networks</subject><subject>Twitter</subject><subject>User community detection</subject><issn>0306-4573</issn><issn>1873-5371</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOI7-AHcB1x3zaNIWVzL4ggE3Di5DJ7mV1GlSk3TEf2-GunZ1Odxz7j18CF1TsqKEytt-ZcdhxQhtsmZEyBO0oHXFC8EreooWhBNZlKLi5-gixp4QUgrKFuh9GyFg7Ydhcjb9YAMJdLLe4YNtMQw7MMa6D-w7HL227R47SN8-fOKYwqTTFAC3zuAEw-hDXmvvErh0ic66dh_h6m8u0fbx4W39XGxen17W95tCcyZSAQBSVx2lpC13taSiko2QtW60MVUnuTGSstZ0uT7bdbKUIAhrq0Z0ADUpDV-im_nuGPzXBDGp3k_B5ZeKcVHWpKE1yy46u3TwMQbo1Bjs0IYfRYk68lO9yvzUkZ-a-eXM3ZyBXP9gIaioLTgNxoaMSBlv_0n_AhrNeUc</recordid><startdate>202003</startdate><enddate>202003</enddate><creator>Fani, Hossein</creator><creator>Jiang, Eric</creator><creator>Bagheri, Ebrahim</creator><creator>Al-Obeidat, Feras</creator><creator>Du, Weichang</creator><creator>Kargar, Mehdi</creator><general>Elsevier Ltd</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope><orcidid>https://orcid.org/0000-0002-6033-6564</orcidid><orcidid>https://orcid.org/0000-0002-5148-6237</orcidid></search><sort><creationdate>202003</creationdate><title>User community detection via embedding of social network structure and temporal content</title><author>Fani, Hossein ; Jiang, Eric ; Bagheri, Ebrahim ; Al-Obeidat, Feras ; Du, Weichang ; Kargar, Mehdi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c325t-eee6c7f110a4b8615769568c9cdd7f63dd612adf0302bf646e502a795fee804d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Content analysis</topic><topic>Datasets</topic><topic>Embedded systems</topic><topic>Embedding</topic><topic>Evolution</topic><topic>Identification methods</topic><topic>Information processing</topic><topic>Interpolation</topic><topic>Learning</topic><topic>Links</topic><topic>Neural networks</topic><topic>Production methods</topic><topic>Social network analysis</topic><topic>Social networks</topic><topic>Twitter</topic><topic>User community detection</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fani, Hossein</creatorcontrib><creatorcontrib>Jiang, Eric</creatorcontrib><creatorcontrib>Bagheri, Ebrahim</creatorcontrib><creatorcontrib>Al-Obeidat, Feras</creatorcontrib><creatorcontrib>Du, Weichang</creatorcontrib><creatorcontrib>Kargar, Mehdi</creatorcontrib><collection>CrossRef</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><jtitle>Information processing &amp; management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fani, Hossein</au><au>Jiang, Eric</au><au>Bagheri, Ebrahim</au><au>Al-Obeidat, Feras</au><au>Du, Weichang</au><au>Kargar, Mehdi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>User community detection via embedding of social network structure and temporal content</atitle><jtitle>Information processing &amp; management</jtitle><date>2020-03</date><risdate>2020</risdate><volume>57</volume><issue>2</issue><spage>102056</spage><pages>102056-</pages><artnum>102056</artnum><issn>0306-4573</issn><eissn>1873-5371</eissn><abstract>Identifying and extracting user communities is an important step towards understanding social network dynamics from a macro perspective. For this reason, the work in this paper explores various aspects related to the identification of user communities. To date, user community detection methods employ either explicit links between users (link analysis), or users’ topics of interest in posted content (content analysis), or in tandem. Little work has considered temporal evolution when identifying user communities in a way to group together those users who share not only similar topical interests but also similar temporal behavior towards their topics of interest. In this paper, we identify user communities through multimodal feature learning (embeddings). Our core contributions can be enumerated as (a) we propose a new method for learning neural embeddings for users based on their temporal content similarity; (b) we learn user embeddings based on their social network connections (links) through neural graph embeddings; (c) we systematically interpolate temporal content-based embeddings and social link-based embeddings to capture both social network connections and temporal content evolution for representing users, and (d) we systematically evaluate the quality of each embedding type in isolation and also when interpolated together and demonstrate their performance on a Twitter dataset under two different application scenarios, namely news recommendation and user prediction. We find that (1) content-based methods produce higher quality communities compared to link-based methods; (2) methods that consider temporal evolution of content, our proposed method in particular, show better performance compared to their non-temporal counter-parts; (3) communities that are produced when time is explicitly incorporated in user vector representations have higher quality than the ones produced when time is incorporated into a generative process, and finally (4) while link-based methods are weaker than content-based methods, their interpolation with content-based methods leads to improved quality of the identified communities.</abstract><cop>Oxford</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.ipm.2019.102056</doi><orcidid>https://orcid.org/0000-0002-6033-6564</orcidid><orcidid>https://orcid.org/0000-0002-5148-6237</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0306-4573
ispartof Information processing & management, 2020-03, Vol.57 (2), p.102056, Article 102056
issn 0306-4573
1873-5371
language eng
recordid cdi_proquest_journals_2354809182
source ScienceDirect Journals (5 years ago - present)
subjects Content analysis
Datasets
Embedded systems
Embedding
Evolution
Identification methods
Information processing
Interpolation
Learning
Links
Neural networks
Production methods
Social network analysis
Social networks
Twitter
User community detection
title User community detection via embedding of social network structure and temporal content
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T21%3A00%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=User%20community%20detection%20via%20embedding%20of%20social%20network%20structure%20and%20temporal%20content&rft.jtitle=Information%20processing%20&%20management&rft.au=Fani,%20Hossein&rft.date=2020-03&rft.volume=57&rft.issue=2&rft.spage=102056&rft.pages=102056-&rft.artnum=102056&rft.issn=0306-4573&rft.eissn=1873-5371&rft_id=info:doi/10.1016/j.ipm.2019.102056&rft_dat=%3Cproquest_cross%3E2354809182%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2354809182&rft_id=info:pmid/&rft_els_id=S030645731830997X&rfr_iscdi=true