SEHF: A Summary-Enhanced Hierarchical Framework for Financial Report Sentiment Analysis

Financial reports serve as crucial resources for investors and researchers, providing analysts' assessments of stocks that play a vital role in stock market applications. However, detecting analysts' opinions and sentiments in financial reports is challenging. First, the formal and profess...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on computational social systems 2024-06, Vol.11 (3), p.4087-4101
Hauptverfasser: Li, Haozhou, Peng, Qinke, Wang, Xinyuan, Mou, Xu, Wang, Yonghao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Financial reports serve as crucial resources for investors and researchers, providing analysts' assessments of stocks that play a vital role in stock market applications. However, detecting analysts' opinions and sentiments in financial reports is challenging. First, the formal and professional language used in these reports makes it difficult for previous methods to comprehend domain-specific knowledge. Second, financial reports often adopt lengthy and elaborate expressions to convey rich semantics, which exposes the existing methods to contextual information loss, especially on long-term dependencies. To address these problems, we propose a summary-enhanced hierarchical framework (SEHF), which leverages summary information to enhance financial report sentiment analysis. Our framework incorporates financial bidirectional and auto-regressive transformer (FinBART), equipped with extended position encoding to summarize lengthy report articles and capture long-range interactions. To mitigate information loss, we initially divide each report into segments and then propose the hierarchical analyst sentiment representation network (ASRN), which utilizes financial bidirectional encoder representation from transformer (FinBERT), bidirectional long short-term memory (BiLSTM)-Attention, and dendrite (DD) network to fuse information in the generated summary and report segments. Notably, FinBART and FinBERT are pretrained on large-scale financial corpora to effectively understand professional expressions. Furthermore, we construct a new dataset large-scale Chinese financial report (LCFR) for the lack of supervised datasets. Experimental results on LCFR and a benchmark dataset show that SEHF significantly outperforms state-of-the-art (SOTA) baselines, and the ablation study highlights the effectiveness of aggregating sentiment information in the summary and report segments.
ISSN:2329-924X
2329-924X
2373-7476
DOI:10.1109/TCSS.2023.3323885