An extraction method of time-series numerical data from press releases by using co-occurrence conditions of numbers and time stamps related to target business keywords

Bibliographic details
Published in: Electronics and Communications in Japan, 2010-03, Vol. 93 (3), p. 61-69
Main authors: Samejima, Masaki, Gen, Mayu, Nakazaki, Takeshi, Akiyoshi, Masanori, Komoda, Norihisa
Format: Article
Language: English
Description
Summary: A method of extracting time-series data on specific business matters from press releases of a company is proposed. In such press releases, numerical data on specific matters often co‐occur with specific words. From this viewpoint, when keywords are input, the proposed system extracts a set of numerical data and the time stamp that co‐occurs with them. The proposed system extracts numerical data placed close to the keywords by using the word distance, and time stamps are extracted in the same manner. The representation of the extracted data is unified for visualization. The results of applying this method to press releases consisting of several kinds of documents confirm that the accuracy of the extracted numerical data is sufficient to support business analysis. © 2010 Wiley Periodicals, Inc. Electron Comm Jpn, 93(3): 61–69, 2010; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecj.10011
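The word-distance idea in the summary can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record gives no details on tokenization, the distance threshold, or the time-stamp formats, so the names `tokenize`, `extract_series`, and `max_distance`, the regular expressions, and the rule that any four-digit token starting with 19 or 20 is a year are all assumptions made for illustration.

```python
import re

# Hypothetical patterns: a numeric token, and a four-digit year used as the time stamp.
TOKEN = re.compile(r"\d+(?:,\d{3})*(?:\.\d+)?|[A-Za-z]+")
YEAR = re.compile(r"(?:19|20)\d{2}")
NUMBER = re.compile(r"\d+(?:,\d{3})*(?:\.\d+)?")


def tokenize(text):
    """Split text into word and number tokens, dropping punctuation."""
    return TOKEN.findall(text)


def nearest(anchor, candidates):
    """Return the candidate index with the smallest word distance to anchor, or None."""
    if not candidates:
        return None
    return min(candidates, key=lambda i: abs(i - anchor))


def extract_series(text, keyword, max_distance=8):
    """Pair each keyword occurrence with its nearest number and nearest year.

    Word distance is the absolute difference of token positions. Candidates
    farther than max_distance tokens from the keyword are ignored.
    """
    tokens = tokenize(text)
    years = {i for i, t in enumerate(tokens) if YEAR.fullmatch(t)}
    # Simplification: a token that looks like a year is never treated as a value.
    numbers = {i for i, t in enumerate(tokens)
               if NUMBER.fullmatch(t) and i not in years}

    series = []
    for k, tok in enumerate(tokens):
        if tok.lower() != keyword.lower():
            continue
        n = nearest(k, [i for i in numbers if abs(i - k) <= max_distance])
        y = nearest(k, [i for i in years if abs(i - k) <= max_distance])
        if n is not None and y is not None:
            # Unify the representation: year as string, value as float.
            series.append((tokens[y], float(tokens[n].replace(",", ""))))
    return series


if __name__ == "__main__":
    sample = ("In fiscal 2009, shipments reached 1,200 units. "
              "In fiscal 2008, shipments were 950 units.")
    print(extract_series(sample, "shipments"))
```

Converting every value to a float and every time stamp to a plain year string stands in for the unification step mentioned in the summary; a real system would also need to handle units, multipliers, and richer date expressions.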
ISSN: 1942-9533, 1942-9541
DOI: 10.1002/ecj.10011