Automatic text analysis based on transition phenomena of word occurrences
Published in: | Journal of the American Society for Information Science 1978-05, Vol.29 (3), p.121-124 |
---|---|
Main author: | |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Abstract: | A method of selecting index terms directly from a word frequency list is described. The original idea was suggested by Goffman, who reasoned that the most content‐bearing words of a given text would occur in the transition region where words of high frequency of occurrence, which follow Zipf's First Law, begin to take on the properties of words of low frequency of occurrence. The word frequencies of two articles were analyzed. The results suggest that automated selection of index terms from a frequency list holds some promise for automatic indexing. |
---|---|
ISSN: | 0002-8231 1097-4571 |
DOI: | 10.1002/asi.4630290303 |
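
The abstract does not give the formula for the transition region, so the following is only a minimal sketch under stated assumptions. It assumes the commonly cited form of the Goffman transition point, T = (-1 + sqrt(1 + 8 * I1)) / 2, where I1 is the number of distinct words that occur exactly once. The function names, the tokenizer, and the `width` parameter are hypothetical choices made for illustration and are not taken from the article.

```python
import math
import re
from collections import Counter


def transition_point(singletons: int) -> float:
    """Assumed Goffman transition point for I1 = `singletons` words occurring once."""
    return (-1 + math.sqrt(1 + 8 * singletons)) / 2


def candidate_index_terms(text: str, width: float = 1.0) -> list[str]:
    """Return words whose frequency lies within `width` of the transition point.

    `width` is a hypothetical tuning parameter; the abstract does not say
    how wide the transition region should be.
    """
    # Naive tokenizer (an assumption): lowercase runs of ASCII letters.
    counts = Counter(re.findall(r"[a-z]+", text.lower()))
    singletons = sum(1 for c in counts.values() if c == 1)
    t = transition_point(singletons)
    return sorted(w for w, c in counts.items() if abs(c - t) <= width)


if __name__ == "__main__":
    sample = "a a b b c d e"
    print(transition_point(3))                      # 2.0
    print(candidate_index_terms(sample, width=0))   # ['a', 'b']
```

In the sample, three words occur once, so the assumed transition point is 2.0 and the two words with frequency 2 are returned. A practical version would probably also filter function words before selection, which this sketch does not attempt.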