Automatic text analysis based on transition phenomena of word occurrences

Bibliographic details
Published in:	Journal of the American Society for Information Science 1978-05, Vol.29 (3), p.121-124
Main author:	Pao, Miranda Lee
Format:	Article
Language:	English
Subjects:	Automatic subject indexing; Automatic text analysis; Automation; Experiments; Frequency distribution; Frequency distributions; Frequency of occurrence; Goffman's transition of word occurrences; Indexing services; Information science; Information storage and retrieval; Information work; Law; Libraries; Mathematical techniques; R&D; Research & development; Semantics; Shelf arrangement; Statistical techniques; Subject indexing; Technical services; Terms; Text analysis; Word frequency; Word frequency distributions; Zipf's Law
Online access:	Full text

Description
Summary:	A method of selecting index terms directly from a word frequency list is described. The original idea was suggested by Goffman who reasoned that the most content‐bearing words of a given text would occur at the transition region at which Zipf's First Law of words of high frequency of occurrences begins to take on properties of words of low frequency of occurrences. Word frequencies of two articles were analyzed. Results seem to indicate that the automated selection of index terms from a frequency list holds some promise for automatic indexing.
ISSN:	0002-8231, 1097-4571
DOI:	10.1002/asi.4630290303