MEASURING SIMILARITY OF NUMERIC CONCEPT VALUES WITHIN CORPUS
A method, a computer system, and a computer program product for measuring similarity of numeric concept values within a corpus are provided. The embodiment may include retrieving numerical values associated with a concept in a corpus. The embodiment may also include converting the numerical values t...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A method, a computer system, and a computer program product for measuring similarity of numeric concept values within a corpus are provided. The embodiment may include retrieving numerical values associated with a concept in a corpus. The embodiment may also include converting the numerical values to a standard unit. The embodiment may further include computing a distribution value of the converted numerical values. The embodiment may also include determining a tolerance value based on the distribution value, wherein the tolerance value is the maximum allowable distance between two numerical values. The embodiment may further include determining a distance function based on the determined tolerance value, wherein the distance function is defined by dividing a difference between two numerical values by the determined tolerance value. The embodiment may also include computing a similarity distance between the numerical values.
提供了一种用于测量语料库中的数字概念值的相似性的方法、计算机系统和计算机程序产品。实施例可以包括检索与语料库中的概念相关联的数值。实施例还可包括将数值转换为标准单位。该实 |
---|