MEASURING SIMILARITY OF NUMERIC CONCEPT VALUES WITHIN CORPUS

A method, a computer system, and a computer program product for measuring similarity of numeric concept values within a corpus are provided. The embodiment may include retrieving numerical values associated with a concept in a corpus. The embodiment may also include converting the numerical values t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: MCCOY TYRA ALEXA, CHRISTIANSON KYLE G, ERPENBACH ERIC L, KAIRIS KATHERINE A
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A method, a computer system, and a computer program product for measuring similarity of numeric concept values within a corpus are provided. The embodiment may include retrieving numerical values associated with a concept in a corpus. The embodiment may also include converting the numerical values to a standard unit. The embodiment may further include computing a distribution value of the converted numerical values. The embodiment may also include determining a tolerance value based on the distribution value, wherein the tolerance value is the maximum allowable distance between two numerical values. The embodiment may further include determining a distance function based on the determined tolerance value, wherein the distance function is defined by dividing a difference between two numerical values by the determined tolerance value. The embodiment may also include computing a similarity distance between the numerical values. 提供了一种用于测量语料库中的数字概念值的相似性的方法、计算机系统和计算机程序产品。实施例可以包括检索与语料库中的概念相关联的数值。实施例还可包括将数值转换为标准单位。该实