Automatically assessing question answering system performance across possible confidence values
A mechanism is provided in a data processing system for assessing question answering system performance. The mechanism receives question answering system results. The question answering system results comprise questions posed to the question answering system, answers returned by the question answeri...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A mechanism is provided in a data processing system for assessing question answering system performance. The mechanism receives question answering system results. The question answering system results comprise questions posed to the question answering system, answers returned by the question answering system for each question posed to the question answering system, and a confidence value for each answer. The question answering system is trained or tested using the ground truth questions and answers. The mechanism performs a matching operation comparing each question in the question answering system results to questions in the ground truth. A given question is determined to be on-topic or off-topic based on results of the matching operation. For a plurality of confidence threshold values, the mechanism determines a rightness or wrongness of each answer in the question answering system results. The mechanism generates performance statistics for the plurality of confidence threshold values based on whether each question is on-topic or off-topic and whether each answer is right or wrong. The mechanism presents the performance statistics to the user via a user interface. |
---|