Quantifying Behavioural Distance Between Mathematical Expressions
Existing symbolic regression methods organize the space of candidate mathematical expressions primarily based on their syntactic, structural similarity. However, this approach overlooks crucial equivalences between expressions that arise from mathematical symmetries, such as commutativity, associati...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Existing symbolic regression methods organize the space of candidate
mathematical expressions primarily based on their syntactic, structural
similarity. However, this approach overlooks crucial equivalences between
expressions that arise from mathematical symmetries, such as commutativity,
associativity, and distribution laws for arithmetic operations. Consequently,
expressions with similar errors on a given data set are apart from each other
in the search space. This leads to a rough error landscape in the search space
that efficient local, gradient-based methods cannot explore. This paper
proposes and implements a measure of a behavioral distance, BED, that clusters
together expressions with similar errors. The experimental results show that
the stochastic method for calculating BED achieves consistency with a modest
number of sampled values for evaluating the expressions. This leads to
computational efficiency comparable to the tree-based syntactic distance. Our
findings also reveal that BED significantly improves the smoothness of the
error landscape in the search space for symbolic regression. |
---|---|
DOI: | 10.48550/arxiv.2408.11515 |