Approximating Optimal Binary Decision Trees

We give a (ln n +1)-approximation for the decision tree (DT) problem. An instance of DT is a set of m binary tests T =( T 1 ,…, T m ) and a set of n items X =( X 1 ,…, X n ). The goal is to output a binary tree where each internal node is a test, each leaf is an item and the total external path len...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Algorithmica 2012-04, Vol.62 (3-4), p.1112-1121
Hauptverfasser:	Adler, Micah, Heeringa, Brent
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithm Analysis and Problem Complexity Algorithmics. Computability. Computer arithmetics Algorithms Applied sciences Computer Science Computer science control theory systems Computer Systems Organization and Communication Networks Data Structures and Information Theory Decision theory. Utility theory Exact sciences and technology Mathematics of Computing Operational research and scientific management Operational research. Management science Theoretical computing Theory of Computation
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	We give a (ln n +1)-approximation for the decision tree (DT) problem. An instance of DT is a set of m binary tests T =( T 1 ,…, T m ) and a set of n items X =( X 1 ,…, X n ). The goal is to output a binary tree where each internal node is a test, each leaf is an item and the total external path length of the tree is minimized. Total external path length is the sum of the depths of all the leaves in the tree. DT has a long history in computer science with applications ranging from medical diagnosis to experiment design. It also generalizes the problem of finding optimal average-case search strategies in partially ordered sets which includes several alphabetic tree problems. Our work decreases the previous best upper bound on the approximation ratio by a constant factor. We provide a new analysis of the greedy algorithm that uses a simple accounting scheme to spread the cost of a tree among pairs of items split at a particular node. We conclude by showing that our upper bound also holds for the DT problem with weighted tests.
ISSN:	0178-4617 1432-0541
DOI:	10.1007/s00453-011-9510-9