Approximating Edit Distance in Truly Subquadratic Time: Quantum and MapReduce
The edit distance between two strings is defined as the smallest number of insertions , deletions , and substitutions that need to be made to transform one of the strings to another one. Approximating edit distance in subquadratic time is “one of the biggest unsolved problems in the field of combina...
Gespeichert in:
Veröffentlicht in: | Journal of the ACM 2021-06, Vol.68 (3), p.1-41 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The
edit distance
between two strings is defined as the smallest number of
insertions
,
deletions
, and
substitutions
that need to be made to transform one of the strings to another one. Approximating edit distance in subquadratic time is “one of the biggest unsolved problems in the field of combinatorial pattern matching” [37]. Our main result is a quantum constant approximation algorithm for computing the edit distance in truly subquadratic time. More precisely, we give an
quantum algorithm that approximates the edit distance within a factor of 3. We further extend this result to an
quantum algorithm that approximates the edit distance within a larger constant factor.
Our solutions are based on a framework for approximating edit distance in parallel settings. This framework requires as black box an algorithm that computes the distances of several smaller strings all at once. For a quantum algorithm, we reduce the black box to
metric estimation
and provide efficient algorithms for approximating it. We further show that this framework enables us to approximate edit distance in distributed settings. To this end, we provide a MapReduce algorithm to approximate edit distance within a factor of
, with sublinearly many machines and sublinear memory. Also, our algorithm runs in a logarithmic number of rounds. |
---|---|
ISSN: | 0004-5411 1557-735X |
DOI: | 10.1145/3456807 |