Several extensions of the Parikh matrix L-morphism
The Parikh matrix mapping is a morphism assigning to each word w over a k-letter alphabet a (k+1)×(k+1) upper triangular matrix with entries expressing the number of occurrences in w of some specific subwords. To tackle the problem of ambiguity of this mapping two new mappings have been proposed in...
Gespeichert in:
Veröffentlicht in: | Journal of computer and system sciences 2013-08, Vol.79 (5), p.658-668 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The Parikh matrix mapping is a morphism assigning to each word w over a k-letter alphabet a (k+1)×(k+1) upper triangular matrix with entries expressing the number of occurrences in w of some specific subwords. To tackle the problem of ambiguity of this mapping two new mappings have been proposed in literature, assigning to words matrices with polynomial entries (q-matrices). One is a more subtle, but still ambiguous morphism, the other is unambiguous but not a morphism. We show that the former mapping can be extended to match even a fairly general extension of the original Parikh matrix morphism. Then we introduce an unambiguous q-matrix morphism based on the same general Parikh matrix mapping. Finally, we consider the problem of incomplete information on word symbols and show that the general Parikh matrix mapping can be further extended to deal with counts of fuzzy subword occurrences.
► We construct a Parikh q-matrix L-morphism as in Egecioglu and Ibarra (IFIP 2004) for subwords from any language L. ► We show that padding the Parikh L-matrix by one row and one column allows constructing unambiguous q-matrix morphism. ► We show that fuzzy subword occurrences (e.g., in DNA sequencing) can be counted by a Parikh matrix L-morphism. |
---|---|
ISSN: | 0022-0000 1090-2724 |
DOI: | 10.1016/j.jcss.2013.01.018 |