Abstract
© Springer International Publishing Switzerland 2015. In this article, we introduce a new and simple data structure, the prefix table under Hamming distance, and present two algorithms to compute it efficiently: one asymptotically fast; the other very fast on average and in practice. Because the latter approach avoids the computation of global data structures, such as the suffix array and the longest common prefix array, it yields algorithms much faster in practice than existing methods. We show how this data structure can be used to solve two string problems of interest: (a) approximate string matching under Hamming distance; and (b) longest approximate overlap under Hamming distance. Analogously, we introduce the prefix table under edit distance, and present an efficient algorithm for its computation. In the process, we also define the border array under both distance measures, and provide an algorithm for conversion between prefix tables and border arrays.
Original language | English |
---|---|
Title of host publication | Combinatorial Algorithms |
Place of Publication | USA |
Publisher | Springer |
Pages | 49-61 |
Volume | 8986 |
ISBN (Print) | 9783319193144 |
DOIs | |
Publication status | Published - 2015 |
Event | 25th International Workshop on Combinatorial Algorithms 2014 - Duluth, United States Duration: 15 Oct 2014 → 17 Dec 2014 |
Conference
Conference | 25th International Workshop on Combinatorial Algorithms 2014 |
---|---|
Abbreviated title | IWOCA 2014 |
Country | United States |
City | Duluth |
Period | 15/10/14 → 17/12/14 |