posted on 2022-03-02, 11:14authored byDjamal Belazzougui, Dmitry Kosolobov, Simon J. Puglisi, Rajeev Raman
The weighted ancestor problem is a well-known generalization of the predecessor problem to trees. It is known to require O(log log n) time for queries provided O(n polylog n) space is available and weights are from [0..n], where n is the number of tree nodes. However, when applied to suffix trees, the problem, surprisingly, admits an O(n)-space solution with constant query time, as was shown by Gawrychowski, Lewenstein, and Nicholson (Proc. ESA 2014). This variant of the problem can be reformulated as follows: given the suffix tree of a string s, we need a data structure that can locate in the tree any substring s[p..q] of s in O(1) time (as if one descended from the root reading s[p..q] along the way). Unfortunately, the data structure of Gawrychowski et al. has no efficient construction algorithm, limiting its wider usage as an algorithmic tool. In this paper we resolve this issue, describing a data structure for weighted ancestors in suffix trees with constant query time and a linear construction algorithm. Our solution is based on a novel approach using so-called irreducible LCP values.
Funding
Supported by the grant 20-71-00050 of the Russian Foundation of BasicResearch (for the final version of the data structure and for its construction algorithm).
History
Author affiliation
Department of Informatics
Source
CPM 2021 32nd Annual Symposium on Combinatorial Pattern Matching
Version
VoR (Version of Record)
Published in
Leibniz International Proceedings in Informatics, LIPIcs
Volume
191
Publisher
Schloss Dagstuhl – Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing