70
v1v2 (latest)

An Instance-Based Approach to the Trace Reconstruction Problem

Annual Conference on Information Sciences and Systems (CISS), 2024
Abstract

In the trace reconstruction problem, one observes the output of passing a binary string s{0,1}ns \in \{0,1\}^n through a deletion channel TT times and wishes to recover ss from the resulting TT "traces." Most of the literature has focused on characterizing the hardness of this problem in terms of the number of traces TT needed for perfect reconstruction either in the worst case or in the average case (over input sequences ss). In this paper, we propose an alternative, instance-based approach to the problem. We define the "Levenshtein difficulty" of a problem instance (s,T)(s,T) as the probability that the resulting traces do not provide enough information for correct recovery with full certainty. One can then try to characterize, for a specific ss, how TT needs to scale in order for the Levenshtein difficulty to go to zero, and seek reconstruction algorithms that match this scaling for each ss. For a class of binary strings with alternating long runs, we precisely characterize the scaling of TT for which the Levenshtein difficulty goes to zero. For this class, we also prove that a simple "Las Vegas algorithm" has an error probability that decays to zero with the same rate as that with which the Levenshtein difficulty tends to zero.

View on arXiv
Comments on this paper