An Instance-Based Approach to the Trace Reconstruction Problem
In the trace reconstruction problem, one observes the output of passing a binary string through a deletion channel times and wishes to recover from the resulting "traces." Most of the literature has focused on characterizing the hardness of this problem in terms of the number of traces needed for perfect reconstruction either in the worst case or in the average case (over input sequences ). In this paper, we propose an alternative, instance-based approach to the problem. We define the "Levenshtein difficulty" of a problem instance as the probability that the resulting traces do not provide enough information for correct recovery with full certainty. One can then try to characterize, for a specific , how needs to scale in order for the Levenshtein difficulty to go to zero, and seek reconstruction algorithms that match this scaling for each . For a class of binary strings with alternating long runs, we precisely characterize the scaling of for which the Levenshtein difficulty goes to zero. For this class, we also prove that a simple "Las Vegas algorithm" has an error probability that decays to zero with the same rate as that with which the Levenshtein difficulty tends to zero.
View on arXiv