Generalized Post Embedding Problems
The Regular Post Embedding Problem extended with partial (co)directness is shown decidable. This extends to universal and/or counting versions. It is also shown that combining directness and codirectness in Post Embedding problems leads to undecidability.
š” Research Summary
The paper investigates extensions of the Regular Post Embedding Problem (PEP), a classic decision problem that asks, given two morphisms u,āÆvāÆ:āÆĪ£*āÆāāÆĪ* and a regular language RāÆāāÆĪ£*, whether there exists a word ĻāÆāāÆR such that u(Ļ) is a scattered subword of v(Ļ) (denoted u(Ļ)āÆāāÆv(Ļ)). PEP is known to be decidable but extremely hard (F_{Ļ^Ļ}ācomplete).
The authors introduce two new variants that incorporate partial directness or partial codirectness. In PEPāÆpartialāÆdir, a second regular language Rā²āÆāāÆĪ£* specifies which prefixes of a candidate solution must satisfy the embedding condition; all other prefixes are unrestricted. Dually, PEPāÆpartialāÆcoādir requires the condition only for suffixes belonging to Rā². Setting Rā²āÆ=āÆā recovers the original PEP, while Rā²āÆ=āÆĪ£* yields the previously studied PEPāÆdir (every prefix must embed). Thus the new problems strictly generalize both PEP and PEPāÆdir.
The core technical contribution is a decidability proof for both PEPāÆpartialāÆdir and PEPāÆpartialāÆcoādir. The proof proceeds by showing that any solution can be transformed into a short solution whose length is bounded by a computable function of the input. The bound is obtained through a combination of three ingredients:
-
Myhillācongruence indices µ(R) and µ(Rā²). Since R and Rā² are regular, their Myhillācongruences have finite index; the product µ(R)·µ(Rā²) bounds the number of equivalence classes that suffixes of a candidate solution can fall into.
-
Higmanās Lemma (the wellāquasiāordering of words under the subword relation). This guarantees that any infinite sequence of suffixes must contain an increasing pair, which in turn yields a ācutting pointā where a portion of the solution can be removed without breaking the embedding property.
-
LengthāFunction Theorem for controlled bad sequences. By defining a kācontrolled sequence (where the length of the iāth word is at most iĀ·k) and an nābad sequence (one that contains no increasing subsequence of length n), the theorem provides a computable function H(n,āÆk,āÆĪ) that bounds the maximal length of an nābad kācontrolled sequence.
The authors set k to the maximal expansion factor K_uāÆ=āÆmax_{aāĪ£}|u(a)| and n to µ(R)·µ(Rā²)+1. The resulting bound LāÆ=āÆH(n,āÆK_u,āÆĪ) is effectively computable. They then prove two ācutting lemmasā:
- LemmaāÆ6 (blue indices) ā If two positions aāÆ<āÆb are congruent, both satisfy u(i,N)āÆāāÆv(i,N) (called blue), and the leftāmargin string at a is a subword of the leftāmargin at b, then the segment Ļ
Comments & Academic Discussion
Loading comments...
Leave a Comment