Diverse Palindromic Factorization is NP-Complete

Diverse Palindromic Factorization is NP-Complete
Notice: This research summary and analysis were automatically generated using AI technology. For absolute accuracy, please refer to the [Original Paper Viewer] below or the Original ArXiv Source.

We prove that it is NP-complete to decide whether a given string can be factored into palindromes that are each unique in the factorization.


šŸ’” Research Summary

The paper investigates the computational complexity of a string factorization problem that combines two natural constraints: every factor must be a palindrome, and all factors must be distinct (i.e., the factorization must be ā€œdiverseā€). The authors prove that deciding whether a given string admits such a factorization is NP‑complete.

The authors begin by formalizing the problem. A string S of length n over an alphabet Ī£ is to be partitioned into substrings F₁,…,F_t such that (i) each F_i is a palindrome, and (ii) no two factors are identical. This ā€œdiverse palindromic factorizationā€ is a natural extension of the well‑studied palindromic length problem (minimum number of palindromes needed to cover S) and of diverse factorizations such as LZ77/LZ78 parses. The decision version—does a diverse palindromic factorization exist?—is trivially in NP because a factorization serves as a polynomial‑size certificate.

To establish NP‑hardness, the authors reduce the classic Boolean circuit satisfiability problem (Circuit‑SAT) to the diverse palindromic factorization problem. They first transform any Boolean circuit into an equivalent circuit that uses only NAND gates (each with two inputs) and ā€œsplitterā€ gates that duplicate a wire. This transformation incurs at most a logarithmic blow‑up in size and depth, preserving polynomial‑time reducibility.

Each wire of the circuit is assigned a unique symbol a. For each such symbol they introduce three related characters: a, its complement ĀÆa, and a marker x_a. Multiple copies of x_a are denoted x_a^j. The complement symbols play opposite roles in the reduction, while the x_a markers are used to enforce palindromic structure. Two additional separator symbols $ and # are employed as unique delimiters that never appear elsewhere in the construction.

The reduction proceeds inductively. Starting from an empty circuit Cā‚€, the authors build a sequence of sub‑circuits C₁,…,C_t, each obtained from the previous one by one of three elementary operations: (1) adding a new wire, (2) splitting an existing wire into two, or (3) connecting two wires to a new NAND gate. Correspondingly, they construct strings S₁,…,S_t such that S_i ā€œrepresentsā€ C_i. A string S_i represents a circuit if every truth assignment to the circuit’s inputs is encoded by some diverse palindromic factorization of S_i, and conversely every diverse palindromic factorization of S_i encodes some assignment.

The key technical device is the notion of a ā€œcomplete factorā€: a factor that appears exactly once in the factorization and is not a proper substring of any other factor. For a wire a, the presence of the complete factors (a, x_a, x_aĀÆa) signals that the wire’s logical value is true, whereas (ĀÆa, x_a, x_a a) signals false. The construction guarantees that in any diverse palindromic factorization exactly one of these two triples can appear as complete factors, thereby faithfully representing the Boolean value of the wire.

For each elementary circuit operation the authors describe how to extend the current string:

  • Adding a wire: they append the block ā€œ$ # x_a a x_aĀÆaā€ (or its complement) to encode the two possible truth values of the new wire.
  • Splitting a wire: they insert a more elaborate pattern involving new symbols b′, c′ (the two split wires) together with several copies of x_a, ensuring that a factorization must choose either the ā€œb true, c falseā€ or the opposite configuration, mirroring the logical split.
  • Adding a NAND gate: they embed a pattern that forces the output wire’s complete factor to be consistent with the NAND of its two input wires’ values.

Each step preserves the property that any diverse palindromic factorization of the resulting string corresponds bijectively to a truth assignment of the extended circuit. Lemma 3 formalizes the ability to perform each step in constant time relative to the current string length, yielding an overall linear‑time reduction from Circuit‑SAT to the string problem.

Finally, to obtain a string whose factorization exists iff the original circuit’s output is true, the authors append ā€œ$ # x_o a_o x_oĀÆa_oā€ where a_o is the output wire’s label. Since $ and # are unique separators, they must appear as complete factors in any factorization, forcing the final block to be interpreted exactly as described. Consequently, the constructed string has a diverse palindromic factorization if and only if the circuit is satisfiable.

The paper also extends the result in two directions:

  1. k‑diverse factorization: for any fixed integer k ≄ 1, deciding whether a string admits a factorization where each palindrome appears at most k times remains NP‑complete. The reduction is adapted by limiting the multiplicity of the x_a markers.
  2. Binary alphabet restriction: the authors show that the problem stays NP‑complete even when the input alphabet is limited to two symbols (0/1). This is achieved by encoding each of the previously used symbols with short binary strings while preserving the palindrome and uniqueness constraints.

In summary, the authors provide a clean, linear‑time many‑one reduction from Boolean circuit satisfiability to the diverse palindromic factorization problem, establishing its NP‑completeness. The work highlights that seemingly simple string‑decomposition constraints can encode full logical reasoning, and it opens avenues for further research on approximation algorithms, parameterized complexity (e.g., bounded alphabet size, bounded k), and practical implications for string compression and data indexing where uniqueness and palindromicity might be desirable.


Comments & Academic Discussion

Loading comments...

Leave a Comment