Next Article in Journal
An Interferometric Method for Particle Mass Measurements
Previous Article in Journal
Applications of the q-Srivastava-Attiya Operator Involving a Certain Family of Bi-Univalent Functions Associated with the Horadam Polynomials
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Phase Transition Analysis for the Random Regular Exact 2-(d, k)-SAT Problem

1
College of Computer Science and Technology, Guizhou University, Guiyang 550025, China
2
Department of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China
3
Ningxia Key Laboratory of Intelligent Information and Big Data Processing, Yinchuan 750021, China
*
Author to whom correspondence should be addressed.
Symmetry 2021, 13(7), 1231; https://doi.org/10.3390/sym13071231
Submission received: 14 May 2021 / Revised: 1 July 2021 / Accepted: 8 July 2021 / Published: 8 July 2021
(This article belongs to the Section Computer)

Abstract

:
In a regular ( d , k )-CNF formula, each clause has length k and each variable appears d times. A regular structure such as this is symmetric, and the satisfiability problem of this symmetric structure is called the ( d , k ) -SAT problem for short. The regular exact 2-( d , k )-SAT problem is that for a ( d , k )-CNF formula F, if there is a truth assignment T, then exactly two literals of each clause in F are true. If the formula F contains only positive or negative literals, then there is a satisfiable assignment T with a size of 2 n / k such that F is 2-exactly satisfiable. This paper introduces the ( d , k )-SAT instance generation model, constructs the solution space, and employs the method of the first and second moments to present the phase transition point d * of the 2-( d , k )-SAT instance with only positive literals. When d < d * , the 2-( d , k )-SAT instance can be satisfied with high probability. When d > d * , the 2-( d , k )-SAT instance can not be satisfied with high probability. Finally, the verification results demonstrate that the theoretical results are consistent with the experimental results.

1. Introduction

The satisfiability (SAT) problem of Boolean formula mainly investigates whether there is a satisfiable truth assignment for a given Conjunctive Normal Form (CNF) formula. As the first NP-complete problem to be proved, the SAT problem has been deeply studied for a long time. Various SAT problems with regular structure have been attracting increasing attention from scholars, such as limiting the length of clauses or the number of variables appearing. The SAT problem that limits the length of each clause to k is called a k-SAT problem. It has been proven that the k-SAT problem was hard to solve in the “worst” case. In recent years, some scholars have begun to study the SAT problem with symmetric structure. In this type of SAT problem, the number of occurrences of each variable is limited to d and the length of each clause is k. The results show that any k-SAT instance can be reduced to a ( d , k )-SAT instance of this symmetric structure [1,2]. For the ( d , k )-SAT problem, if there is a set of variable assignments such that exactly two literals in each clause of the ( d , k )-SAT instance are true, such problem is called the regular exact 2-( d , k )-SAT problem. This problem is still hard to solve in the “worst” case, and the study of it is helpful to further analyze other SAT problems with symmetric structure.
Studying SAT problems in some specific situations will help people to discover the general mathematical phenomena of SAT problems and design more effective algorithms for solving SAT problems. Currently, more studies focus on the random k-SAT problem. Experimental verification and theoretical analysis of this problem have shown that the clause constraint density α (the ratio of the number of clauses m to the number of variables n) of the CNF formula was an important parameter affecting the satisfiability and the difficulty of solving the formula [3,4,5,6]. It has been found that there was a critical value α k associated with k. When α < α k , the random k-SAT problem instance could be satisfied with a high probability. When α > α k , the random k-SAT problem instance could not be satisfied with a high probability [3]. This sudden change from satisfied to unsatisfied was called the phase transition phenomenon of the k-SAT problem, and the corresponding critical value α k was called the phase transition point. Additionally, it was also found that when α was far away from the phase transition point α k , the random k-SAT problem was easy to solve. When α was close to the phase transition point α k , the random k-SAT problem was challenging to solve. In the random 3-SAT problem, the research revealed that α k was at least 3.52 [7] and at most 4.4898 [8]. In addition, the instances that most excellent SAT solvers could solve were closely related to α . For instance, the DPLL algorithm was applicable to solve pure text random 3-SAT instances with constraint density α < 1.6 , the Zchaff algorithm was suitable for dealing with random 3-SAT instances with clause constraint density α < 3.52 that had the most positive or negative literals, WalkSAT was suitable for solving random 3-SAT instances with clause constraint density α slightly higher than 3.95 [9], the Belief Propagation algorithm was suitable for random 3-SAT instances with clause constraint density α < 3.95 , and the Survey Propagation algorithm was suitable for solving clause constraint density α < 4.26 random 3-SAT instances [10]. However, all symmetric ( d , k )-SAT problems have the same clause constraint density, because in such formulas, each variable is copied d times to form the d n variable junction points, each clause is copied k times to form the k m clause nodes, and d n = k m . Obviously, it is not enough to characterize the satisfiability and difficulty of such a regular structure SAT problem only by the constraint density of clauses. To better investigate the difficulty of solving such SAT problems, new parameters should be introduced to characterize the structural features of the formulas.
The phase transition analysis of the SAT problem with symmetric structure has obtained abundant research results. Boufkhad et al. firstly proposed the regular ( d , k )-SAT problem with this symmetric structure and pointed out that the satisfiable phase transition point of the random regular ( 3 , k )-SAT problem was 2.46 α k , 3 3.78 in [11]. On this basis, Rathi et al. obtained the upper and lower bounds of the phase transition points of the problem by using the method of first and second moments [12]. The authors provided the phase transition point of the random regular NAE-SAT problem and obtained a phase transition control parameter concerning parameter d [13]. Achlioptas et al. studied the phase transition of 1-in-k-SAT and NAE-3-SAT problems, and found that the upper and lower bounds of NAE-3-SAT were 1.514 < c 3 , N A E < 2.215 [14]. Then, the phase transition of the NAE-k-SAT problem was provided in [15]. The author proposed the random regular exact cover problem (exactly one in every clause is true), studied its satisfiable phase transition, and proved it by using the subgraph method [16]. Further, the phase transition of the exact problem was provided by using the method of differential equation [17]. Inspired by the exact problem, we propose to study the phase transition of the regular exact 2-( d , k )-SAT problem, and find the phase transition control parameters for the problem.
This article introduces s-exact satisfiability. For a ( d , k )-CNF formula F, if it contains only positive or negative literals, there is a satisfiable assignment T of size s n / k , such that F is s-exactly satisfiable. Then, we introduce the generation model of ( d , k )-SAT instance, construct the solution space, and employ the method of the first-order moment and the second-order moment to present the phase transition point d * of the 2-( d , k )-SAT problem. Specifically, let k 3 , for a positive integer d, d * = 2 ln ( 2 / k ) + ( k 2 ) ln ( 1 2 / k ) / ln ( 1 1 / k ) + k 2 ln ( 1 2 / k ) . When d < d * , the problem is satisfiable with high probability. When d > d * , the problem is unsatisfiable with high probability. Since the phase transition point d * is a function of the clause length k, we take the three cases of k = 4 , k = 6 , and k = 7 respectively, and choose ( d , k )-SAT instance with a scale of n = 10 and n = 20 to conduct experiments. The verification results demonstrate that the theoretical results are in agreement with the experimental results.

2. Preliminaries

2.1. Random Regular Exact 2-( d , k )-SAT Problem

A literal is a propositional variable x or a negation of a propositional variable ¬ x . A clause C is the disjunction of several literals ( C = ( L 1 L k ) ), the clause C can be regarded as the set of literals ( C = { L 1 , , L k } ), k denotes the length of the clause, and V a r C j represents the set of variables in a clause C j . A Conjunction Normal Form (CNF) formula F is the conjunction of several clauses ( F = ( C 1 C m ) ), and the formula can be regarded as a set of clauses F = { C 1 , , C m } , or a clause table F = [ C 1 , , C m ] , where m is the size of the formula and var ( F ) represents the set of variables appearing in formula F. # var ( F ) and # c l ( F ) are the number of variables and clauses appearing in formula F, respectively. The number of occurrences of a variable x refers to the sum of the corresponding positive and negative literals, denoted as o c c s ( x , F ) , using p o s ( x , F ) and n e g ( x , F ) respectively, to represent the positive and negative occurrence times of the variable x in F, and signify o c c ( x , F ) = p o s ( x , F ) + n e g ( x , F ) . A regular ( d , k )-CNF formula means that in the CNF formula F, each variable occurs exactly d times, and the length of each clause is exactly k. The regular structure is symmetric. In the current formula class, there must be k m = d n .
For any CNF formula F, if there is a truth assignment on the set of variables x 1 , x 2 , , x n that makes the value of formula F be TRUE, then F is satisfiable (SAT). If the value of F is FALSE due to 2 n different assignments of n Boolean variables, then formula F is unsatisfiable (UNSAT). The satisfiability problem of symmetric structures is called the ( d , k ) -SAT problem.
Definition 1.
Let F be a ( d , k )-CNF formula with n variables x 1 , x 2 , , x n , and m clauses C 1 , C 2 , , C m . It is said that formula F is s-exactly satisfiable. If there is a truth value assignment T, each clause C j has exactly s literals to be true, where 1 s < k .
Note: a formula can be satisfied, but it does not have to be s-exactly satisfied.
Suppose F is a ( d , k )-CNF formula with only positive literals or only negative literals and T is a satisfying assignment, for each clause C j , if T C j = s , then d T = m k m ( k s ) = s m = s d · n / k . Thus, T = s n / k . According to the above derivation, for any ( d , k )-CNF formula F containing only positive literals or only negative literals, if F is s-exactly satisfiable, then F must have a satisfiable assignment with a size of s n / k . However, for the ( d , k )-CNF formula with both positive and negative literals, there is no such property.
Theorem 1.
Let the formula F be a regular ( d , k )-CNF formula with n variables and m clauses. If F contains only positive or negative literals, then there is a satisfiable assignment T of size s n / k , such that F is s-exactly satisfiable.
Proof. 
Since the formula F is a regular ( d , k )-CNF formula, each variable appears d times, and the length of each clause is k. Let a satisfiable assignment T = x 1 , x 2 , , x s n / k . Since k m = d n , then d T = d · s n / k = s · d n / k = s · m k / k = s m . Therefore, we take all s n / k variables as positive literals or negative literals, and equally distribute them to m clauses. Thus, the formula F is s-exactly satisfiable. □
The following figure (Figure 1) is a regular (4,3)-CNF formula, where n = 6 and m = 8 . The formula is 2-exactly satisfiable, and T = x 1 , x 2 , x 3 , x 5 is a satisfiable assignment of it.
Therefore, we discuss the 2-exactly satisfiable problem of the ( d , k )-CNF formula with only positive literals, in order to further analyze and understand the degree of difficulty and phase transition law of the s-( d , k )-SAT problem.

2.2. Random Regular ( d , k )-SAT Instance Generation Model

Let the set of variables V = x 1 , x 2 , , x n , and copy each variable in V d times to produce the following variable matrix:
V * = x 1 , 1 x 1 , 2 x 1 , d x 2 , 1 x 2 , 2 x 2 , d x n , 1 x n , 2 x n , d .
Let the CNF formula F = ( C 1 C m ) , and copy each clause in F k times to produce the following matrix, which is called the position matrix of argument placement (short for position matrix):
F * = C 1 , 1 C 1 , 2 C 1 , k C 2 , 1 C 2 , 2 C 2 , k C m , 1 C m , 2 C m , k .
Note: for a regular ( d , k )-CNF formula, k m = d n .
All permutations on the set 1 , 2 , , m k constitute the sample space, and there are m k ! permutations in total. Randomly take a permutation π from it, corresponding to the generation of a regular ( d , k ) -bipartite graph G = U , V , E , where the relationship between the edges is as follows: π x i , p = C j , q x i C j . Since a permutation determines an instance, the size of the formula instance space is m k ! .

2.3. Phase Change Analysis Technique

This section mainly introduces phase transitions and some methods used in phase transitions.
Definition 2.
Suppose a model M, a property Q, and a probability function p : N 0 , 1 , if there are two probability functions satisfying the following conditions [18]:
(1) 
When lim n p 0 ( n ) / p ( n ) = 0 , almost certainly model M does not have property Q.
(2) 
When lim n p ( n ) / p ( n ) = , almost certainly model M has property Q.
Then, the property Q has a phase change, and the function is called criticality (or threshold).
Suppose Q is a property with a phase transition property, then the probability function p : N 0 , 1 is called strict criticality. If for any c < 1 , for almost all n, model M has no property Q. For any c > 1 , for almost all n, the model M has property Q.
In the proof of the phase transition phenomenon of the SAT problem, the first moment method is usually used to demonstrate that the SAT problem can satisfy the upper bound of the critical value. At the same time, the second moment method is used to prove that the SAT problem can satisfy the lower bound of the critical value.
The first moment method is a simple application of Markov’s inequality.
Theorem 2.
(Markov’s inequality [19]): Let Z be a random variable whose value is a non-negative integer, then for any real number, ν > 0 :
Pr ( Z ν ) E Z ν .
It is of note that the value of Z is only a non-negative integer. According to Markov’s inequality, the probability of a random variable Z 1 (i.e., Z > 0 ) is at most E ( Z ) , that is, Pr ( Z 1 ) E Z .
The second moment method is an inequality which is directly derived from Chebyshev’s inequality.
Theorem 3.
(The second moment method [19]): Let Z be a random variable of non-negative integer value only, then:
Pr ( Z = 0 ) E Z 2 E Z 2 .

3. Main Results

The phase transition phenomenon is one of the important characteristics of SAT problems. The study of this phenomenon will help to further understand the properties of SAT problems and design more efficient algorithms. In this section, the phase transition points of the 2-( d , k )-SAT problem with symmetric structure are obtained by using the method of first and second moments.
Theorem 4.
Suppose F is a random regular exact 2-( d , k )-SAT instance, and
d * = 2 ln ( 2 k ) + ( k 2 ) ln ( 1 2 k ) ln ( 1 1 k ) + ( k 2 ) ln ( 1 2 k ) .
When k 3 , for any positive integer d:
lim n Pr F i s s a t i s f i a b l e = 1 d < d * 0 d > d * .
Theorem 4 shows that for a 2-( d , k )-SAT instance F, when k 3 , if d < d * , then the high probability of F is satisfied. If d > d * , then the high probability of F is not satisfied. Theorem 4 can be proved by the following two lemmas.
Lemma 1.
Suppose F is a random regular exact 2-( d , k )-SAT instance. When k 3 , if d > d * , F is unsatisfied with high probability.
Proof. 
Let F = C 1 , C 2 , , C m be a regular ( d , k )-SAT instance, let V = x 1 , x 2 , , x n be a set of variables of instance F, and let T be a truth assignment on the set of variables V. Each clause C j in F satisfies | T C j | = 2 . Therefore, for each clause C j , there are k 2 variables that are not entered in T. Since each variable occurs d times and k m = d n , then d T = m k m k 2 = 2 m = 2 d n / k . Thus, T = 2 n / k .
Let T = x 1 , x 2 , , x 2 n / k , copy d times, and generate the following matrix:
T = x 1 , 1 x 1 , 2 x 1 , d x 2 , 1 x 2 , 2 x 2 , d x 2 n k , 1 x 2 n k , 2 x 2 n k , d .
There are 2 n k · d = 2 m variables in the matrix T , and the 2 m variables are divided into m groups with 2 elements in each group. There are C 2 m 2 · C 2 m 2 2 C 4 2 · C 2 2 / m ! = 2 m ! / m ! · 2 m possibilities for such a division. Since there are k positions in each row of formula F, there are 2 m ! / m ! · 2 m · m ! · k k 1 m / 2 m substitutions by replacing any two positions with the variables in matrix T . There are ( k 2 ) empty positions in each row of formula F after the substitution, which can be replaced by other remaining variables in V. There are [ m ( k 2 ) ] ! possibilities for such a substitution. Therefore, the number of formulas that T satisfies is 2 m ! / 2 m · k k 1 m / 2 m · m k 2 ! in total. Thus, the number satisfying the formula in the instance space is C n 2 n 2 n k k · 2 m ! / 2 m · k k 1 m / 2 m · [ m ( k 2 ) ] ! .
Assuming that the instance F is uniformly and randomly selected in the instance space which the size is m k ! , and the random variable Z ( n ) is used to represent the number of satisfying solutions of F, the mathematical expectation of Z ( n ) is expressed as follows:
E [ Z ( n ) ] = C n 2 n / k · ( 2 m ) ! 2 m · [ k ( k 1 ) ] m 2 m · [ m ( k 2 ) ] ! ( m k ) ! .
According to the first moment, the task now is to prove that lim n E [ Z ( n ) ] = 0 . Firstly, E [ Z ( n ) ] is simplified by the combined approximate formula: C n α n 1 / 2 π n α 1 α 1 / 2 exp n H α , ( 0 < α < 1 ) .
E [ Z ( n ) ] = C n 2 n k · ( 2 m ) ! 2 m · ( k ( k 1 ) ) m 2 m · ( m ( k 2 ) ) ! ( m k ) ! = k m · ( k 1 ) m 2 2 m · C n 2 n k C m k 2 m = 2 2 m · k m · ( k 1 ) m · 2 π m k 2 k 1 2 k 2 π n 2 k 1 2 k · e n m k H ( 2 k ) = d · e m ln k + m ln k 1 2 m ln 2 + n m k H ( 2 k ) = d · e n d k ln k + d k ln k 1 2 d k ln 2 + 1 d H 2 k = d e n φ d ,
where φ ( d ) = d / k ln k + d / k ln k 1 2 d / k ln 2 + 1 d H 2 / k and H ( α ) = α ln ( α ) ( 1 α ) ln ( 1 α ) is the entropy function.
To further simplify the processing of φ ( d ) :
φ ( d ) = d k ln k + d k ln k 1 2 d k ln 2 + 1 d H 2 k = d k ln k + ln ( k 1 ) 2 ln 2 + 2 ln ( 2 k ) + ( k 2 ) ln ( 1 2 k ) + H ( 2 k ) = d 1 k ln ( 1 1 k ) + k 2 k ln ( 1 2 k ) + H ( 2 k ) .
Obviously, when k 3 , the function is a monotonically decreasing function of d, and has a unique zero:
d * = 2 ln ( 2 k ) + ( k 2 ) ln ( 1 2 k ) ln ( 1 1 k ) + ( k 2 ) ln ( 1 2 k ) .
Therefore, when k 3 , d > d * , lim n E [ Z ( n ) ] = 0 . Then:
Pr n [ Z ( n ) 1 ] = 0 .
In other words, when k 3 , d > d * , the regular exact 2-( d , k )-SAT instances are unsatisfied with high probability. □
Lemma 2.
Let F be a random regular exact 2-( d , k )-SAT instance. When k 3 , if d < d * , then F can be satisfied with high probability.
Proof. 
The following is mainly performed to calculate the value of E [ Z 2 ( n ) ] . Suppose F is a satisfiable instance and has a satisfiable assignment set T 1 , T 2 , , T r with r elements. Suppose a pair of assignments ( T p , T q ) satisfy F, if and only if T p and T q satisfy F. Consider the set of assignment pairs { ( T p , T q ) : 1 p , q r } on T 1 , T 2 , , T r ,, and the number of assignment pairs is r 2 . Assume T p q = ( T p , T q ) , then, the assignment pair can be expressed as the following matrix:
T * = T 11 T 12 T 1 r T 21 T 22 T 2 r T r 1 T r 2 T r r .
For the ( d , k )-CNF formula, the size of any 2-exact satisfiable assignment T is 2 n 2 n k k . The matrix T is obtained by copying the variables in T d times. We write the matrix as set T = { x i , j : 1 i 2 n / k , 1 j d } . By pairing 2 n / k · d = 2 m elements in T , the set of element pairs T is constructed, and the size of the set T is m. Such set has C 2 m 2 · C 2 m 2 2 C 4 2 · C 2 2 = ( 2 m ) ! / m ! · 2 m .
For any assignment pair ( T p , T q ) , the relationship is as follows:
T p T q = T q T p , 0 T p T q 2 n k , 0 T p T q 2 n k .
Therefore, the size of the set difference can be used to divide the set { ( T p , T q ) | 1 p , q r } . When T p T q = T l T h = λ , ( T p , T q ) and ( T l , T h ) are in the same class, denoted as:
P λ = { ( T p , T q ) : | T p T q | = λ } .
It can be seen that P 0 , P 1 , , P 2 n / k forms a partition of { ( T p , T q ) | 1 p , q r } and has:
P 0 + P 1 + + P 2 n / k = r 2 .
In a partition block P λ , T p T q = T q T p = λ , | T p T q | = 2 n / k λ , the total number of variables involved is 2 n / k + λ . Introduce a normalization parameter: 0 ω 1 , and express the sum as an integral. Mark the division as follows:
Q ω = P 2 ω n k = { ( T p , T q ) : | T p T q | = 2 ω n k } .
The set pair ( T p , T q ) in the Q ω block satisfies the following conditions:
| T p T q | = | T q T p | = 2 ω n k , | T p T q | = 2 ( 1 ω ) n k , | T p T q | = 2 ( 1 + ω ) n k .
Taking a pair of satisfiable assignments as an example, the number of satisfiable formulas is calculated by the cross-assignment method. Let T p = { x 1 , , x 2 ω n 2 ω n k k , x 2 ω n 2 ω n k k + 1 , , x 2 n / k } and T q = { x 2 ω n / k + 1 , , x 2 n / k , x 2 n / k + 1 , ,   x 2 ( 1 + ω ) n / k } . Then, T p T q = { x 1 , , x 2 ω n 2 ω n k k } , T p T q = { x 2 ω n / k + 1 , , x 2 n / k } , T q T p = { x 2 n / k + 1 , , x 2 ( 1 + ω ) n / k } and T p T q ¯ = { x 2 ( 1 + ω ) n / k + 1 , , x n } . After the elements of these four parts are copied d times, the matrices A, B, C, and D are constructed:
A = x 1 , 1 x 1 , 2 x 1 , d x 2 , 1 x 2 , 2 x 2 , d x 2 ω n k , 1 x 2 ω n k , 2 x 2 ω n k , d , D = x 2 1 + ω n k + 1 , 1 x 2 1 + ω n k + 1 , 2 x 2 1 + ω n k + 1 , d x 2 1 + ω n k + 2 , 1 x 2 1 + ω n k + 2 , 2 x 2 1 + ω n k + 2 , d x n , 1 x n , 2 x n , d .
B = x 2 ω n k + 1 , 1 x 2 ω n k + 1 , 2 x 2 ω n k + 1 , d x 2 ω n k + 2 , 1 x 2 ω n k + 2 , 2 x 2 ω n k + 2 , d x 2 n k , 1 x 2 n k , 2 x 2 n k , d , C = x 2 n k + 1 , 1 x 2 n k + 1 , 2 x 2 n k + 1 , d x 2 n k + 2 , 1 x 2 n k + 2 , 2 x 2 n k + 2 , d x 2 1 + ω n k , 1 x 2 1 + ω n k , 2 x 2 1 + ω n k , d .
For each clause C j in ( d , k )-CNF instance F, there is | C j T | = 2 . According to the cross-assignment method, there are several cases, as follows:
(1) The matrix A matches ω n / k · d = ω m clauses in m clauses, corresponds to ω m rows in the position matrix, and occupies two positions in each row.
(2) The matrix B matches ( 1 ω ) m clauses in m clauses, corresponds to ( 1 ω ) m rows in the position matrix, and occupies two positions in each row.
(3) The matrix C matches ω n / k · d = ω m clauses in m clauses, corresponds to ω m rows in the position matrix, and occupies two positions in each row.
For cases (1) and (3), which are jointly allocated to each row of the ω m row and match the four positions in the position matrix, there are a total of ( ω m ) ! · C k 2 ω m · ( ω m ) ! · C k 2 2 ω m matches. Since | C j T | = 2 , in case (2), if the two elements in { x i , x j } are both in T p T q , then there are a total of ( 1 ω ) m ! · C k 2 ( 1 ω ) m matches. If one of the elements in { x i , x j } is in T p T q , then there are a total of ( 1 ω ) m ! · C k 2 ( 1 ω ) m · k 2 ( 1 ω ) m matches. Furthermore, the variables { x i , x j } , both in T p T q and one of them in T p T q , are two mutually exclusive events. Therefore, in case (2), there are at least ( 1 ω ) m ! · C k 2 ( 1 ω ) m types of matches, and at most ( 1 ω ) m ! · C k 2 ( 1 ω ) m · k 2 ( 1 ω ) m types of matches. Through the above-mentioned matching, at most, ( 1 ω ) m · ( k 2 ) + ω m · ( k 4 ) = m ( k 2 2 ω ) positions remain unmatched. Thus, there are m ( k 2 2 ω ) ! matches in total.
For each satisfied assignment T p , it can construct 2 m ! / m ! · 2 m elements. Thus, the formula can be satisfied as follows:
( 2 m ) ! m ! · 2 m · ( 1 ω ) m ! · C k 2 ( 1 ω ) m · k 2 ( 1 ω ) m · ( ω m ) ! · C k 2 ω m · ( ω m ) ! · C k 2 2 ω m · m ( k 2 2 ω ) ! .
Therefore, for a given assignment pair ( T p , T q ) , the total number of formulas that ( T p , T q ) can satisfy is:
C m ω m · ( 2 m ) ! m ! · 2 m · ( 1 ω ) m ! · C k 2 ( 1 ω ) m · k 2 ( 1 ω ) m · ( ω m ) ! · C k 2 ω m · ( ω m ) ! · C k 2 2 ω m · m ( k 2 2 ω ) ! = ( 2 m ) ! m ! · 2 m · m ! · ( ω m ) ! · m ( k 2 2 ω ) ! · k m ( k 1 ) m 2 m · ( k 2 ) ω m ( k 3 ) ω m 2 ω m · ( k 2 ) ( 1 ω ) m = ( 2 m ) ! · ( ω m ) ! · m ( k 2 2 ω ) ! · k m ( k 1 ) m ( k 2 ) m ( k 3 ) ω m 2 ( 2 + ω ) m .
So, the maximum number of formulas satisfied by the assignment pair in Q ω is:
| Q ω | ( 2 m ) ! · ( ω m ) ! · m ( k 2 2 ω ) ! · k m ( k 1 ) m ( k 2 ) m ( k 3 ) ω m 2 ( 2 + ω ) m .
Specify a set S of size 2 n / k , and let S and S satisfy the following conditions:
| S | = 2 n k , | S S | = | S S | = 2 ω n k .
Such subsets S have:
C 2 n / k 2 ( 1 ω ) n / k · C n 2 n / k 2 ω n / k = C 2 n / k 2 ω n / k · C n 2 n / k 2 ω n / k .
Then, the maximum number of formulas satisfied by all assignment pairs in Q ω is:
C n 2 n / k · C 2 n / k 2 ω n / k · C n 2 n / k 2 ω n / k · ( 2 m ) ! · ( ω m ) ! · m ( k 2 2 ω ) ! · k m ( k 1 ) m ( k 2 ) m ( k 3 ) ω m 2 ( 2 + ω ) m .
Let Z ω ( n ) denote the number of randomly selected formulas satisfied by the assignment pairs in Q ω , then:
E [ Z ω ( n ) ] C n 2 n / k · C 2 n / k 2 ω n / k · C n 2 n / k 2 ω n / k · ( 2 m ) ! · ( ω m ) ! · m ( k 2 2 ω ) ! · k m ( k 1 ) m ( k 2 ) m ( k 3 ) ω m 2 ( 2 + ω ) m · ( m k ) ! = E [ Z ( n ) ] · C 2 n / k 2 ω n / k · C n 2 n / k 2 ω n / k · ( ω m ) ! · m ( k 2 2 ω ) ! · ( k 2 ) m ( k 3 ) ω m 2 ω m · m ( k 2 ) ! E [ Z ( n ) ] · C 2 n / k 2 ω n / k · C n 2 n / k 2 ω n / k · ( k 2 ) m ( k 3 ) ω m 2 ω m · C m ( k 2 ) ω m .
Based on the combined approximate formula, it is simplified as follows:
C 2 n / k 2 ω n / k · C n 2 n / k 2 ω n / k C m ( k 2 ) ω m = ω m ( 1 ω k 2 ) 2 π 2 n k ω ( 1 ω ) · 2 n k ω ( 1 2 ω k 2 ) · e 2 n k H ω + n k k 2 H 2 ω k 2 m k 2 H ω k 2 d 4 π 2 n k ω ( 1 ω ) · e n k 2 H ω + k 2 H 2 ω k 2 d k 2 H ω k 2 .
Therefore,
E [ Z ω ( n ) ] E [ Z ( n ) ] · ( k 2 ) m ( k 3 ) ω m 2 ω m · d 4 π 2 n k ω ( 1 ω ) · e n k 2 H ω + k 2 H 2 ω k 2 d k 2 H ω k 2 = E [ Z ( n ) ] · d 4 π 2 n k ω ( 1 ω ) · e n k 2 H ω + k 2 H 2 ω k 2 d k 2 H ω k 2 + d ln k 2 + ω d ln k 3 ω d ln 2 .
and E [ Z 2 ( n ) ] = j = 0 2 n / k E [ Z k / 2 n · j ( n ) ] = E [ Z ( n ) ] + j = 1 2 n / k E [ Z k / 2 n · j ( n ) ] . Replace k j / 2 n with ω j , as follows:
E [ Z 2 ( n ) ] = E [ Z ( n ) ] + j = 1 2 n / k E [ Z ω j ( n ) ] E [ Z ( n ) ] 1 + j = 1 2 n / k d 4 π 2 n k ω j ( 1 ω j ) · e n ψ ω j = E 2 Z n 1 d e n φ d + e n φ d j = 1 2 n / k 1 4 π 2 n k ω j 1 ω j e n ψ ω j ,
where ψ ω j = 2 / k H ω j + k 2 / k H 2 ω j / k 2   d k 2 / k H ω j / k 2 + d / k ln k 2 + ω j d / k · ln k 3 ω j d / k ln 2 and φ ( d ) = d / k ln k + d / k ln k 1 2 d / k ln 2 + 1 d H 2 / k .
Let f ω j = 1 / 4 π 2 n / k ω j 1 ω j 1 / 2 , then:
E Z 2 n E 2 Z n 1 d e n φ d + e n φ d j = 1 2 n / k f ω j e n ψ ω j = E 2 Z n β e n φ d + e n φ d j = 1 2 n / k f ω j e n ψ ω j .
where β = 1 / d 1 / 2 .
Convert cumulative sums to integrals as follows:
E Z 2 n E 2 Z n β e n φ d + e n φ d 2 n k 0 1 f ω e n ψ ω d ω .
Here,
ψ ω = 2 k H ω + k 2 k H 2 ω k 2 d k 2 k H ω k 2 + d k ln k 2 + ω d k ln k 3 ω d k ln 2 = 2 k ω ln ω 1 ω ln 1 ω + k 2 k 2 ω k 2 ln 2 ω k 2 1 2 ω k 2 ln 1 2 ω k 2 d k 2 k ω k 2 ln ω k 2 1 ω k 2 ln 1 ω k 2 + ω d k ln k 3 ω d k ln 2 = d ω 4 ω k ln ω 2 1 ω k ln 1 ω 2 ω + d ω k ln 2 + k 2 d k + 3 d k ln k 2 k 2 2 ω k ln k 2 2 ω + d k 2 d d ω k ln k 2 ω + ω d k ln k 3 .
Because
H ω = ω ln ω 1 ω ln 1 ω = ln ω + ln 1 ω ,
H 2 ω k 2 = 2 ω k 2 ln 2 ω k 2 1 2 ω k 2 ln 1 2 ω k 2 = 2 k 2 ln k 2 2 ω k 2 2 k 2 ln 2 ω k 2 ,
H ω k 2 = ω k 2 ln ω k 2 1 ω k 2 ln 1 ω k 2 = 1 k 2 ln k 2 ω k 2 1 k 2 ln ω k 2 .
Thus,
ψ ω = 2 k ln 1 ω ln ω + k 2 k 2 k 2 ln k 2 2 ω k 2 2 k 2 ln 2 ω k 2 d k 2 k 1 k 2 ln k 2 ω k 2 1 k 2 ln ω k 2 + d k ln k 3 d k ln 2 = d 4 k ln ω + 2 k ln 1 ω + 2 k ln k 2 2 ω d k ln k 2 ω + d k ln k 3 2 + d k ln 2 .
Therefore, ψ ω = d 4 / k · 1 / ω 2 / k · 1 / 1 ω   4 / k · 1 / k 2 2 ω + d / k · 1 / k 2 ω .
When k = 6 , d * = 2.1168 can be obtained from Theorem 4. The image of the function ψ ω is shown in Figure 2a.
Therefore, it can be proven that when k 3 , d < d * , ψ ω < 0 . The function ψ ω is smooth in the interval 0 , 1 , with a unique maximum point.
Let ω max be the maximum point of the function ψ ω . From Reference [12], it can be obtained that:
lim n 0 1 f ω e n ψ ω d ω f ω max 2 π n ψ ω max · e n ψ ω max .
Finally,
E Z 2 n E 2 Z n β e n φ d + e n φ d 2 n k f ω max 2 π n ψ ω max e n ψ ω max = E 2 Z n β e n φ d + f 1 ω max n 1 2 e n ψ ω max φ d ,
where f 1 ω max = 2 / k · 2 π / ψ ω max 1 / 2 · f ω max .
Let,
ϕ d = ψ ω max φ d .
When k = 6 and 0 < ω max < 1 , the image of the function ϕ d is shown in Figure 2b.
Therefore, it can be proven that when k 3 , d < d * , ϕ d is smooth and non-positive in the interval 0 , 1 , which has a unique maximum point. Thereby,
E Z 2 n E 2 Z n β e n φ d + f 1 ω max . n e n ψ ω max φ d .
Thus, when d < d * , n , E Z 2 n = E 2 Z n 1 + o 1 .
In conclusion, Pr n [ Z ( n ) = 0 ] = 0 , that is, Pr n [ Z ( n ) > 0 ] = 1 . □
In other words, when d < d * , the 2-( d , k )-SAT instance has high probability to be satisfied. When d > d * , the high probability of the 2-( d , k )-SAT instance cannot be satisfied. Thus, Theorem 4 is proven.

4. Numerical Experiment and Analysis

This section mainly designs experiments to verify the rationality of the derivation of Theorem 4. From the above derivation, it is found that the phase transition point d * is a function of the clause length k. Therefore, we selected the 6 total cases where the clause length k = 4 , k = 6 , and k = 7 , and the variables set sizes are n = 10 and n = 10 respectively, to verify. The generation model introduced in the article is used to randomly generate ( d , k )-SAT instances. When k = 4 , d * = 1.6563 . When k = 6 , d * = 2.1168 and k = 6 , d * = 2.2803 .
Figure 3, Figure 4 and Figure 5 respectively show the phase transition phenomenon of 2-( d , k )-SAT instances with the clause lengths k = 4 , k = 6 , and k = 7 , and the number of variables n = 10 and n = 20 . In the figures, the horizontal axis represents the phase transition control parameter d, and the vertical axis represents the probability that 2-( d , k )-SAT instance can be satisfied. In the experimental design, because the occurrence times of the argument d and the number of clauses m are rounded off, a small error occurred. However, it can be seen that as the scale of the problem increases, this error gradually decreases. From the figures, for a positive integer d, when d < d , the problem can be satisfied with a high probability. When d > d , the problem cannot be satisfied with a high probability. The experimental value d is basically the same as the theoretical value d * .
In the experiment, we used a brute force method. The algorithm exhausts all the assignments, verifies them one by one, and finds all 2-exactly satisfiable formulas. This algorithm is suitable for solving small-scale examples of the problem. When the problem size increases, the algorithm solving time also increases exponentially. However, when the problem size was increased from 10 to 20, the experimental results were closer to the theoretical results. Therefore, as the scale of the problem further increases, the experimental results will become more and more accurate.

5. Conclusions

This article described s-exact satisfiability. For a ( d , k )-CNF formula F, if F contains only positive or negative literals, there is a satisfying assignment T with a size of s n / k , so that F is s-exactly satisfiable. A random regular ( d , k )-SAT instance generation model was introduced. By constructing the solution space and using the methods of first and second moments, the satisfiable phase transition point d * for the 2-( d , k )-SAT problem with only positive literals was provided. d * is a function of k ( k 3 ). For a positive integer d, if d < d * , the problem can be satisfied with high probability. If d > d * , the problem cannot be satisfied with high probability. The 2-( d , k )-SAT problem is a special case of the s-( d , k )-SAT problem. This study of the phase transition of the problem contributes to understanding and analyzing the phase transition of the s-( d , k )-SAT problem and similar problems. In further work, the phase transition of the s-( d , k )-SAT problem in other cases will be provided by using this method or exploring new methods, such as clauses with both positive and negative literals.

Author Contributions

Formal analysis, G.N.; Investigation, G.N. and D.X.; Methodology, G.N., D.X. and X.W. (Xiaofeng Wang); Writing—original draft, G.N.; Writing—review and editing, G.N., D.X., X.W. (Xiaofeng Wang) and X.W. (Xi Wang). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under grant numbers 61762019, 61862051, and 62062001.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Xu, D.Y. Reduction Techniques for Satisfiability Problems. Stud. Logic. 2012, 5, 98–111. [Google Scholar] [CrossRef]
  2. Xu, D.; Wang, X. A regular NP-complete problem and it’s inaproximability. J. Front. Comput. Sci. Technol. 2013, 7, 691–697. [Google Scholar] [CrossRef]
  3. Kirkpatrick, S.; Selman, B. Critical behavior in the satisfiability of random Boolean Formulae. Science 1994, 264, 1297–1301. [Google Scholar] [CrossRef] [PubMed]
  4. Mertens, S.; Mezard, M.; Zecchina, R. Threshold values of random k-SAT from the cavity method. Random Struct. Algorithms 2006, 28, 340–373. [Google Scholar] [CrossRef] [Green Version]
  5. Krzakała, F.; Montanari, A.; Ricci-Tersenghi, F.; Semerjian, G.; Zdeborová, L. Gibbs states and the set of solutions of random constraint satisfaction Problems. Proc. Natl. Acad. Sci. USA 2007, 104, 10318–10323. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Fan, Y.; Shen, J. On the phase transitions of random k-constraint satisfaction problems. Artif. Intell. 2011, 175, 914–927. [Google Scholar] [CrossRef] [Green Version]
  7. Kaporis, A.; Kirousis, L.; Lalas, E. The probabilistic analysis of a greedy satisfiability algorithm. Random Struct. Algorithms 2006, 28, 444–448. [Google Scholar] [CrossRef]
  8. Díaz, J.; Kirousis, L.; Mitsche, D.; Pérez-Giménez, X. On the satisfiability threshold of formulas with three literals per clause. Theor. Comput. Sci. 2009, 410, 2920–2934. [Google Scholar] [CrossRef] [Green Version]
  9. Aurell, E.; Gordon, U.; Kirkkpatrick, S. Comparing beliefs, surveys, and random walks. Adv. Neural Inf. Process. Syst. 2004, 17, 1–8. [Google Scholar]
  10. Braunstein, A.; Zecchina, R. Survey and belief propagation on random k-SAT. Lect. Notes Comput. Sci. 2004, 2919, 519–528. [Google Scholar] [CrossRef]
  11. Boufkhad, Y.; Dubois, O.; Interian, Y.; Selman, B. Regular random k-sat: Properties of balanced formulas. J. Autom. Reason. 2005, 35, 181–200. [Google Scholar] [CrossRef]
  12. Rathi, V.; Aurell, E.; Rasmussen, L.K.; Skoglund, M. Bounds on Threshold of Regular Random k-SAT. Lect. Notes Comput. Sci. 2010, 6175, 264–277. [Google Scholar] [CrossRef]
  13. Jian, D.; Sly, A.; Nike, S. Satisfiability threshold for random regular NAE-SAT. In Proceedings of the Forty-Sixth Annual ACM Symposium on Theory of Computing, New York, NY, USA, 31 May–3 June 2014; pp. 814–822. [Google Scholar] [CrossRef] [Green Version]
  14. Achlioptas, D.; Chtcherba, A.; Istrate, G.; Moore, C. The phase transition in 1-in-k SAT and NAE 3-SAT. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms, Washington, DC, USA, 7–9 January 2001; pp. 721–722. [Google Scholar] [CrossRef]
  15. Achioptas, D.; Moore, C. RANDOM k-SAT: Two moments suffice to cross a sharp threshold. SIAM J. Comput. 2006, 36, 740–762. [Google Scholar] [CrossRef] [Green Version]
  16. Moore, C. The phase transition in random regular exact cover. Annales l’Institut Henri Poincaré D 2016, 3, 349–362. [Google Scholar] [CrossRef] [Green Version]
  17. Kalapala, V.; Moore, C. The phase transition in exact cover. Chic. J. Theor. Comput. 2008, 5, 9. [Google Scholar] [CrossRef]
  18. Mo, X.; Xu, D.; Wang, X. The Phase Transition Analysis for Random Regular Exact (s,c,k)-SAT Problem. IEEE Access 2021, 9, 26664–26673. [Google Scholar] [CrossRef]
  19. Mitzenmacher, M.; Upfal, E. Probability and Computing; Cambridge University Press: Cambridge, UK, 2005. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The factor graph of (4,3)-CNF.
Figure 1. The factor graph of (4,3)-CNF.
Symmetry 13 01231 g001
Figure 2. (a) The graph of function ψ ω ( k = 6 ). (b) The graph of function ϕ d ( k = 6 ).
Figure 2. (a) The graph of function ψ ω ( k = 6 ). (b) The graph of function ϕ d ( k = 6 ).
Symmetry 13 01231 g002
Figure 3. Phase transition phenomenon of the regular exact 2-( d , k )-SAT instance ( k = 4 , n = 10 , and n = 20 ).
Figure 3. Phase transition phenomenon of the regular exact 2-( d , k )-SAT instance ( k = 4 , n = 10 , and n = 20 ).
Symmetry 13 01231 g003
Figure 4. Phase transition phenomenon of the regular exact 2-( d , k )-SAT instance ( k = 6 , n = 10 , and n = 20 ).
Figure 4. Phase transition phenomenon of the regular exact 2-( d , k )-SAT instance ( k = 6 , n = 10 , and n = 20 ).
Symmetry 13 01231 g004
Figure 5. Phase transition phenomenon of the regular exact 2-( d , k )-SAT instance ( k = 7 , n = 10 , and n = 20 ).
Figure 5. Phase transition phenomenon of the regular exact 2-( d , k )-SAT instance ( k = 7 , n = 10 , and n = 20 ).
Symmetry 13 01231 g005
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Nie, G.; Xu, D.; Wang, X.; Wang, X. The Phase Transition Analysis for the Random Regular Exact 2-(d, k)-SAT Problem. Symmetry 2021, 13, 1231. https://doi.org/10.3390/sym13071231

AMA Style

Nie G, Xu D, Wang X, Wang X. The Phase Transition Analysis for the Random Regular Exact 2-(d, k)-SAT Problem. Symmetry. 2021; 13(7):1231. https://doi.org/10.3390/sym13071231

Chicago/Turabian Style

Nie, Guoxia, Daoyun Xu, Xiaofeng Wang, and Xi Wang. 2021. "The Phase Transition Analysis for the Random Regular Exact 2-(d, k)-SAT Problem" Symmetry 13, no. 7: 1231. https://doi.org/10.3390/sym13071231

APA Style

Nie, G., Xu, D., Wang, X., & Wang, X. (2021). The Phase Transition Analysis for the Random Regular Exact 2-(d, k)-SAT Problem. Symmetry, 13(7), 1231. https://doi.org/10.3390/sym13071231

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop