Wednesday, 25 July 2007

computer science - Satisfiability of general Boolean formulas with at most two occurrences per variable

(If you know basics in theoretical computer science, you may skip immediately to the dark box below. I thought I would try to explain my question very carefully, to maximize the number of people that understand it.)



We say that a Boolean formula is a propositional formula over some 0-1 variables x1,ldots,xnx1,ldots,xn involving AND and OR connectives, where the atoms are literals which are instances of either xi or negxi for some i. That is, a Boolean formula is a "monotone" formula over the 2n atoms x1,ldots,xn,negx1,ldots,negxn. For example, ((negx1 OR x2) AND (negx2 OR x1)) OR x3 is a Boolean formula expressing that either x1 and x2 take on the same value, or x3 must be 1. We say that a 0-1 assignment to the variables of a formula is satisfying if the formula evaluates to 1 on the assignment.



The Cook-Levin theorem says that the general satisfiability of Boolean formulas problem is NP-complete: given an arbitrary formula, it is NP-hard to find a satisfying assignment for it. In fact, even satisfying Boolean formulas where each variable xi appears at most three times in the formula is NP-complete. (Here is a reduction from the general case to this case: Suppose a variable x appears k>3 times. Replace each of its occurrences with fresh new variables x1,x2,ldots,xk, and constrain these k variables to all take on the same value, by ANDing the formula with (negx1 OR x2) AND (negx2 OR x3) AND cdots AND (negxk1 OR xk) AND (negxk OR x1). The total number of occurrences of each variable xj is now three.) On the other hand, if each variable appears only once in the formula, then the satisfiability algorithm is very easy: since we have a monotone formula in x1,ldots,xn,negx1,ldots,negxn, we set xi to 0 if negxi appears, otherwise we set xi to 1. If this assignment does not get the formula to output 1, then no assignment will.




My question is, suppose every variable appears at most twice in a general Boolean formula. Is the satisfiability problem for this class of formulas NP-complete, or solvable in polynomial time?




EDIT: To clarify further, here is an example instance of the problem:
((x1 AND x3) OR (x2 AND x4 AND x5)) AND (negx1 OR negx4) AND (negx2 OR (negx3 AND negx5))



Note that when we restrict the class of formulas further to conjunctive normal form (i.e. a depth-2 circuit, an AND of ORs of literals) then this "at most twice" problem is known to be solvable in polynomial time. In fact, applying the "resolution rule" repeatedly will work. But it is not clear (at least, not to me) how to extend resolution for the class of general formulas to get a polytime algorithm. Note when we reduce a formula to conjunctive normal form in the usual way, this reduction introduces variables with three occurrences. So it seems plausible that perhaps one might be able to encode an NP-complete problem in the additional structure provided by a formula, even one with only two occurrences per variable.



My guess is that the problem is polynomial time solvable. I'm very surprised that I could not find a reference to this problem in the literature. Perhaps I'm just not looking in the right places.



UPDATE: Please think about the problem before looking below. The answer is surprisingly simple.

No comments:

Post a Comment