Learning for Dynamic subsumption

Learning for Dynamic Subsu mption Y. Hamadi 1 , S. Jabb our 2 , and L. Sa ¨ ıs 2 1 Microsof t Research 7 J J Thomson Avenue Cam bridge, United Kingdom youssefh@micr osoft.com 2 Universit ´ e Li lle-Nord de F rance CRIL - CNRS UMR 8188 Artois, F-62307 Lens { jabbour,sais } @cril .fr Abstract. This pap er presents an original d ynamic sub sumption tech- nique for Bo olean CNF formulae. It exploits simple and suﬃcient condi- tions to detect, during conﬂict analysis, clauses from the fo rmula that can b e reduced by subsump t ion. During the learnt clause d eriva tion, and at eac h step of the associated resolution pro cess, chec ks for backw ard sub- sumption b etw een the current resolv en t and clauses from the original form ula are eﬃcien tly performed. T he resulting metho d allo ws the dy- namic remo v al of literals from the original clauses. Exp erimental results sho w that the integration of our dynamic subsumption technique within the state-of-the-art SA T solv ers Minisat and Rsat particularly b eneﬁts to crafted problems. 1 In tro duction The SA T pro ble m, i.e., the pro blem of c hec king whether a set of Boolea n clauses is sa tisﬁable or not, is central to ma ny do ma ins in c omputer s cience and ar- tiﬁcial intelligence including constraint satisfa ction pr o blems (CSP), planning, non-monotonic rea soning, VLSI correctness chec king, etc. T o day , SA T has gained a consider able audience with the a dven t of a ne w ge ne r ation of SA T solvers able to solve larg e instances enco ding rea l-world applications a nd the demonstra- tion that these solvers r epresent imp o rtant low-lev el building blocks for ma ny impo rtant ﬁelds, e.g., SMT s o lving, Theorem proving, Mo del ﬁnding, Q BF solv- ing, etc. These solvers, called mo dern SA T solvers [13 , 9], a re based on cla ssical unit propa gation [6] eﬃciently combined thr ough incremental data str uctures with: (i) res ta rt po lic ies [10, 11], (ii) activity-based v ariable selection heuristics (VSIDS-lik e) [13], and (iii) clause learning [1, 12, 13]. Mo dern SA T solvers can be seen as an extended version of the well known DPLL- like pro cedure obtained thanks to these diﬀerent enhancements. It is imp or tant to note tha t the w ell known reso lution rule ` a la Robinson still pla ys a stro ng role in the eﬃcienc y of mo dern SA T solvers which ca n b e understo o d as a particular for m of general resolution [2]. Indeed, conﬂict-based learning, o ne of th e most important c o mpo nent of SA T solvers is bas e d on r esolution. W e ca n als o mention, that the well known and hig hly successful ( SatElite ) prepro cess or is bas e d on v ariable elimination through the r esolution r ule [15 , 3]. As mentioned in [15], on industrial instances, resolution leads to the generation of many tauto lo gical resolven ts. This ca n b e explained by the fact that many clauses represent Bo ole a n functions e nco ded through a common set o f v ariables. This pro pe r ty o f the enco dings might also be at the origin of many redundant or subsumed clauses at diﬀerent steps of the search pr o cess. The utilit y of ( SatElite ) on industr ial problems ha s been prov ed, and therefore one ca n wonder if the a pplica tion o f the res o lution rule could b e p er- formed no t o nly a s a pre-pro c essing stag e but systematically during the s earch pro cess. Unfortunately , dynamically maintaining a formula clo s ed under s ub- sumption migh t b e time cons uming . An attempt has been made recen tly in this direction b y L. Zhang [1 6]. In this work, a novel algor ithm maintains a subsumption-free cla use database b y dynamically detecting a nd removing sub- sumed clauses as they ar e added. Int erestingly , the a uthor ment ion the following per sp ective o f resear ch: ” How to balance the runtime cost and the qualit y of the result fo r on-the-ﬂy CNF simpliﬁcation is a very interesting problem worth m uch further in v estigation”. In this pap er, our ob jective is to design an e ﬀectiv e dynamic simpliﬁcation a l- gorithm based on reso lution. Our pr o p osed a pproach aims at eliminating liter a ls from the CNF formula by dynamically substituting s maller claus es. More pre- cisely , o ur approach exploits the intermediate steps of cla ssical conﬂict ana lysis to subsume the clauses of the formula whic h ar e used in the under lying r esolution deriv a tion of the ass erting clause. Since orig inal clauses or learnt clauses ca n b e used dur ing conﬂict analysis b oth categorie s ca n b e s impliﬁed. The eﬀectiveness of our technique lies in the eﬃcie ncy o f the subsumption test, which is bas ed on a simple and s uﬃcient condition computable in constant time. Mo reov er, since our technique relies o n the der iv ation of a co nﬂict- c lause, it is guided b y the co n- ﬂicts, and simpliﬁes parts of the formula identiﬁed as imp ortant b y the sea rch strategy (VSIDS guida nce). This dynamic pro cess preserves the s a tisﬁability of the fo r mula, a nd with some a dditional b o okkeeping c an preserve the eq uiv alence of the mo dels. The pape r is org anized as follows. After some preliminar y deﬁnitions and no- tations, classic al implica tio n gra ph and learning schemes a re presented in s ection 2. Then our dyna mic subsumption a pproach is desc rib ed in section 3. Finally , befo re the conclusion, exp erimental res ults demonstrating the p erformances of our approach are pr esented. 2 T ec hnica l bac kgr ound 2.1 Preliminary deﬁniti ons and notations A CNF formula F is a conjunction of clauses , where a c lause is a disjunction of liter al s . A literal is a po sitive ( x ) o r negated ( ¬ x ) prop ositiona l v ariable. The t wo literals x and ¬ x a re called c omplementary . W e note by ¯ l the complementary literal of l . F or a s et of liter als L , ¯ L is deﬁned a s { ¯ l | l ∈ L } . A unit c la use is a clause containing only o ne literal (called unit liter al ), while a binary clause contains exactly tw o literals. An empty clause , noted ⊥ , is interpreted as fals e (unsatisﬁable), wherea s an empty CNF formula , noted ⊤ , is interpreted as true (satisﬁable). The set of v ar iables o ccurring in F is noted V F . A set of litera ls is c omplete if it con tains one literal for each v a riable in V F , and fun damental if it do es not co n tain complementary litera ls. An assignment ρ of a Bo o le an formula F is function which asso cia tes a v a lue ρ ( x ) ∈ { f al se, true } to so me of the v ariables x ∈ F . ρ is c omple te if it assig ns a v alue to e very x ∈ F , and p artial otherwise. An assignment is alternatively represented by a complete and fundamental set of litera ls, in the ob vious wa y . A mo de l of a formula F is an assignment ρ that makes the formula tr ue ; noted ρ  Σ . The following no tations will b e heavily used throughout the pap er: – η [ x, c i , c j ] denotes the r esolv ent b etw een a cla use c i containing the liter a l x and c j a clause c ontaining the opp osite litera l ¬ x . In other words η [ x, c i , c j ] = c i ∪ c j \{ x, ¬ x } . A resolven t is calle d taut olo gic al when it contains opp osite literals. – F | x will denote the fo r mula obtaine d from F by assigning x the truth- v alue true . F ormally F | x = { c | c ∈ F , { x, ¬ x } ∩ c = ∅} ∪ { c \ { ¬ x } | c ∈ F , ¬ x ∈ c } (that is: the cla uses co ntaining x and ar e therefor e satisﬁed are remov ed; and tho s e co nt aining ¬ x a r e s impliﬁed). This notation is ex- tended to assignments: given an ass ignment ρ = { x 1 , . . . , x n } , w e deﬁne F | ρ = ( . . . (( F | x 1 ) | x 2 ) . . . | x n ). – F ∗ denotes the formula F c lo sed under unit propa g ation, deﬁned recurs ively as follows: (1) F ∗ = F if F do es no t contain any unit cla use, (2) F ∗ = ⊥ if F contains tw o unit-cla uses { x } and {¬ x } , (3) otherwis e, F ∗ = ( F | x ) ∗ where x is the litera l app earing in a unit clause of F . A c lause c is deduced by unit propaga tion from F , no ted F | = ∗ c , if ( F | ¯ c ) ∗ = ⊥ . Let c 1 and c 2 be t w o claus es of a formula F . W e say that c 1 (resp ectively c 2 ) subsume (resp ectively is subsumed) c 2 (resp ectively by c 1 ) iﬀ c 1 ⊆ c 2 . If c 1 subsume c 2 , then c 1 | = c 2 (the conv erse is no t true). Also F and F − c 2 are equiv alent with res p ect to sa tisﬁability . 2.2 DPLL searc h DPLL [6] is a tree- ba sed backtrack search pro cedure; at each no de of the se a rch tree, the assigned literals (decision literal and the pro pagated ones) a r e lab eled with the same de cision level starting from 1 and increased at each decisio n (or branching). After backtrac king, so me v ar iables ar e unassigned, and the c urrent decision level is dec r eased a ccordingly . A t level i , the current par tial ass ign- men t ρ ca n b e repres ent ed as a seq uence of decision-pro pagation of the form h ( x i k ) , x i k 1 , x i k 2 , . . . , x i k n k i where the ﬁrst literal x i k corres p o nds to the decisio n literal x k assigned at level i and each x i k j for 1 ≤ j ≤ n k represents a propa g ated (unit) litera ls a t level i . L e t x ∈ ρ , we note l ( x ) the a ssignment level of x . F o r a clause α , l ( α ) is deﬁned as the maximum level of its assigned litera ls. 2.3 Conﬂict analysis using im plication graphs Implication graphs is a standard representation conv enien tly used to analyze conﬂicts in mo dern SA T solvers. Whenever a literal y is propag ated, we keep a r eference to the clause which triggers the propag ation of y , which w e note imp ( y ). The clause imp ( y ), ca lled implication of y , is in this case of the form ( x 1 ∨ · · · ∨ x n ∨ y ) where every liter al x i is false under the current pa r tial a ssignment ( ρ ( x i ) = false , ∀ i ∈ 1 ..n ), while ρ ( y ) = true . When a literal y is no t obtained by pro pagation but comes from a decision, imp ( y ) is undeﬁned, which we no te for conv enience imp ( y ) = ⊥ . When imp ( y ) 6 = ⊥ , we deno te by e xp ( y ) the set { x | x ∈ i mp ( y ) \ { y }} , called set of explanations of y . When imp ( y ) is undeﬁned we deﬁne e xp ( y ) as the empty set. Deﬁnition 1 (Implication Graph). L et F b e a C NF formula, ρ a p artial assignment, and let exp denotes the set of explanations for the de du c e d (u nit pr op aga te d) liter als in ρ . The implic ation gr aph asso ciate d to F , ρ and exp is G ρ,exp F = ( N , E ) wher e: – N = ρ , i.e., ther e is exactly one no de for every liter al, de cide d or implie d; – E = { ( x, y ) | x ∈ ρ, y ∈ ρ, x ∈ exp ( y ) } In the rest of this pap er, for simplicity reason, exp is omitted, a nd an impli- cation graph is simply noted as G ρ F . W e a lso note m as the conﬂict level. Example 1. G ρ F , shown in Figure 1 is an implication graph for the formula F and the pa rtial assignment ρ g iven below : F ⊇ { c 1 , . . . , c 12 } ( c 1 ) ¬ x 1 ∨ ¬ x 11 ∨ x 2 ( c 2 ) ¬ x 1 ∨ x 3 ( c 3 ) ¬ x 2 ∨ ¬ x 12 ∨ x 4 ( c 4 ) ¬ x 1 ∨ ∨¬ x 3 ∨ x 5 ( c 5 ) ¬ x 4 ∨ ¬ x 5 ∨ ¬ x 6 ∨ x 7 ( c 6 ) ¬ x 5 ∨ ¬ x 6 ∨ x 8 ( c 7 ) ¬ x 7 ∨ x 9 ( c 8 ) ¬ x 5 ∨ ¬ x 8 ∨ ¬ x 9 ( c 9 ) ¬ x 10 ∨ ¬ x 17 ∨ x 1 ( c 10 ) ¬ x 13 ∨ ¬ x 14 ∨ x 10 ( c 11 ) ¬ x 13 ∨ x 17 ( c 12 ) ¬ x 15 ∨ ¬ x 16 ∨ x 13 ρ = {h . . . ¬ x 1 15 . . . ih ( x 2 11 ) . . . . . . ih ( x 3 12 ) . . . x 3 6 . . . ih ( x 4 14 ) , . . . ih ( x 5 16 ) , x 5 13 , . . . i} . The conﬂict level is 5 a nd ρ ( F ) = f al se . Deﬁnition 2 (Asserting clause). A clause c of the form ( α ∨ x ) is c alle d an asserting clause iﬀ ρ ( c ) = false , l ( x ) = m and ∀ y ∈ α, l ( y ) < l ( x ) . x is c al le d asserting liter al. Conﬂict a na lysis is the result of the applicatio n of r e solution star ting from the conﬂict cla use using diﬀerent implications implicitly enco ded in the implication graph. W e call this pro cess an asser ting clause deriv ation (in shor t ACD). Deﬁnition 3 (Asserting clause deri v ation). A n asserting clause derivation π is a se qu enc e of clauses h σ 1 , σ 2 , . . . σ k i s atisfying t he fol lowing c ondi tions : 1. σ 1 = η [ x, imp ( x ) , imp ( ¬ x )] , wher e { x, ¬ x } is the c onﬂ ict. x4 x5 x8 x9 x2 x7 c1 x3 x1 c2 c3 x12(3) −x9 x11(2) x10 x14(4) x13 x16 x15(1) x6(3) x6(3) c5 c6 c7 c10 c4 c9 c8 c11 c12 x17 Fig. 1. I mplication Graph G ρ F = ( N , E ) 2. σ i , for i ∈ 2 ..k , is built by sele cting a liter al y ∈ σ i − 1 for which imp ( y ) is deﬁne d. We then have y ∈ σ i − 1 and y ∈ imp ( y ) : t he two clauses r esolve. The clause σ i is deﬁne d as η [ y , σ i − 1 , imp ( y )] ; 3. σ k is, mor e over an asserting clause. Note that every σ i is a re s olven t o f the formula F : by induction, σ 1 is the resolven t b etw een tw o clauses that dire c tly b elong to F ; for every i > 1, σ i is a reso lven t b etw een σ i − 1 (whic h, by induction h ypothesis , is a r esolven t) and a clause of F . Every σ i is therefore also an implic ate of F , that is: F | = σ i . By deﬁnition of the implication gra ph, we also have F ′ | = ∗ σ i where F ′ ⊂ F is the set of c la uses used to derive σ i . Let us consider a gain the ex ample 1 . The trav ersal of the g raph G ρ F (see Fig . 1) lea ds to the following asserting clause der iv atio n: h σ 1 , . . . , σ 7 i where σ 1 = η [ x 9 , c 7 , c 8 ] = ( ¬ x 5 5 ∨ ¬ x 7 5 ∨ ¬ x 8 3 ) and σ 7 = ( ¬ x 11 2 ∨ ¬ x 12 3 ∨ ¬ x 6 3 ∨ ¬ x 1 5 ). The no de x 1 corres p o nding to the asser ting literal ¬ x 1 is ca lled the ﬁr st Unique Implication Poin t (First UIP). 3 Learning for dynamic subsumption In this sectio n, we show how classic al lea rning can b e adapted for an eﬃcient dynamic subsumption o f clauses . In our appro ach, we explo it the intermediate steps or resolven ts g e nerated during the classica l conﬂict analysis to s ubsume some of the clauses used in the underlying r esolution der iv ation o f the asserting clause. O bviously , it would b e p ossible to consider the subsumption test b etw e en each generated resolven t and the whole set of clause s. How ev er, this could b e very costly in pr a ctice. Let us, illus tr ate some o f the main features of o ur prop osed approach. 3.1 Motiv ating example Let us recons ider a gain the example 1 and the implication gra ph G ρ F (ﬁgure 1). The as serting cla use deriv ation leading to the a sserting clause ∆ 1 is desc rib ed as follows: π = h σ 1 , σ 2 , σ 3 , . . . , σ 7 = ∆ 1 i – σ 1 = η [ x 9 , c 7 , c 8 ] = ( ¬ x 8 5 ∨ ¬ x 7 5 ∨ ¬ x 5 5 ) – σ 2 = η [ x 8 , σ 1 , c 6 ] = ( ¬ x 6 3 ∨ ¬ x 7 5 ∨ ¬ x 5 5 ) – σ 3 = η [ x 7 , σ 2 , c 5 ] = ( ¬ x 6 3 ∨ ¬ x 5 5 ∨ ¬ x 4 5 ) ⊂ c 5 (subsumption) – . . . – ∆ 1 = σ 7 = η [ x 2 , σ 6 , c 1 ] = ( ¬ x 11 2 ∨ ¬ x 12 3 ∨ ¬ x 6 3 ∨ ¬ x 1 5 ) As w e c a n see the asser ting cla use der iv ation π includes the resolvent σ 3 = ( ¬ x 6 3 ∨ ¬ x 5 5 ∨ ¬ x 4 5 ) w hich subsumes the claus e c 5 = ( ¬ x 6 ∨ ¬ x 5 ∨ ¬ x 4 ∨ x 7 ). Consequently , the literal x 7 is eliminated from the clause c 5 . In gene r al, the resolven t σ 3 can s ubsume other cla uses from the implica tion gra ph that include the literals ¬ x 6 , ¬ x 5 and ¬ x 4 . 3.2 Dynamic subsum ption: a ge neral form ulation Let us now g ive a for mal pr esentation of o ur dy na mic subsumption approach. Deﬁnition 4 (F-subsumpti on mo dulo UP). L et c ∈ F . c is F -subsu me d mo dulo Unit Pr op agation iﬀ ∃ c ′ ⊂ c such that F | ¯ c ′ | = ∗ ⊥ Given tw o clause s c 1 and c 2 from F such that c 1 subsumes c 2 , then c 2 is F -subsumed mo dulo UP . As explained b efore, subsuming claus e s during se arch might b e time co nsum- ing. In o ur prop o sed framework, to reduce the co mputational cost, we restrict subsumption ch ecks to the intermediate reso lvents σ i and the clauses of the form imp ( y ) used to derive them (clauses enco ded in the implication graph). Deﬁnition 5. L et F b e a formula and π = h σ 1 . . . σ k i an asserting clause derivation. F or e ach σ i ∈ π , we deﬁne C σ i = { i mp ( y ) ∈ F |∃ j ≤ i st . σ j = η [ y , imp ( y ) , σ j − 1 ] } as the set of clauses of F u se d for the derivation of σ i . Pr op erty 1. Let F b e a formula and π = h σ 1 , σ 2 , . . . , σ i , . . . , σ k i an asserting clause deriv a tion. If σ i subsumes a cla use c of C σ k then c ∈ C σ i . Pr o of. As σ i +1 = η [ y , imp ( y ) , σ i ] where ¬ y ∈ σ i , we hav e σ i 6⊂ imp ( y ). The next resolution steps can no t inv olv e cla uses containing the litera l ¬ y . Otherwise, the literal y in the implicatio n gr aph will admit tw o p oss ible explanations (implica- tions), which is not p ossible by deﬁnition of the implication g raph. Consequently , σ i can no t subsume cla uses from C σ k − C σ i . Pr op erty 2. Let F be a formula and π a n as s erting clause deriv ation. If σ i ∈ π subsumes a clause c o f C σ i then c is C σ i -subsumed mo dulo UP . Pr o of. As σ i ∈ π is derived from C σ i by r esolution, then C σ i | = σ i . By deﬁnition of an asserting clause deriv ation and implication g raphs, we als o have C σ i | = ∗ σ i (see section 2.3). As σ i subsumes c ( σ i ⊆ c ), then C σ i | = ∗ c . The Prop er t y 2 shows that if a clause c enco ded in the implicatio n graph is subsumed by σ i , such subsumption c an b e captured by subs umption mo dulo UP , while the Pro per ty 1 mention that subsumption chec ks of σ i can b e restr icted to clauses from C σ i . Consequently , a p ossible g eneral dyna mic subsumption a p- proach can b e stated a s follows: Let π = h σ 1 , . . . , σ i , . . . , σ k i be an as serting resolution der iv atio n. F or ea ch re s olven t σ i ∈ π , we a pply subsumption chec ks betw een σ i and all the clauses in C σ i . In the following, we show that w e can reduce further the num b er of clauses to b e check ed for subsumption by considering only a subset of C σ i . Obviously , as σ i is a r esolven t of an ass e rting cla use deriv ation π , then there exis ts tw o paths from the conﬂict no des x and ¬ x r esp ectively , to one or more no des of the implication g raph as so ciated to the literals of σ i assigned a t the conﬂict level. Consequently , we derive the fo llowing pro pe r ty: Pr op erty 3. Let π b e an asser ting clause deriv ation, σ i ∈ π and c ∈ C σ i . If σ i subsumes c , then there exists t wo pa ths from the co nﬂict nodes x and ¬ x resp ectively , to one or mor e no des of the implication gra ph asso ciated to the literals of c assigned at the conﬂict level. The pr o of o f the pro per ty is immediate since σ i ⊂ c . As this pro pe rty is tr ue for σ i which is der ived by resolution fro m the tw o clauses inv olving x and ¬ x . Then it is a lso, true fo r its super s ets ( c ). F or a given σ i , the Pr op erty 3 leads us to ano ther r estriction of the s et of clauses to b e chec ked for subsumption. Indeed, we only need to consider the set of clauses P σ i , linked (by paths) to the tw o co nﬂicting litera ls x a nd ¬ x . W e illustrate this characteriza tion using the e xample 1 (see. also Figur e 1). Le t π = h σ 1 , . . . , σ 7 i where σ 7 = ( ¬ x 11 ∨ ¬ x 12 ∨ ¬ x 6 ∨ ¬ x 1 ). W e hav e C σ 7 = { c 1 , c 2 , c 3 , c 4 , c 5 , c 6 , c 7 , c 8 } a nd P σ 7 = { c 1 , c 2 , c 4 , c 5 , c 6 , c 8 } . Indeed, from the nodes x 2 of the cla use c 3 we only hav e one path to the no des x 9 . Conse- quently , the clause c 3 might b e discarded from the set o f clauses to be chec k ed for subsumption. Similarly , the no de x 7 of the cla use c 7 is not link ed with a path to ¬ x 9 . Then c 7 is not considered for subsumption tests. Pr op erty 4. Given an asserting clause deriv a tion π = h σ 1 , . . . , σ k i . The time complexity of our g eneral dynamic subsumption appro ach is in O ( |C σ k | 2 ). Pr o of. F r om the deﬁnition of C σ i , we have |C σ i | = i + 1. In the worst ca se, we need to co nsider i + 1 subsumption chec ks. Then for all σ i with 1 ≤ i ≤ k , we have to chec k P 1 ≤ i ≤ k ( i + 1) = k × ( k +3) 2 . As k = |C σ k | , then the worst case complexity is in O ( |C σ k | 2 ). The worst c ase complexity is q uadratic even if w e consider P σ k ⊂ C σ k . 3.3 Dynamic subsum ption on the ﬂy In section 3.2, we hav e presented the genera l approa ch fo r dyna mic subsumption. Its co mplexity is qua dr atic in the num ber of clauses use d in the der iv atio n of a n asserting clause. As stated, in the intro duction, to desig n an eﬃcient dynamic simpliﬁcation technique, one need to bala nce the run time cos t and the quality of the simpliﬁcation. In this s ection, we prop ose a restriction of the g eneral dynamic s ubsumption scheme, called dyna mic susbumption o n the ﬂy , which applies subsumption o nly b etw een the current resolvent σ i and the la st cla us e from the implication graph used for its deriv ation. More prec isely , supp ose σ i = η [ y , c, σ i − 1 ], we o nly check s ubs umption be tw een σ i and c . The following prop erty gives a suﬃcient c o ndition under which y ca n b e remov ed fo r m c Pr op erty 5. Let π b e an ass e rting clause deriv ation, σ i ∈ π such that σ i = η [ y , c, σ i − 1 ]. If σ i − 1 − { y } ⊆ c , then c is subsumed by σ i . Pr o of. Let c = ( ¬ y ∨ α ) and σ i − 1 = ( y ∨ β ). Then σ i = ( α ∨ β ). As σ i − 1 − { y } ⊆ c , then β ⊆ α . So, σ i = α which subsumes ( ¬ y ∨ α ) = c . Considering mo dern SA T so lvers that include conﬂict ana ly sis, the integra- tion of this new dynamic subsumption approach can b e done with negligible additional cos t. Indeed, by using a simple counter during the conﬂict ana lysis pro cedure, we can verify the suﬃcient condition g iven in the Prop er ty 5 with a constant complexity time. Indeed, at each step of the a sserting cla use deriv ation, we ge ner ate the next r esolven t σ i from a clause c and a r esolven t σ i − 1 . In the classical implementation o f conﬂict analy sis, o ne can check in consta n t time if a given liter al is present in the current resolvent . Co nsequently , during the visit of the clause c , we additio nally compute the num ber n of litera ls of c that belo ng to σ i − 1 . If n ≥ | σ i − 1 | − 1 then c is s ubsumed by σ i = η [ y , c, σ i − 1 ]. 4 Exp erimen ts The ex per iments were done on a la rge panel of cr afted and industrial prob- lems co ming fro m the last comp etitions. All the instances were simpliﬁed by the SatE lite prepro cess or [8]. W e implemented our dynamic subsumption a p- proach in Minis at [9 ] and Rsat [14] and made a compa r ison b etw e en the o riginal solvers and the ones enhanced with dynamic subsumption. All the tes ts w ere made o n a Xeo n 3.2 GHz (2 GB RAM) cluster. Results a re r ep orted in sec o nds. 4.1 Crafted problem s During these exp eriments, the CPU time limit was ﬁxed to 3 hour s. These prob- lems a re hand made and many o f them are designed to b eat all the existing DPLL solvers. They co nt ain for example Qua si-gro up instances, forced r andom SA T instances , counting, or de r ing and p ebbling instances, so cial g olfer problems, etc. 1e-04 0.001 0.01 0.1 1 10 100 1000 10000 100000 1e-04 0.001 0.01 0.1 1 10 100 1000 10000 100000 Minisat+DS Minisat 0 2000 4000 6000 8000 10000 12000 400 450 500 550 600 650 time (seconds) # instances Minisat Minisat+DS Fig. 2. Crafted problems: Minisat vs Minisat+DS The log -scaled scatter plot (in log scale ) given in the left par t of Figure 2 details the results for Mi nisat a nd Minisat+DS on each cra fted insta nce. The x-axis (res p. y-axis ) corr esp onds to the CPU time tx (resp. ty ) obtained by Minisat (r e sp. Minisat+ DS ). Each do t with ( tx, ty ) co or dinates, cor resp onds to a SA T ins ta nce. Dots b elow (re s p. ab ov e) the diago nal indica te ins ta nces where the subsumption is mor e eﬃcient i.e. ty < tx . T his ﬁgur e clea rly shows the computational gain obtained thank s to our eﬃcient dyna mic subsumption approach. By a utomatically counting the p oints we found that 365 instances a re solved mor e eﬃciently thro ugh dyna mic subsumption. In some cases the gain is up to 1 order of magnitude. Of co urse, ther e exists instances wher e subsumption decreases the p erformances of M inisat (178 ins tances). The right part of the Figure 2 shows the same results with a diﬀerent rep- resentation which gives for each technique the num ber of solved instances (# instances) in less than t seco nds. This Figure conﬁrms the eﬃciency o f our dy- namic subsumption approa ch o n these problems. O n several cla sses the num ber of removed literals is very imp orta nt e.g., x * , QG * , php * , parity * . On the genurq * , mod * , and urquhart * the proble m is s impliﬁed during each conﬂict analys is . Figure 3 s hows result for Rsa t and Rs at+DS . Overall we ca n see that the ad- dition of our dynamic subsumption pro cess to R sat improv es the p er formance. The ﬁne a nalysis of the left part of Figure 3 showed that Rsa t+DS solves 327 instances mor e eﬃcient ly than Rsat , which so lves 219 problems more eﬃciently than its opp onent. Int erestingly we can rema rk that the p erformances of Rsat and Rsat+DS is worse than the o nes o f Minisat a nd Mini sat+DS . This comes fro m the rapid res ta rt strateg y used by this algor ithm which do es no t pay oﬀ on crafted problems. 0.0001 0.001 0.01 0.1 1 10 100 1000 10000 100000 0.0001 0.001 0.01 0.1 1 10 100 1000 10000 100000 Rsat+DS Rsat 0 2000 4000 6000 8000 10000 12000 250 300 350 400 450 time (seconds) # instances Rsat Rsat+DS Fig. 3. Crafted p roblems: Rsat vs Rsat+DS 4.2 Industrial problem s With these problems, the time limit was set to 3 hours. T able 1 provides detailed results on the SA T industria l problems fr om the Sat-comp etition 200 7 a nd Sat- Race 200 8. The ﬁrst column repres e n ts the instances families . The sec ond column (#inst.) indicates the total num ber of instances in each family . F ollowing columns present results for res pe c tively Rsat , R sat+DS , Minisat , and Minisat+ DS . In each of these columns the ﬁrst num ber represents the n um ber of instances solved, and the num ber in par enth esis represents the num ber of instances solved more quic kly than the opp onent. The last row of the ta ble g ives the total of each column. W e can see that Rsa t+DS a nd Minisat+DS ar e in general faster and solves mo re problems than Rsat a nd Minisat r esp ectively . T able 2 , fo cuses o n some industrial families. In these families , the sp eed- up are relatively imp or tant. F or ins ta nce, if w e consider the vmp c family , w e can see that o ur dynamic s impliﬁca tion a llows a one or der of mag nitude im- prov emen t with Rsa t+DS (instances 2 4, and 25 ). On the same family , on the 9 s olved instances by Rsat+DS and Rsat , Rsat+D S is b etter on 8 instances. While Minisat+DS is b etter than Minisa t on 5 instances among the 7 so lved instances. Overall, o ur exp eriments allow us to demonstrate tw o things . Firs t our tech- nique do es no t degr ade and often improv e the p erforma nce o f DPLLs on in- dustrial pro blems. Second, it enha nces the applicability of these alg orithms on classes of problems which a r e ma de to be challenging for them. Since the imple- men ting of our a lgorithm is ra ther simple, we think that overall, it r epresents an int eresting contribution for the robustness of mo dern DPLLs. 5 Related W orks In Da rras et al. [5 ], the autho rs prop os ed a prepro cess ing ba s ed on unit pr opaga- tion for sub-clauses deduction. Consider ing the implication gr a ph ge ne r ated by the constraint propa gation pro cess a s a resolution tree, the prop osed approach familie s # inst. Rsat Rsat+DS Minisat Mini sat+DS IBM * 53 15(7) 17 ( 10 ) 15( 10 ) 15(7) APro * 16 12( 7 ) 12(5) 14 (6) 13( 8 ) mizh * 10 10( 7 ) 10(3) 10(5) 10(5) P artial * 20 6(2) 7 ( 5 ) 1(0) 2 ( 2 ) total * 20 13(6) 13( 8 ) 10 (5) 9( 6 ) dated * 20 15(6) 16 ( 10 ) 14 ( 10 ) 13(4) braun * 7 4(1) 4( 3 ) 5(2) 5( 3 ) vele v * 10 2 (0) 2( 2 ) 2 ( 2 ) 1(0) sort * 5 2( 2 ) 2(0) 2(0) 2( 2 ) manol * 10 8(3) 8( 5 ) 8( 5 ) 9 (4) vmp c * 10 9(1) 9( 8 ) 6(2) 7 ( 5 ) clause * 5 3( 2 ) 3(1) 3(0) 3( 3 ) cub e * 4 4(2) 4(2) 4(1) 4( 3 ) gold * 4 2 ( 2) 2 ( 0 ) 2(0) 2( 2 ) safe * 4 2(0) 2( 2 ) 1(0) 1( 1 ) simon * 5 5 ( 4 ) 5 (1) 5( 3 ) 5(2) block * 2 2( 2 ) 2(0) 2(0) 2( 2 ) dspam * 10 10(5) 10(5) 10(5) 10(5 ) sc hup * 3 3(2) 3(1) 3( 2 ) 3(1) p ost * 10 8(3) 8( 5 ) 5(3) 6 (3) ibm * 20 20(6) 20( 14 ) 19(6) 19( 13 ) T otal 248 155(70) 159 ( 90 ) 141(67) 141( 81 ) T abl e 1. In dustrial problems instances Rsat Rsat+DS Min isat Minisat+D S vmp c 33 5540 1562 – – vmp c 29 2598 1302 – 1252 vmp c 30 366 105 3111 2039 vmp c 27 593 327 1159 637 vmp c 31 – – – – vmp c 25 39 1 830 318 vmp c 26 182 69 1239 1235 vmp c 34 3366 944 – – vmp c 24 43 8 82 210 vmp c 28 173 488 3859 547 8 partial-5-11-s 931 176 – 2498 partial-5-13-s 503 71 3248.38 669 partial-5-15-s 737.064 825 – – partial-10-11-s 1242 875 – – partial-5-19-s 1134 498 – – partial-5-17-s 7437.82 10610 – – partial-10-13-s – 3237 – – ibm-2002-04r-k80 90 33 113 152 ibm-2002-11r1-k45 67 29 102 65 ibm-2002-18r-k90 265 157 1044 769 ibm-2002-20r-k75 36 185 2112 668 ibm-2002-22r-k60 738 691 5480 3434 ibm-2002-22r-k75 363 349 1109 688 ibm-2002-22r-k80 285 298 894 642 ibm-2002-23r-k90 1477 965 7127 2670 ibm-2002-24r3-k100 273 256 133 249 ibm-2002-25r-k10 3104 311 8 2877 3172 ibm-2002-29r-k75 353 248 272 1107 ibm-2002-30r-k85 3853 592 – – ibm-2002-31 1r3-k30 1203 652 1150 998 ibm-2004-01-k90 114 30 251 726 ibm-2004-1 11-k80 394 222 559 329 ibm-2004-23-k100 326 687 3444 2743 ibm-2004-23-k80 465 563 2060 1584 ibm-2004-29-k25 290 210 1061 1017 ibm-2004-29-k55 533 16 558 124 ibm-2004-3 02 3-k95 1 2 1 2 T abl e 2. Zo om on industrial families deduces sub-clauses from the original formula. Howev er, their pro p o sed dyna mic version is clea rly time consuming. The ex pe r imental ev aluation is only given in term of nu m ber o f no des. In [16], an algorithm for main taining a subsumption-free CNF clause database is presented. It eﬃciently detects and remov es s ubsumption when a lea rnt cla use is added. Additionally , the algorithm compac ts the database greedily b y recur - sively applying resolutio ns in order to decrement the size of the data base. Conﬂict-clause shrinking was intro duced in Minisat 1.14 [7]. It is also im- plement ed in PicoSA T [4]. It r e moves litera ls from learnt clauses by resolving recursively with clauses of the implica tion gra ph. Remark that in the prev ious exp eriments, our base solvers Minisat and Rsat implement this technique. 6 Conclusion This pap er pr e sents a new subs umption technique for Bo olea n CNF formulae. It makes an original use o f learning to reduce origina l or lea r nt clauses. At each conﬂict, and during the a sserting clause deriv a tion pro cess , subsumption betw een the g enerated resolvents and some clauses enco ded in the implication graph is c hec ked using an eﬃcien t suﬃcient condition. Interestingly , s inc e our subsumption technique relies on the clauses used in the deriv a tion of an ass erting clause, it tends to simplify pa rts of the for m ula iden tiﬁed as impo rtant by the activity-based search strategy . Exp erimental results show that the integration of our metho d within tw o state-of-the-ar t SA T solvers Minisat and Rsat par ticularly b eneﬁts to cr afted problems and achiev es interesting improv emen ts on several indus tr ial families. As a future w ork, we plan to in v estigate how to eﬃciently extend our a p- proach to achieve ex haustive clauses subs umption. Another int eresting path of resear ch would be to exploit our subsumption framework to ﬁne tune the ac- tivit y bas ed stra tegy . Indeed, ea ch time a literal is eliminated, this mea n that a new conﬂict clause is derived and all the resolven ts used in such deriv ation ar e useless and c an b e dropp ed from the implication gr aph. References 1. R ob erto J. Bay ardo, Jr. and Rob ert C. S chrag . Using CSP look-back techniques to solve real-wo rld SA T instances. In Pr o c e e dings of the F ourte enth National Con- fer enc e on Artiﬁcial I ntel li genc e (AAAI’ 97) , pages 203–208, 1997. 2. Paul Beame, Henry A. K autz, and Ashish Sabharwa l. Understanding t he p ow er of clause learning. In Georg Gottlob and T oby W alsh, editors, IJCAI , p ages 1194– 1201. Morgan Kaufmann, 2003. 3. A Biere and N . E´ en. Eﬀective preprocessing in sat through va riable and clause elimination. In Pr o c e e di ngs of the Eighth International Confer enc e on The ory and Applic ations of Satisﬁabili ty T esting (SA T’05) , 2005. 4. A rmin Biere. Picosat essen tials. Journal on Satisﬁability, Bo ole an Mo deling and Computation (JSA T) , 4(1):75–97, 2008. 5. S . Darras, G. Deq uen, L. Devendeville, B. Mazure, R. O stro wski, and L. Sa ¨ ıs. Using Bo olean constraint propagation for sub-clauses deduction. In Pr o c e e dings of the Eleventh International C onfer enc e on Principles and Pr actic e of Constr ai nt Pr o gr amming(CP’05) , pages 757–761, 2005. 6. M. Da vis, G. Logemann, and D. W. Lov eland. A machine program for theorem- proving. Communi c ations of the ACM , 5(7):394–397, 1962. 7. N . Een and N. S¨ orensson. Minisat - a sat solv er with conﬂict- clause minimization. In Pr o c e e dings of the Eighth International Confer enc e on The ory and Applic ations of Satisﬁabili ty T esting (SA T’ 05) , 2005. 8. N . E´ en and A . Biere. Eﬀective prepro cessing in S A T through va riable and clause elimination. In Pr o c e e di ngs of the Eighth International Confer enc e on The ory and Applic ations of Satisﬁabili ty T esting (SA T’05) , pages 61–75, 2005. 9. N iklas E´ en and Niklas S¨ orensson. An exten sible sat-solver. In Pr o c e e dings of the Sixth International Confer enc e on The ory and Applic ations of Satisﬁability T esting (SA T ’ 03) , p ages 502–518, 2002. 10. Carla P . Gomes, Bart Selman, and Henry Kaut z. Bo osting combinatorial search through randomization. In Pr o c e e dings of the Fi fte enth National Confer enc e on Ar tiﬁcial I ntel li genc e (AAAI’97) , pages 431–437, Madison, Wisconsin, 1998. 11. H . Kautz, E. Horvitz, Y. Ru an, C. Gomes, and B. S elman. D ynamic restart p oli- cies. In Pr o c e e di ngs of the Eighte enth National Confer enc e on Artiﬁcial Intel ligenc e (AAAI’02) , pages 674–682 , 2002. 12. Joao P . Marques-Silv a and Karem A . Sak all ah. GRASP - A New Searc h Algo- rithm for Satisﬁability . In Pr o c e e di ngs of IEEE/ACM I nternational Confer enc e on Computer-A ide d Design , pages 220–227 , Nov em b er 1996. 13. M. W. Moskew icz, C. F. Madigan, Y. Zhao, L. Zhang, and S. Malik. Chaﬀ: En- gineering an eﬃcient SA T solver. In Pr o c e e dings of the 38th Design Automation Confer enc e (DA C’ 01) , p ages 530–535, 2001. 14. Kn ot Pipatsrisa w at and Adnan Darwiche. A ligh t w eigh t comp onent cac h ing sc heme for satisﬁabilit y solv ers. In Pr o c e e di ngs of 10th International C onfer enc e on The ory and Appl i c ations of Satisﬁability T esting(SA T) , p ages 294–299 , 2007. 15. Sath iamoorthy Subbaray an and Dhira j K. Pradhan. N iVER: Non-increasing v ari- able elimination resolution for prepro cessing SA T instances. In Pr o c e e dings of the Seventh International Confer enc e on The ory and Applic ations of Satisﬁability T esting (SA T’04) , pages 276–291, 2004. 16. Lintao Zhang. On su b sumption remov al and on- the-ﬂy cnf simpliﬁcation. In SA T’2005 , pages 482–489, 2005.

Learning for Dynamic subsumption

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment