cs0211012/paper.tex
1: \documentclass[11pt]{article}
2: %\usepackage{latexsym,amsmath,amsfonts}
3: 
4: \usepackage{times}
5: %\usepackage{palatino}
6: 
7: \usepackage{amssymb}  
8: 
9: \usepackage{epsfig} 
10: %\usepackage{psfig} 
11: \setlength{\textheight}{9.2 in}
12: \setlength{\textwidth}{6.55 in}
13: \setlength{\oddsidemargin}{0 in}
14: \setlength{\evensidemargin}{0 in}
15: \setlength{\topmargin}{-0.5 in}
16: \setlength{\parskip}{2pt}     
17: 
18: \newcommand{\tinyspacing}{\baselineskip = 0.7\normalbaselineskip}
19: \newcommand{\smallspacing}{\baselineskip = \normalbaselineskip}
20: \newcommand{\morespacing}{\baselineskip = 1.1\normalbaselineskip}
21: \newcommand{\newspacing}{\baselineskip=1.04\normalbaselineskip}
22: \newcommand{\oldspacing}{\baselineskip= 1.0\normalbaselineskip}      
23: 
24: 
25:   \renewcommand*{\Pr}{\mathop{\mathrm{Prob}}}
26: 
27:   
28: 
29:   \def \qedbox{\hfill\vbox{\hrule\hbox{\vrule 
30: height1.3ex\hskip0.8ex\vrule}\hrule}}
31: 
32:   
33: 
34:   \newcommand{\goesto}{\rightarrow}
35:   \newtheorem{theorem}{Theorem}
36:   \newtheorem{definition}{Definition}
37:   \newtheorem{proposition}{Proposition}
38:   %\newtheorem{lemma}{Lemma}
39:   \newcommand*{\proof}{\noindent {\bf Proof}.\,\,}
40: 
41:  
42: 
43: \def\NP{\rm NP}
44: \def\AND{\wedge}
45: \def\OR{\vee}
46: \def\oper{\circ}
47: \def\goesto{\rightarrow}
48: \def\implies{\Rightarrow}
49: \def\zeroone{\{0,1\}} 
50:  \def\sstar{\zeroone^{*}} 
51: \def\L{\langle}
52: \def\R{\rangle}
53: \def\HYP{\hbox{-}}
54: \def\IFF{\leftrightarrow}
55: \def\Ldef{\buildrel \rm def \over \leftrightarrow}
56: \def\Edef{\buildrel \rm def \over =}
57: \def\almostall{\hbox{\rlap{$_{\thinspace\forall}$}{$^{^\infty}$}}}
58: \def\infoften{\hbox{\rlap{$_{\thinspace\exists}$}{$^{^\infty}$}}}
59: \def\N{{\bf N}}
60: \def\setdelta{\bigtriangleup}
61: \def\cminus{\dot{-}} 
62: \def\plusminus{\pm} 
63: \def\PR{{\rm Pr}}
64: \def\HSAT{{\rm HORN}\hbox{-}{\rm SAT}}
65: \def\PUR{{\rm PUR}}
66: \def\beginproof{\noindent{\bf Proof.}\quad}
67: \def\endproof{}
68: 
69:  
70: 
71: 
72: 
73: \newtheorem{lemma}{Lemma}[section]
74: \newtheorem{theo}[lemma]{Theorem}
75: \newtheorem{cor}{Corollary}
76: \newtheorem{prop}[lemma]{Proposition}
77: %\newtheorem{fact}[lemma]{Fact}
78: \newtheorem{example}{Example}
79: \newtheorem{obs}{Observation}
80: \newtheorem{claim}{Claim}
81: \newtheorem{defi}{Definition}
82: %\newtheorem{obs}[lemma]{Observation}
83: \def\qed{\hfill$\Box$\newline\vspace{5mm}}
84: \newenvironment{PROOF}{\noindent{\bf Proof:}}{{\qed}}
85: \newtheorem{conj}{Conjecture}
86: 
87: \bibliographystyle{unsrt}
88: 
89: %\setpapersize{USletter}
90: %\setmarginsrb{1in}{1in}{1in}{1in}{0pt}{0mm}{0pt}{0mm}
91: 
92: \begin{document}
93: 
94: \pagestyle{empty}
95: 
96: \title{Phase transitions and {\em all that}}
97: \author{Gabriel Istrate\thanks{e-mail: istrate@lanl.gov,
98:         NISAC, National Infrastructure Simulation Analysis Center, 
99:         Los Alamos National Laboratory,
100:         Mail Stop M 997, Los Alamos, NM 87545, U.S.A.}}
101: \date{}
102: 
103: \maketitle 
104: \thispagestyle{empty}
105: \section{Introduction}
106: 
107: Since the experimental paper of Cheeseman, Kanefsky and Taylor 
108: \cite{cheeseman-kanefsky-taylor}
109: {\em phase transitions in combinatorial problems} held the promise to 
110: shed 
111: light on the ``practical'' algorithmic complexity of combinatorial 
112: problems. However, the connection conjectured in 
113: \cite{cheeseman-kanefsky-taylor} was easily seen to be inaccurate. A much more realistic
114: possible connection has been highlighted by the results (based on 
115: experimental evidence and nonrigorous arguments from Statistical Mechanics) 
116: of Monasson et 
117: al. \cite{2+p:nature} (see also \cite{2+p:rsa}). These results 
118: supported 
119: the conjecture that it is {\em first-order phase transitions} that 
120: have algorithmic implications for the complexity of restricted 
121: classes of algorithms, including the important class of 
122: {\em Davis-Putnam-Longman-Loveland (DPLL) algorithms} \cite{beame:dp}. 
123: 
124: There exists, indeed, a nonrigorous argument supporting this 
125: conjecture:
126: phase transitions amount to nonanalytical behavior of a certain  
127: {\em order parameter}; the phase transition is {\em first order} if the 
128: order parameter is actually 
129: discontinuous. At least for random $k$-SAT \cite{monasson:zecchina} the 
130: order parameter suggested by Statistical Mechanics considerations has a 
131: purely combinatorial interpretation: it is the {\em backbone} of the 
132: formula, the set of 
133: literals that assume the same value in all optimal assignments. 
134: But intuitively one can relate (see e.g. 
135: the presentation of this argument by Achlioptas, Beame and Molloy 
136: \cite{achlioptas:beame:molloy:slides}) 
137: the size of the backbone to the complexity of DPLL algorithms, 
138: when run on random $k$-SAT instances slightly above the phase 
139: transition: All literals in the 
140: backbone require well-defined values in order to satisfy the 
141: formula. But a DPLL algorithm has very few ways to know what those 
142: ``right'' values are. If w.h.p. the backbone of formulas above the 
143: transition contains a positive 
144: fraction of the literals that is bounded away from zero as we approach 
145: the transition (which happens in a case of a first-order phase 
146: transition) then, 
147: intuitively, DPLL will misassign a variable having $\Omega(n)$ height 
148: in the 
149: tree representing the behavior of the algorithm, and 
150: will be forced to backtrack on the given variable. 
151: {\em The conclusion of this intuitive argument is that a 
152: first-order phase transition implies a $2^{\Omega(n)}$ lower bound for 
153: the 
154: running time of any DPLL algorithm, valid with high probability for 
155: random instances located slightly above the transition.  }
156: 
157: While previous rigorous results 
158: \cite{2+psat:ralcom97,scaling:window:2sat,achlioptas:beame:molloy}, supported these intuitions, to date, the 
159: extent of a connection between first-order phase transitions and 
160: algorithmic complexity was unclear. 
161: 
162: {\bf The goals of this paper are
163: \begin{enumerate}
164: \item  To remedy this, and formally establish a connection between 
165: first-order phase 
166: transitions and the resolution complexity of random satisfiability 
167: problems, and
168: \item To take steps towards obtaining a complete classification of the 
169: order of phase transition in generalized satisfiability problems. 
170: \end{enumerate} 
171: }  
172: 
173: To accomplish these goals 
174: 
175: \begin{enumerate} 
176: \item we obtain (Theorem~\ref{dichotomy:threshold}) a complete 
177: characterization of sharp/coarse thresholds in the random generalized 
178: satisfiability model due to Molloy \cite{molloy-stoc2002}. 
179: ``Phyisical'' arguments (see discussion below) imply that it makes no 
180: sense to study the order of the phase transition unless the problem 
181: has a sharp threshold. 
182: \item we rigorously prove (Theorem~\ref{3sat:first-order}) that random 
183: 3-SAT has a first-order phase transition. We extend this result in 
184: several ways: first (Theorem~\ref{2+p-sat:first-order}) to 
185: random $(2+p)$-satisfiability, the original problem from 
186: \cite{2+p:nature}, obtaining further theoretical support to the 
187: heuristic results of \cite{2+p:nature}. Second we give a sufficient 
188: condition (Theorem~\ref{sufficient:first-order}) for the existence of 
189: a first-order phase transition. We then show 
190: (Theorem~\ref{implicates:first-order}) that all problems whose 
191: constraints have no implicates of size at most two satisfy this 
192: condition.\item we show that in all the cases where  
193: we can prove the 
194: existence of a first-order phase transition, such problems have   
195: a $2^{\Omega(n)}$ lower bound on their resolution complexity 
196: (and hence the complexity of DPLL algorithms as well \cite{beame:dp}). 
197: Indeed, the two phenomena ($2^{\Omega(n)}$ resolution complexity and 
198: the existence of a first-order phase transition) have 
199: common causes. 
200: \item in contrast, we show (Theorem~\ref{second:order}) that, for {\em 
201: any generalized 
202: satisfiability problem},  a second-order phase transition implies, for 
203: every $\alpha >0$, a $O(2^{\alpha \cdot n})$ upper 
204: bound on the resolution complexity of their random instances (in the 
205: region where most formulas are unsatisfiable).  
206: \end{enumerate} 
207: 
208: \section{Preliminaries} 
209: 
210: Throughout the paper we will assume familiarity with the general 
211: concepts of phase transitions in combinatorial problems (see e.g. 
212: \cite{martin:monasson:zecchina}), 
213: random structures \cite{bol:b:random-graphs}, proof complexity 
214: \cite{beame:proof:survey}. Some papers whose concepts and methods we use in 
215: detail (and we assume greater familiarity with) 
216: include \cite{friedgut:k:sat}, \cite{chvatal:szemeredi:resolution},  
217: \cite{ben-sasson:resolution:width}. 
218: 
219: Consider a monotonically increasing problem $A=(A_{n})$, 
220:  under the constant probability model $\Gamma(n,p)$, that independently 
221: sets to 1 with probability $p$ 
222: each bit of the random string. As usual, for 
223: $\epsilon >0$ let $p_{\epsilon}= p_{\epsilon}(n)$ define the canonical
224: probability such that $\Pr_{x \in \Gamma(n,p_{\epsilon}(n))}[x \in
225: A]= \epsilon$. 
226: 
227: The probability that a random sample $x$ satisfies property $A$ (i.e. 
228: $x\in A$) is a monotonically increasing function of $p$. {\em Sharp 
229: thresholds} are those for which this function has a ``sudden jump'' from 
230: value 0 
231: to 1:  
232: 
233: \begin{definition} \label{sharp}
234: Problem $A$ has a {\em sharp threshold} iff for every $0<\epsilon < 
235: 1/2$, we have $\lim_{n\goesto \infty} \frac{p_{1-\epsilon}(n)- 
236: p_{\epsilon}(n)}{p_{1/2}(n)} = 0$. 
237: $A$ has {\em a coarse threshold} if for some $\epsilon > 0$ it holds 
238: that
239: $\underline{\lim}_{n\goesto \infty} \frac{p_{1-\epsilon}(n)- 
240: p_{\epsilon}(n)}{p_{1/2}(n)} > 0$. 
241: \end{definition} 
242: 
243: For satisfiability problems (whose complements are monotonically 
244: increasing) the constant probability model amounts to adding every constraint 
245: (among those 
246: allowed by the syntactic specification of the model) to the random 
247: formula independently 
248: with probability $p$. Related definitions can be given for the other 
249: two models for 
250: generating random structures, 
251: the {\em counting model} and {\em the multiset model} 
252: \cite{bol:b:random-graphs}. Under reasonable 
253: conditions \cite{bol:b:random-graphs} 
254: these models are equivalent, and we will liberally switch 
255: between them. In particular, for 
256: satisfiability problem $A$, and an instance $\Phi$ of $A$, 
257: $c_{A}(\Phi)$ will denote its {\em constraint density}, the ratio 
258: between 
259: the number of clauses and the number of variables of $\Phi$. To specify 
260: the random model in this 
261: latter cases we have to specify the constraint density as a function of 
262: $n$, the number of variables. 
263: We will use  $c_{A}$ to denote the value of the constraint density 
264: $c_{A}(\Phi)$ (in the counting/multiset 
265: models) corresponding to taking $p=p_{1/2}$ in the constant probability 
266: model. 
267: $c_{A}$ is a function on $n$ that is believed to tend to a constant 
268: limit as $n\goesto \infty$. However, Friedgut's proof 
269: \cite{friedgut:k:sat} of a sharp threshold in $k$-SAT (and our results) leave this issue 
270: open.   
271: 
272: 
273: The original investigation of the order of the phase transition in 
274: $k$-SAT used an order parameter called {\em the backbone}.  
275: Bollob\'{a}s et al. \cite{scaling:window:2sat} have investigated the 
276: order of the phase transition in 2-SAT under a different order parameter, 
277: a ``monotonic version'' of the 
278: backbone called {\em the spine}. 
279: 
280: \begin{equation}\label{spine:initial} 
281: Spine(\Phi) = \{ x\in Lit | (\exists) \Xi \subseteq \Phi, \Xi \in SAT,
282: \Xi \AND \{\overline{x}\}\in \overline{SAT}\}. 
283: \end{equation}
284: 
285: They showed that random 2-SAT has a continuous (second-order) phase
286: transition: the size of the spine, normalized by dividing it by the 
287: number of 
288: variables, approaches zero ( 
289: as $n\goesto \infty$) for $c<c_{2-SAT}=1$, and is continuous 
290: at $c=c_{2-SAT}$. By contrast, nonrigorous 
291: arguments from Statistical Mechanics \cite{monasson:zecchina} imply the 
292: fact  
293: that for $3-SAT$ the spine jumps discontinuously from zero to 
294: positive values at the transition point $c=c_{3-SAT}$ 
295: (a first-order phase transition).  
296: 
297: It is easy to see that the intuition concerning the connection between 
298: the 
299: complexity of DPLL algorithms and the size of the backbone (discussed 
300: briefly in the introduction) extends to the spine as well. In this 
301: paper 
302: whenever we will discuss the order of a phase transition we will do it  
303: with respect to this latter order parameter.  
304: 
305: We would like to obtain a complete classification of 
306: the order of the phase transition in random satisfiability 
307: problems. 
308: A preliminary problem we have to deal with is characterizing those 
309: problems that 
310: have a sharp threshold: indeed, Physics considerations require that, 
311: in order that the study of the (order of the) phase transition to be 
312: meaningful, the order parameter (in the case of $k$-SAT the spine) has 
313: to be, w.h.p.,  concentrated around its expected value 
314: (in Physics parlance it is a {\em self averaging quantity}), and it is 
315: zero up to a certain value of the control parameter 
316: (in our case constraint density $c$) and positive above it. 
317: In the case of $k$-SAT these conditions imply the fact that 
318: $k$-SAT has a sharp threshold. The argument (a ``folklore'' one) 
319: can be formally 
320: expressed by the following 
321: 
322: \begin{lemma}\label{spine-threshold} Let $c$ be an arbitrary {\em 
323: constant} value for the constraint 
324: density function.  
325: \begin{enumerate}
326: \item If $c< \underline{lim}_{n\goesto \infty} c_{k-SAT}(n)$ then 
327: $\lim_{n \goesto \infty} \frac{|Spine(\Phi)|}{n} =0$. 
328: \item If for some $c$ there exists $\delta > 0$ such that w.h.p. (as 
329: $n\goesto \infty$) $\frac{|Spine(\Phi)|}{n} > \delta$ then 
330: $\lim_{n\goesto \infty}
331:  \Pr[\Phi \in SAT]= 0$, that is $c> \overline{lim}_{n\goesto \infty} 
332: c_{k-SAT}(n)$. 
333: \end{enumerate}
334: \end{lemma}
335: 
336: The argument is generic enough to extend to {\em all} constraint 
337: satisfaction problems. So a necessary condition for the study of 
338: the phase transition to be meaningful is that the problem have a 
339: sharp threshold.     
340: 
341: \section{Coarse and sharp thresholds of random generalized 
342: satisfiability 
343: problems} 
344: 
345: 
346: In this section we obtain a complete classification of thresholds of 
347: random satisfiability problems, under 
348:  Molloy's recent model of random constraint satisfaction problems from 
349: \cite{molloy-stoc2002} (specialized to satisfiability problems, i.e. 
350: problems with domain $\{0,1\}$).  This affirmatively solves 
351: an open problem raised in \cite{creignou:daude:sat2002}.  
352: 
353: 
354: \begin{definition}\label{model} 
355: Consider the set of all $2^{2^{k}}-1$ potential nonempty binary 
356: constraints on $k$ variables 
357: $X_{1}, \ldots, X_{k}$. We specify a probability distribution ${\cal 
358: P}$ which selects a single 
359: random constraint, and let ${\cal C}= supp({\cal P})$ be the set of 
360: constraints on which ${\cal P}$ 
361: assigns positive probability. 
362: 
363: A random formula from $SAT_{n,M}({\cal P})$ is specified by the 
364: following procedure: 
365: \begin{itemize} 
366: \item $n$ is the number of variables. 
367: \item $M$ is the number of clauses, chosen by the following procedure: 
368: first select, uniformly at 
369: random and with repetition $m$ hyperedges of the uniform hypergraph on 
370: $n$ variables. 
371: \item for each hyperedge choose a random ordering of the variables 
372: involved. Choose a random 
373: constraint according to ${\cal P}$ and apply it on the list of 
374: (ordered) variables. 
375: \end{itemize} 
376: $SAT({\cal C})$ refers to the random model corresponding to ${\cal P}$ 
377: being the uniform distribution on ${\cal C}$. 
378: \end{definition} 
379: 
380: It turns out we face a technical difficulty when studying sharp and 
381: coarse thresholds in Molloy's model; it cannot  be directly mapped onto 
382: the 
383: constant probability model for which the notion of a sharp threshold in 
384: Definition~\ref{sharp} works. The definition of a sharp threshold we 
385: need 
386: to employ is the one from \cite{molloy-stoc2002} 
387: 
388: \begin{definition}\label{sharp:2} 
389: $SAT({\cal P})$ is said to have {\em a sharp threshold of 
390: satisfiability} if there exists a function $c(n)$ bounded away from 0 such that, for 
391: any $\epsilon >0$ if $M<(c(n)-\epsilon) n$
392: then $SAT_{n,M}({\cal P})$ is a.s. satisfiable and if 
393: $M>(c(n)+\epsilon) n$ then $SAT_{n,M}({\cal P})$ is a.s. unsatisfiable. On the other 
394: hand, if there exist two functions $M_{1}(n), M_{2}(n)$ 
395: such that $M_{1}(n)/M_{2}(n)$ is bounded away from zero, and the 
396: satisfaction probability of random instances from $SAT_{n,M_{1}}({\cal P})$, 
397: $SAT_{n,M_{2}}({\cal P})$ is bounded away from 
398: both 0 and 1 then $SAT({\cal P})$ is said to have {\em a coarse 
399: threshold.} 
400: \end{definition} 
401: 
402: However, just as in \cite{molloy-stoc2002} (where this was done in the 
403: case when ${\cal P}$ is the uniform distribution), for $k\geq 3$ 
404: one can map Molloy's model onto a modified version of the constant 
405: probability model, defined as follows: 
406: Let $p_{1}, \ldots p_{r}$ be positive numbers between zero and 1. A 
407: random sample $x$ from the model $\Gamma_{p_{1},\ldots p_{r}}(n,p)$ is 
408: obtained in the 
409: following way: divide the bits of $x$ into $r$ equal groups. Set each 
410: of the bits in the $i$'th group to 1 independently with probability 
411: $p\cdot p_{i}$. For this model the definitions of $p_{\epsilon}$ and sharp/coarse threshold from Definition~\ref{sharp} carry over, and are equivalent to 
412: those from Definition~\ref{sharp:2}. 
413: 
414: Indeed, let $r$ be the cardinality of the support of distribution ${\cal P}$, 
415: and $p_{1}, \ldots, p_{r}$ be the associated positive 
416: probabilities. 
417: 
418: In its general setting Molloy's model is specified as follows: divide 
419: the potential constraints into 
420: groups of $rk!$ constraints, corresponding to all possible applications 
421: of the $r$ constraint templates on a fixed set of $k$ variables. 
422: For each such group, independently with probability $p$, we make the 
423: decision to include at least one of the constraints in the group 
424: with probability $p=\frac{M}{rk!{{n}\choose {k}}}$ (going from $M$ 
425: clauses to including each potential edge independently with probability $p$ 
426: can be done just as in the uniform case from \cite{molloy-stoc2002}). 
427: 
428: Each realization of  constraint constraint template $i$ is chosen with 
429: probability probability $p_{i}/k!$. Denote this model by $M(n,p,p_{1}, 
430: \ldots, p_{r})$. 
431: 
432: Defining $f(x) = [(1+x pp_{1}/k!)\cdot (1+xpp_{2}/k!) \cdot \ldots 
433: \cdot (1+xpp_{r})/k!]^{k!}- x$ we have $f(1)>0 $ and, since (by a simple 
434: calculus argument) 
435: the minimum of $f(x)$ over the choices of $p_{i}\geq 0$, $\sum p_{i}=1$  
436:  is obtained when one of them 
437: is 1 and the others are zero,  
438: \[
439: f(\frac{1}{1-p})\geq [1+\frac{p}{k!(1-p)}]^{k!}-\frac{1}{1-p}\geq 0.  
440: \]
441: 
442:  
443: Let $\alpha= \alpha(n) >0$  be the smallest solution of the equation 
444: $f(\alpha) =0$. Thus $\frac{1}{1-p} \leq  \alpha$. 
445: 
446: 
447: 
448: Define, for $i=1,r$, 
449: \[
450: p^{\prime}_{i}= \frac{1/k! \cdot \alpha \cdot p_{i}}{1+1/k! \cdot 
451: \alpha pp_{i}}
452: \]
453: 
454: 
455: 
456: \begin{claim}
457: The following hold for any $p=\theta(n^{1-k})$:  
458: \begin{enumerate}
459: \item For every formula $\Phi$ such that no two constraints on the same 
460: set of variables appear in it, 
461: \[
462: \Pr_{M(n,p,p_{1}, \ldots, p_{r})}(\Phi)\geq 
463: \Pr_{\Gamma_{p^{\prime}_{1}, \ldots, p^{\prime}_{r}}(n,p)}[\Phi]. 
464: \]
465: Consequently 
466: \begin{eqnarray*}
467: \Pr_{M(n,p,p_{1}, \ldots, p_{r})}[SAT({\cal P})]\geq 
468: \Pr_{\Gamma_{p^{\prime}_{1}, \ldots, p^{\prime}_{r}}(n,p)}[SAT({\cal P})| \\ 
469: \mbox{ no two constraints on the same set of variables appear in }\Phi 
470: ].
471: \end{eqnarray*}
472: \item On the other hand, there exists $f(n) = 1+o(1)$ such that for 
473: every formula $\Phi$ such that no two constraints on the same set of 
474: variables appear in it, 
475: \[
476: \Pr_{M(n,p,p_{1}, \ldots, p_{r})}(\Phi)\leq f(n) 
477: \Pr_{\Gamma_{p^{\prime}_{1}, \ldots, p^{\prime}_{r}}(n,p)}[\Phi]. 
478: \]
479: Consequently 
480: \begin{eqnarray*}
481: \Pr_{M(n,p,p_{1}, \ldots, p_{r})}[SAT({\cal P})]\leq 
482: (1+o(1))\Pr_{\Gamma_{p^{\prime}_{1}, \ldots, p^{\prime}_{r}}(n,p)}[SAT({\cal P})| \\ 
483: \mbox{ no two constraints on the same set of variables appear in }\Phi 
484: ].
485: \end{eqnarray*}
486: \end{enumerate}
487: \end{claim}
488: 
489: Indeed, consider the set of constraints on a fixed set of given 
490: variables. The probability (under $\Gamma_{p^{\prime}_{1}, \ldots, 
491: p^{\prime}_{r}}$) that a given clause of type $i$ is 
492: included, and none of the others are is equal to 
493: $\frac{pp^{\prime}_{i}}{1- pp^{\prime}_{i}}\cdot 
494: [(1-pp^{\prime}_{1})\ldots (1-pp^{\prime}_{r})]^{k!} $. 
495: But
496: \[
497: \frac{pp^{\prime}_{i}}{1-pp^{\prime}_{i}}= \alpha pp_{i}/k!. 
498: \]
499: 
500: Also 
501: \[
502: 1-pp^{\prime}_{i}= \frac{1}{1+\alpha pp_{i}/k!}, 
503: \]
504: 
505: so, by the definition of $\alpha$, 
506:  
507: \[
508: [(1-pp^{\prime}_{1})\ldots (1-pp^{\prime}_{r})]^{k!}= \frac{1}{\alpha}.
509: \]
510: 
511: This means that the probability that in a given set of constraints 
512: exactly one constraint (of type $i$)  is chosen is equal to $pp_{i}/k!$, 
513: the same as in model $M$. On the other 
514: hand the probability that {\em no} constraint is chosen is equal to  
515: $[(1-pp^{\prime}_{1})\ldots (1-pp^{\prime}_{r})]^{k!}= \frac{1}{\alpha}$. 
516: But the same probability in model $M$ 
517: is $1-p$, and we know that $1-p \geq \frac{1}{\alpha}$. In both model 
518: decisions on different sets of $k$ variables are independent. The 
519: conclusion is that $M$ assigns a larger probability 
520: than $\Gamma_{p^{\prime}_{1}, \ldots, p^{\prime}_{r}}$ to any sample 
521: $x$ to which it assigns positive probability. Point (1) follows. 
522: 
523: Point (2) has a similar proof: by calculus the maximum value of $f(x)$ 
524: is obtained when the $p_{i}$'s are equal, so 
525: \[
526: 0 = f(\alpha) \leq (1+\frac{\alpha p}{rk!})^{rk!} - \alpha \leq 
527: e^{\alpha p} - \alpha. 
528: \]
529: 
530: Since $p=o(1)$, for large enough $n$ $e^{\alpha p} \leq 1+ \alpha p + 
531: (\alpha p)^{2}$, so $(\alpha p)^{2}+ \alpha (p-1) +1 > 0$, in other 
532: words
533: \[
534: \alpha (1-p) \leq 1+ (\alpha p)^{2}.
535: \]
536: 
537: But the ratio of the probabilities associated to any given $\Phi$ by 
538: $M$ and $\Gamma_{p^{\prime}_{1}, \ldots, p^{\prime}_{r}}(n,p)$ verifies 
539: \[
540: \frac{ \Pr_{M(n,p,p_{1}, \ldots, 
541: p_{r})}(\Phi)}{\Pr_{\Gamma_{p^{\prime}_{1}, \ldots, p^{\prime}_{r}}(n,p)}[\Phi]}\leq [\alpha (1-p)]^{rk! 
542: {{n}\choose {k}}}\leq (1+ (\alpha p)^{2})^{rk! {{n}\choose {k}}}. 
543: \]
544: Since $p=\theta(n^{1-k})$ and $k\geq 3$ the right-hand side is a 
545: function of $n$ that is $1+o(1)$. 
546: 
547: To prove the result it is enough to observe that for $k\geq 3$ the 
548: expected number of times a random formula in $\Gamma_{p^{\prime}_{1}, 
549: \ldots, p^{\prime}_{r}}(n,p)$ contains two different 
550: clauses on the same set of variables is $o(1)$, since that will imply 
551: that the satisfaction probabilities in the two models are related by a 
552: $1-o(1)$ factor. 
553: Indeed, this number is 
554: \[
555: {{n}\choose {k}} \cdot [\sum_{\alpha,\beta} 
556: \frac{pp^{\prime}_{\alpha}pp^{\prime}_{\beta}}{(k!)^{2}}], 
557: \]
558: 
559: where indices $\alpha,\beta$ span the set of different pairs of clauses 
560: from a group. 
561: Since each group is finite (contains $rk!$ clauses) and 
562: $p=\theta(n^{1-k})$, this expected value is $\theta(n^{2-k})$, which is $o(1)$ for 
563: $k\geq 3$. 
564: \qed
565: 
566:  
567: \begin{definition} 
568: Constraint $C_{2}$ is {\em an implicate of $C_{1}$}
569:  iff every satisfying assignment for $C_{1}$ satisfies $C_{2}$.  
570: \end{definition} 
571: 
572: 
573: \begin{definition} 
574: A boolean constraint $C$ {\em strongly depends on a literal} if 
575: it has an unit clause as an implicate. 
576: \end{definition} 
577: 
578: \begin{definition} 
579: A boolean constraint $C$ {\em strongly depends on a 2-XOR relation} if 
580: $\exists i,j\in \overline{1,k}$ 
581: such that constraint ``$x_{i}\neq x_{j}$'' is an implicate of $C$. 
582: \end{definition} 
583: 
584: Our result is: 
585: 
586: \begin{theorem}\label{dichotomy:threshold} 
587: Consider a generalized satisfiability problem $SAT({\cal P})$ (that is 
588: not 
589: trivially satisfiable by the ``all zeros'' or ``all ones'' 
590: assignment). Let ${\cal C}= supp({\cal P})$.   
591: \begin{enumerate}
592: \item if some constraint in ${\cal C}$ strongly depends on one 
593: component then $SAT({\cal P})$ has a coarse threshold. 
594: \item if some constraint in ${\cal C}$ strongly depends on a 
595: 2XOR-relation then $SAT({\cal P})$ has a coarse threshold. 
596: \item in all other cases $SAT({\cal P})$ has a sharp threshold. 
597: \end{enumerate}
598: \end{theorem}
599:  
600: \begin{proof}
601: 
602: \begin{enumerate}
603: \item 
604: Suppose some clause $C$ implies a unit clause. We claim that $SAT({\cal 
605: P})$ has a coarse threshold in the region where the expected number of 
606: clauses is $\theta(\sqrt{n})$. 
607: 
608: That the probability that such a formula is bounded away from zero in 
609: this region it is easy to see: consider a random formula with $c\sqrt n$ 
610: constraints, and let $H$ be the $k$-uniform formula hypergraph. 
611: The expected number of pairs of edges $C_{i}$, $C_{j}$ that have 
612: nonempty intersection is 
613: \[
614: {{c\cdot \sqrt n}\choose {2}}\cdot (1 - \frac{{{n-k}\choose {k} 
615: }}{{{n}\choose {k} }})\leq \frac{(ck)^{2}}{2}
616: \]
617: 
618: Therefore with constant positive probability (that depends on $c$), all 
619: vertices will have degree at most 1 in the hypergraph $H_{n}$, and the 
620: formula will be satisfiable. 
621: 
622: If, on the other hand, both positive and negative unit clauses 
623: are implicates of constraints in ${\cal P}$ then one can adapt the 
624: well-known lower bound on the probability of intersection of two 
625: random sets of size $\theta(\sqrt{n})$ to show that, with constant 
626: probability a random formula will contain two contradictory unit 
627: clauses as implicates, and be unsatisfiable. 
628: 
629: The proof is similar in the case when only one type of unit clauses 
630: (w.l.o.g 
631: assume it's the positive unit clauses) are implicates of constraints in 
632: ${\cal C}$. Since $SAT({\cal C})$ is not trivial there exists a 
633: constraint $C_{1}\in {\cal C}$ with an implicate of the type 
634: $\overline{x_{1}}\OR \ldots \OR \overline{x_{b}}$, $b\geq 2$. We deal first with the 
635: case when there exists a constraint 
636: $C_{2}\neq C_{1}$ such that $C_{2}$ has an unit clause as implicate. 
637: Then it is easy to construct a formula $F$ consisting of $b$ copies of  
638: $C_{1}$and one copy of $C_{2}$ that implies the (unsatisfiable) formula 
639: $\{x_{1}, \ldots, x_{b},\overline{x_{1}}\OR \ldots \OR 
640: \overline{x_{b}}\}$. 
641: It is easy to see that the expected number of copies of $F$ in a random 
642: instance of $SAT({\cal P})$ with $\theta(\sqrt{n})$ clauses is 
643: constant, 
644: so the probability that the instance is unsatisfiable is bounded away 
645: from zero. 
646: 
647: Finally, in the case when the only constraint in ${\cal C}$ that has an 
648: unit 
649: implicate is $C_{1}$. In this case one can use a trick similar to that 
650: used 
651: in the last paragraph of subsection~\ref{all:together}: we use half of 
652: the 
653: random copies of $C_{1}$ to imply (random) unit clauses, and the other 
654: half 
655: to imply (random) copies of $\overline{x_{1}}\OR \ldots \OR 
656: \overline{x_{b}}$. This way we can produce, with constant probability, a copy of 
657: the formula $F$. 
658: 
659: \item 
660: Suppose now that $C$ does not fall in the first case but 
661: has a 2XOR implicate. In this case Creignou and 
662: Daud\'{e} have shown when ${\cal P}$ is the uniform distribution 
663: (and this extends directly to the case of a general 
664: probability distribution as well) that $p_{1/2}= \Omega(n^{1-k}))$ and  
665: the expected number of constraints is $\theta{n}$. Let $c_{SAT({\cal 
666: P})}\cdot n$ be the expected number of constraints corresponding to 
667: $p_{1/2}$. Then there exists $\delta > 0 $ such that, for every $n$, 
668: $c_{SAT({\cal P})}= c_{SAT({\cal P})}(n) > \delta$. 
669: 
670: Let us consider a random formula with  $c\cdot n$ of constraints. 
671: By the well known result on triangles in random graphs it follows that 
672: with positive probability one can use $C$ to create a ``contradictory
673: triangle''. Therefore it is easy to see that for every $c>0$ 
674: the satisfaction probability is bounded away from 1. It is easy to see 
675: than this statement, together with the fact that $c_{SAT({\cal P})}(n) 
676: > \delta$ together imply that $SAT({\cal P})$ has a coarse threshold.  
677:    
678: \item 
679: We will concentrate in 
680: the sequel on the last one. As discussed previously, for $k\geq 3$ 
681: Molloy's model can be mapped onto 
682: a version of the constraint probability model. In the case $k=2$ we can 
683: establish the existence of a sharp 
684: threshold in a direct manner, by the same method as the one used by 
685: Chv\'{a}tal and Reed for 2-SAT \cite{mickgetssome} 
686: (the complete proof of this case will be presented in the full 
687: version).  Indeed, by the first two points of the Theorem, and the assumption 
688: $k=2$ 
689: the only possible constraints in ${\cal P}$ can be constraints $x\OR 
690: y$, $\overline{x} \OR \overline{y}$, $\overline{x}\OR y$, $x \OR 
691: \overline{y}$, $x=y$, and 
692: the first two are always present.   
693: 
694: Let us now consider the case $k\geq 3$, using the modified version of 
695: the constant probability model. We note first that there exists a simple 
696: observation
697:  that allows us to reduce the problem to the case when ${\cal P}$ is 
698: the uniform probability: the Friedgut-Bourgain 
699: result on sharp/coarse threshold properties in monotone problems 
700: \cite{friedgut:k:sat} uses the following result, an easy consequence of the 
701: Mean Value Theorem: 
702: if a monotonic property $A$ does {\em not}  have a sharp threshold 
703: (under model $\Gamma(n.p)$) then there exists $p^{*}=p(n)$ and a constant 
704: $C>0$ such that (for 
705: infinitely many $n$) 
706: 
707: \begin{equation}\label{coarse}
708: p^{*}\cdot I(p^{*}) < C,
709: \end{equation}
710: 
711: where $I(p^{*})=\frac{d\mu_{p}(A)}{dp}|_{p=p^{*}}$. 
712: 
713: The same argument works when $A$ is considered under model 
714: $\Gamma_{p_{1},\ldots p_{r}}(n,p)$. Moreover, it is an easy consequence of Russo's 
715: Lemma for $\Gamma_{p_{1},\ldots p_{r}}(n,p)$
716: that if equation~\ref{coarse} holds for $p^{*}$ and some tuple 
717: ${p_{1},\ldots p_{r}}$, then it also holds (with a different constant $C$) for  
718: $p^{*}$ and 
719: tuple $p_{1}= \ldots p_{r}=1/r$. In other words it is enough to obtain 
720: a contradiction to the assumption that $SAT({\cal P})$ did not have a 
721: sharp 
722: threshold in the case when ${\cal P}$ is the uniform probability, which 
723: is what we show next. 
724: 
725: \subsection{A base case} 
726: 
727: To prove the theorem in the uniform case we will first consider a 
728: ``base case'' that is 
729: easier to  
730: explain, and will be of use in solving the general case: let $a,b$ be 
731: two integers 
732: (not necessarily equal), both greater or equal to 2. Let $S$ be a set 
733: consisting of two constraints $C_{1}, C_{2}$ of arity $a$,
734: respectively $b$, specified by 
735: $C_{1}= \overline{x_{1}}\OR \ldots \overline{x_{a}}$, $C_{2}= x_{1}\OR 
736: \ldots x_{b}$. One can represent $SAT(S)$ in the framework of 
737: Definition~\ref{model}
738: by ``simulating'' $C_{1}$, $C_{2}$ by suitable constraints of arity 
739: $\max\{a,b\}$. 
740: 
741: We first outline how to prove that $SAT(S)$ has a sharp threshold: 
742: we apply the Friedgut-Bourgain result \cite{friedgut:k:sat} and infer 
743: that if 
744: $SAT(S)$ did not have a sharp  threshold than, for some $\epsilon, 
745: \delta_{0}, K>0$ and some probability 
746: $p=p(n)\in [p_{\epsilon}, p_{1-\epsilon}]$ 
747: 
748: \begin{enumerate}
749: \item either $ 
750: \Pr_{p=p(n)} [\Phi \mbox{ contains some } F\in \overline{SAT}\mbox{ 
751: with }|F|\leq K] > \delta_{0}$, or  
752: \item there exists a fixed satisfiable formula $F_{0}$, $|F_{0}|\leq K$  
753: such that $
754: \Pr_{p=p(n)} [\Phi \AND F_{0} \in \overline{SAT}] - \Pr [ \Phi \in 
755: \overline{SAT}] > 
756: \delta_{0}$. 
757: 
758: \end{enumerate} 
759: 
760: One easy observation is that in the second alternative we can always 
761: assume 
762: that $F$ consists of a conjunction of 
763: unit clauses: if $F$ is satisfiable and satisfies (2), then so does 
764: the conjunction of unit clauses specifying 
765: one satisfying assignment of $F$. The first alternative is eliminated 
766: by a result (Proposition 4.6) from 
767: \cite{creignou:daude:sat2002}. 
768: The key to disproving the second alternative, in the case of $k$-SAT, 
769: is 
770: a geometric result, Lemma 5.7 in \cite{friedgut:k:sat}. 
771: We restate it here for completeness. 
772: 
773: 
774: \begin{lemma}\label{friedgut}
775: For a sequence $A=(A_{n})$ of subsets of the $n$-dimensional hypercube, 
776:  $A\subseteq \{0,1\}^{n}$, define $A$ to be {\em 
777: $(d,m,\epsilon)$-coverable} if the probability for a union 
778: of a random choice of $d$ subcubes (hyperplanes) of codimension $m$ to 
779: cover $A$ is greater than $\epsilon$ for large enough $n$. 
780:  
781: Let $f(n)$ be any function that tends to infinity as $n\goesto \infty$. 
782: For fixed $k$, $d$, and $\epsilon$ any $A$ that is 
783: $(d,1,\epsilon)$-coverable is $(f(n),k, \epsilon)$-coverable. 
784: \end{lemma}
785: 
786: The connection with satisfiability can be explained as follows: the set 
787: $A$ 
788: in the application of the Lemma~\ref{friedgut} will (intuitively) refer 
789: to the set of satisfying assignments of random formula $\Phi$. 
790: Hyperplanes 
791: of codimension 1 are associated to unit clauses, more precisely to the 
792: set 
793: of assignments {\em forbidden} by a given unit clause. The fact that 
794: $A$ 
795: can be covered with probability $\epsilon$ by a union of $d$ random 
796: hyperplanes of codimension 1 parallels the fact that with probability 
797: $\epsilon$, 
798: $\Phi \AND F_{0}$ becomes unsatisfiable. This is what the result of 
799: Friedgut-Bourgain gives us (for $\epsilon = \delta_{0}$, under the 
800: assumption that $k$-SAT does not have a sharp 
801: threshold). Hyperplanes of 
802: codimension $k$ correspond to the set of assignments forbidden by a 
803: given $k$-clause, and the conclusion of the geometric lemma is that adding 
804: any small (but unbounded) number $f(n)$ of random 
805: $k$-CNF clauses to random formula $\Phi$ boosts the probability of {\em 
806: not} being satisfiable at least as much as the addition of the 
807: (constantly many) unit clauses in $F_{0}$.   
808: 
809: For small enough $f(n)$ this statement can be directly refuted, by 
810: concentration results for the binomial distribution (Lemma 5.6  in 
811: \cite{friedgut:k:sat}). A simpler 
812: and more general way to derive it is given as
813: Lemma 3.1 in \cite{achlioptas:friedgut:kcol}.  
814: 
815: The same outline works for the case we consider. To state the geometric 
816: result we need, however, to 
817: work with two types of hyperplanes: 
818: 
819: \begin{definition}
820: Let $H_{n}=\{0,1\}^{n}$ be the $n$-dimensional hypercube, and let 
821: $w_{i}$ denote the value of the $i$'th 
822: bit of element $w\in H_{n}$. 
823: A {\em positive hyperplane of codimension $d$} is a subset of points 
824: of  $H_{n}$ defined by a system of equations $
825: x_{i_{1}}= \ldots = x_{i_{d}}=1$, where the $x_{i}$'s are distinct 
826: variables. Negative hyperplanes have 
827: a similar definition. 
828: \end{definition}
829: 
830: Our version of the geometric Lemma is 
831: 
832: \begin{lemma}\label{geometric} 
833: For a sequence $A=(A_{n})$ of subsets of the $n$-dimensional hypercube, 
834:  $A_{n}\subseteq \{0,1\}^{n}$  define $A$ to be {\em $(n_{1}, d_{1}, 
835: d_{2},m_{1}, m_{2},\epsilon)$-coverable} if the probability of a union        
836: of a random choice of $d_{1}$ negative hyperplanes of codimension 
837: $m_{1}$ 
838: and $d_{2}$ positive hyperplanes of codimension $m_{2}$ to cover 
839: $A_{n}$ is at least $\epsilon$ if 
840: $n\geq n_{1}$. Let $f(n)$, $g(n)$ be any 
841: functions that tends to infinity as $n\goesto \infty$. For fixed 
842: $k_{1}$,$k_{2}$, $d$, and $0<\delta <\epsilon$, 
843: there exists $n_{2}$ that depends on $k_{1}, k_{2}, d,\epsilon,\Delta, 
844: n_{1}$ (but {\em not} $A$) such that for any 
845: $n\geq n_{2}$ any $A_{n}\subseteq \{0,1\}^{n}$ that is 
846: $(n_{1},d_{1},d_{2},1,1,\epsilon)$-coverable is 
847: $(n_{2},f(n),g(n),k_{1},k_{2}, \epsilon - \delta)$-coverable. 
848: \end{lemma}
849: We will in fact prove a stronger version of the Lemma: 
850: 
851: \begin{lemma}\label{geometric:2} 
852: For a sequence $A=(A_{n})$ of subsets of the $n$-dimensional hypercube, 
853: assume that  
854: \[
855: \Pr[A\subseteq H_{1}\cup \ldots H_{d}]\geq \epsilon
856: \]
857: 
858: for all $n\geq n_{1}$, 
859: where the $H_{i}$'s are random hyperplanes of codimension 1, 
860: $d_{1}$ of them negative, $d_{2}$ of them positive. 
861: 
862: 
863: Let $f(n)$, $g(n)$ be any 
864: functions that tends to infinity as $n\goesto \infty$. For fixed 
865: $k_{1}$,$k_{2}$, $d$, and $\delta >0$, 
866: there exists $n_{*}=n(n_{1},d_{1},d_{2},k_{1}, k_{2}, 
867: \epsilon,\delta,f,g)$, (however it does {\em not} depend on $A$) 
868: such that for any 
869: $n\geq n_{*}$ and any $i$, $0\leq i\leq d$
870: \begin{equation}\label{conclusion}
871: Pr[A\subseteq P_{1}\cup \ldots \cup P_{\frac{if(n)}{d}}\cup N_{1}\ldots 
872: \cup N_{\frac{ig(n)}{d}}\cup H_{i+1}\cup 
873: \ldots \cup H_{d}]\geq 
874: Pr[A\subseteq H_{1}\cup \ldots \cup H_{d}]-\frac{i\delta}{d},  
875: \end{equation}
876: 
877: where the $N_{i}$'s are random negative hyperplanes of codimension 
878: $k_{1}$ and the 
879: $P_{i}$'s are random positive hyperplanes of codimension $k_{2}$. 
880: \end{lemma}
881: 
882: \begin{proof} 
883: 
884: It is easy to see that one can assume that $d|f(n)$, $d|g(n)$ (since it 
885: is enough to prove the lemma for 
886: $\overline{f}(n)=d\lfloor f(n)/d \rfloor$, $\overline{g}(n)=d\lfloor 
887: g(n)/d \rfloor$). 
888:  
889: We will prove the lemma by double induction on $d_{1}, d_{2}$. By 
890: symmetry we only need to consider two ``base cases:'' \\
891: 
892: {\bf Case 1: $d_{1}=0$, $d_{2}=1$} \\
893: 
894: In this case (and the dual, $d_{1}=1$, $d_{2}=0$) we can replace, for 
895: $i=1$, 
896:  equation~\ref{conclusion}
897: by the stronger: 
898: \begin{equation}\label{conclusion:d=1}
899: \Pr[A\subseteq P_{1}\cup \ldots \cup P_{f(n)}\cup N_{1}\ldots \cup 
900: N_{g(n)}]\geq 
901: 1-\delta.  
902: \end{equation} 
903: 
904: The hypothesis implies that for $n\geq n_{1}$ 
905: there exist $n\cdot \epsilon$ positive hyperplanes of codimension $1$ 
906: such that 
907: \[
908: A_{n} \subseteq P^{(n)}_{1}\cap \ldots \cap P^{(n)}_{n\cdot \epsilon}. 
909: \]
910: 
911: We will assume, w.l.o.g., in what follows that $A$ is in fact {\em 
912: equal} to 
913: the right hand side. 
914: If $KP$ is a random positive hyperplane of 
915: codimension $k_{1}$ 
916: then 
917: \[
918: \Pr[A \not \subseteq KP] = 1-\frac{{{n\cdot \epsilon}\choose 
919: {k_{1}}}}{{{n}\choose {k_{1}}}}. 
920: \]  
921: 
922: Indeed, suppose KP is specified by the (random set of) equations  
923: $x^{(n)}_{1}= \ldots =x^{(n)}_{k_{1}}=1$. The condition that 
924: $A\subseteq KP$ is 
925: equivalent to 
926: \[
927: \{x^{(n)}_{1}, \ldots x^{(n)}_{k_{1}}\} \subseteq \{p^{(n)}_{1}, 
928: \ldots, p^{(n)}_{n\cdot \epsilon}\}, 
929: \]
930: 
931: where $\{p^{(n)}_{1}, \ldots, p^{(n)}_{n\cdot \epsilon}\}$ are the 
932: literals that 
933: specify the hyperplanes $P^{(n)}_{1}, \ldots, P^{(n)}_{n\cdot 
934: \epsilon}$.  
935: 
936: Thus the probability that $A$ is included in the union of $g(n)$ random 
937: positive hyperplanes $KP_{i}$ of codimension $k_{1}$ is at least $1 - 
938: \Pr[ (\forall i): A\not \subseteq KP_{i}]$, which is equal to 
939: \[
940: 1 - (1-\frac{{{n\cdot \epsilon}\choose {k_{1}}}}{{{n}\choose 
941: {k_{1}}}})^{g(n)} \sim 1- (1-\epsilon^{k_{1}})^{g(n)}\goesto 1 \mbox{ 
942: as } 
943: n\goesto \infty.
944: \]
945: 
946: It follows that there exists $n_{*}=n(n_{1},d_{1},d_{2},k_{1}, k_{2}, 
947: \epsilon,\delta,f,g)$
948: such that for $n\geq n_{*}$ the 
949: left-hand side is larger than $1-\delta$. 
950: 
951: \vspace{5mm}
952: {\bf Case 2: $d_{1}+d_{2}>1$} \\
953: 
954: It is enough to prove that there exists 
955: $n_{i}=n(n_{1},d_{1},d_{2},k_{1}, k_{2}, \epsilon,\delta,f,g,i)$
956: such that~\ref{conclusion} holds, for a fixed value of $i$, $0\leq i 
957: \leq d$, when 
958: $n\geq n_{i}$. Then we can take $n_{*}=max\{n_{i}\}$. 
959: 
960: We prove this on induction on $i$. The claim is clearly true for $i=0$. 
961: Assume, therefore, that 
962: the claim is true up to $i$; we will prove it for  $i+1$. 
963: 
964: Denote for all $j$
965: \[
966: p_{j}= \Pr[A \subseteq P_{1}\cup \ldots \cup 
967: P_{\frac{(j-1)f(n)}{d}}\cup N_{1}\cup \ldots \cup 
968: N_{\frac{(j-1)g(n)}{d}}\cup H_{j}\cup \ldots \cup H_{d}]. 
969: \]
970: 
971: To accomplish that it is enough to show that 
972: \begin{equation}\label{inductive:step}
973: p_{i+1}\geq p_{i}-\delta, 
974: \end{equation} 
975:  
976: By the Bayes formula: 
977:  
978: \begin{eqnarray*}\label{bayes}
979: p_{i}=\Pr[A \subseteq P_{1}\cup \ldots \cup P_{\frac{(i-1)f(n)}{d}}\cup 
980: N_{1}\cup \ldots \cup 
981: N_{\frac{(i-1)g(n)}{d}}\cup H_{i}\cup \ldots \cup H_{d}]  =  \\
982:  =  \sum_{B} 
983: \ Pr[B\subseteq H_{i}| A\setminus (P_{1}\cup \ldots \cup 
984: P_{\frac{(i-1)f(n)}{d}}\cup N_{1}\cup \ldots \cup 
985: N_{\frac{(i-1)g(n)}{d}}) = B] \cdot \\
986: \cdot \Pr[A\setminus (P_{1}\cup \ldots \cup P_{\frac{(i-1)f(n)}{d}}\cup 
987: N_{1}\cup \ldots \cup 
988: N_{\frac{(i-1)g(n)}{d}}) = B]
989: \end{eqnarray*} 
990: 
991: Assume without loss of generality that $H_{i}$ is a positive 
992: hyperplane. 
993: 
994: Let $\gamma= \frac{\delta}{2d}$. Let 
995: \[
996: C_{\gamma}=\{B\subseteq \{0,1\}^{n}: Pr[B\subset P] > \gamma\}.
997: \]
998: 
999: Let $\alpha$ be the sum of those terms in~\ref{bayes} corresponding to 
1000: sets $B\in C_{\gamma}$, 
1001: and let $\beta$ be the sum corresponding to sets $B\in 
1002: \overline{C_{\gamma}}$. 
1003: 
1004: From the definition of $C_{\gamma}$ it follows that 
1005: \[
1006: 0\leq \beta \leq \gamma,  
1007: \]
1008:  therefore
1009: \begin{eqnarray*}\label{inequality} 
1010: \Pr[A\setminus (P_{1}\cup \ldots \cup P_{\frac{(i-1)f(n)}{d}}\cup 
1011: N_{1}\cup \ldots \cup 
1012: N_{\frac{(i-1)g(n)}{d}}) \in C_{\gamma}]\geq \\ 
1013: \geq \frac{1}{\gamma}\cdot [\Pr[A \subseteq P_{1}\cup \ldots \cup 
1014: P_{\frac{(i-1)f(n)}{d}}\cup N_{1}\cup \ldots \cup 
1015: N_{\frac{(i-1)g(n)}{d}}\cup H_{i}\cup \ldots \cup H_{d}] -\gamma]= \\
1016: = \frac{1}{\gamma}\cdot [p_{i}-\gamma].
1017: \end{eqnarray*} 
1018: 
1019: On the other hand 
1020: 
1021: \begin{eqnarray*}\label{bayes:2}
1022: p_{i+1}=\Pr[A \subseteq P_{1}\cup \ldots \cup P_{\frac{if(n)}{d}}\cup 
1023: N_{1}\cup \ldots \cup 
1024: N_{\frac{ig(n)}{d}}\cup H_{i+1}\cup \ldots \cup H_{d}]  =  \\
1025:  =  \sum_{B} 
1026: \ Pr[B\subseteq P_{\frac{(i-1)f(n)}{d}+1}\cup \ldots \cup 
1027: P_{\frac{if(n)}{d}} 
1028: \cup N_{\frac{(i-1)g(n)}{d}+1}\cup \ldots \cup N_{\frac{ig(n)}{d}}| \\
1029: A\setminus (P_{1}\cup \ldots \cup P_{\frac{(i-1)f(n)}{d}}\cup N_{1}\cup 
1030: \ldots \cup 
1031: N_{\frac{(i-1)g(n)}{d}}) = B] \cdot \\
1032: \cdot \Pr[A\setminus (P_{1}\cup \ldots \cup P_{\frac{(i-1)f(n)}{d}}\cup 
1033: N_{1}\cup \ldots \cup 
1034: N_{\frac{(i-1)g(n)}{d}}) = B]
1035: \end{eqnarray*} 
1036: 
1037: Let $\overline{f}(n)=f(n)/d$, $\overline{g}(n)= g(n)/d$. Since the 
1038: $N_{i}$'s, $P_{i}$'s 
1039: are random hyperplanes, one can 
1040: rewrite the previous recurrence as 
1041: 
1042: \begin{eqnarray*}\label{bayes:3}
1043: p_{i+1}=\Pr[A \subseteq P_{1}\cup \ldots \cup P_{\frac{if(n)}{d}}\cup 
1044: N_{1}\cup \ldots \cup 
1045: N_{\frac{ig(n)}{d}}\cup H_{i+1}\cup \ldots \cup H_{d}]  =  \\
1046:  =  \sum_{B} 
1047: \Pr[B\subseteq P_{1}\cup \ldots \cup P_{\overline{f(n)}} 
1048: \cup N_{1}\cup \ldots \cup N_{\overline{g(n)}}| \\
1049: A\setminus (P_{1}\cup \ldots \cup P_{\frac{(i-1)f(n)}{d}}\cup N_{1}\cup 
1050: \ldots \cup 
1051: N_{\frac{(i-1)g(n)}{d}}) = B] \cdot \\
1052: \cdot \Pr[A\setminus (P_{1}\cup \ldots \cup P_{\frac{(i-1)f(n)}{d}}\cup 
1053: N_{1}\cup \ldots \cup 
1054: N_{\frac{(i-1)g(n)}{d}}) = B]
1055: \end{eqnarray*} 
1056: 
1057: 
1058: Since all terms are nonnegative, one can obtain a lower bound on the 
1059: left-hand size of this 
1060: latter terms by only considering those $B\in C_{\gamma}$. 
1061: 
1062: Applying the induction hypothesis from case one for all $B\in 
1063: C_{\gamma}$ and 
1064: $n\geq 
1065: n_{i}=n_{*}(n_{1},0,1,k_{1},k_{2},\gamma,\delta,\overline{f},\overline{g})$,  we 
1066: infer that for all such $B$, 
1067: \[
1068: \Pr[B\subseteq P_{1}\cup \ldots \cup P_{\overline{f(n)}} 
1069: \cup N_{1}\cup \ldots \cup N_{\overline{g(n)}}]\geq (1-\gamma), 
1070: \]
1071: therefore 
1072: \begin{eqnarray*}
1073: p_{i+1}= \Pr[A \subseteq P_{1}\cup \ldots \cup P_{\frac{if(n)}{d}}\cup 
1074: N_{1}\cup \ldots \cup 
1075: N_{\frac{ig(n)}{d}}\cup H_{i+1}\cup \ldots \cup H_{d}]\geq \\
1076: (1-\gamma)\cdot 
1077: \sum_{B\in C_{\gamma}} Pr[A\setminus (P_{1}\cup \ldots \cup 
1078: P_{\frac{(i-1)f(n)}{d}}\cup N_{1}\cup \ldots \cup 
1079: N_{\frac{(i-1)g(n)}{d}}) = B] \\
1080: = (1-\gamma) \cdot Pr[A\setminus (P_{1}\cup \ldots \cup 
1081: P_{\frac{(i-1)f(n)}{d}}\cup N_{1}\cup \ldots \cup 
1082: N_{\frac{(i-1)g(n)}{d}})\in C_{\gamma}] \\ 
1083: \geq \frac{(1-\gamma)}{\gamma}\cdot [\Pr[A \subseteq P_{1}\cup \ldots 
1084: \cup P_{\frac{(i-1)f(n)}{d}}\cup N_{1}\cup \ldots \cup 
1085: N_{\frac{(i-1)g(n)}{d}}\cup H_{i}\cup \ldots \cup H_{d}] -\gamma] = \\
1086: \frac{(1-\gamma)}{\gamma}\cdot [p_{i}-\gamma].  
1087: \end{eqnarray*}
1088: 
1089: Since $\gamma \leq 1$, 
1090: \[
1091: p_{i+1}\geq (1-\gamma)\cdot (p_{i}-\gamma)= p_{i} - \gamma \cdot 
1092: [1+p_{i}-\gamma]\geq p_{i}-2\gamma.  
1093: \]
1094: 
1095: which is precisely equation~\ref{inductive:step} (that we wanted to 
1096: prove). 
1097: 
1098: \end{proof}
1099: \qed
1100: 
1101: \subsection{How to contradict Lemma~\ref{geometric}}
1102: 
1103: It is now easy to infer the fact that $SAT(S)$ has a sharp threshold, 
1104: by 
1105: using the previous lemma with $d_{1}= |F_{0}\cap Var|$, 
1106: $d_{2}=|F_{0}|-d_{1}$, 
1107: $k_{1}=b$, $k_{2}=a$, $\epsilon = \delta_{0}$, $\Delta = \delta_{0}/2$ 
1108: and a refutation of the geometric lemma similar to the 
1109: one for 3-SAT.  
1110: 
1111: The conclusion of the Geometric Lemma (similar to the one for $k$-SAT) 
1112: is that, by adding any number $f(n)$ of copies of $x_{1}\OR \ldots \OR 
1113: x_{a}$ 
1114: and $g(n)$ copies of $\overline{x_{1}}\OR \ldots \OR x_{b}$ suffices to 
1115: boost the unsatisfiability probability by a constant. 
1116: 
1117: However, Lemma 3.1 from \cite{achlioptas:friedgut:kcol} asserts that 
1118: adding up to $o(\sqrt(n))$ random clauses is not enough to boost the 
1119: unsatisfiability 
1120: probability by more than $o(1)$. 
1121: 
1122: Because of the nature of the random model, adding $H(n)=o(\sqrt(n))$ 
1123: random clauses insures that w.h.p. we have $\Theta(H(n))$ copies 
1124: of each type of clause, as long as $H(n)$ grows faster than some power 
1125: of $n$. Taking the number of such copies to be the functions 
1126: $f(n)$, $g(n)$ contradicts the consequence of the Geometric Lemma. 
1127: 
1128: 
1129: Note that we do {\em not} make use of all the details of the random 
1130: model (such as the precise number of copies of each clause in random 
1131: instances at $p=p(n)$), but only that: 
1132: \begin{itemize}
1133: \item the expected number of copies of both $C_{1}$ and $C_{2}$ in a 
1134: random formula at $p=p(n)$ is unbounded. This is enough to make the 
1135: analog 
1136: of Lemma 3.1 from \cite{achlioptas:friedgut:kcol} work.  
1137: \item  the clauses are independent. 
1138: \end{itemize}  
1139: 
1140: \subsection{Putting it all together} \label{all:together}
1141: 
1142: In the previous section we have proved a geometric lemma that is used 
1143: to prove that the above-defined set $SAT(S)$ has a sharp threshold. 
1144: 
1145: Consider now a set ${\cal C}$ of constraints that satisfies the 
1146: condition (3) 
1147: of the theorem. Since $SAT({\cal C})$ is not trivially satisfied by the 
1148: ``all zero'' (all ones) assignment, there exist constraints $C_{1}$, 
1149: $C_{2}$ in ${\cal C}$ and $a,b \geq 1$ such that $C_{1}\models x_{1}\OR 
1150: \ldots x_{a}$, 
1151: $C_{2}\models \overline{x_{1}}\OR \ldots \OR \overline{x_{b}}$. In fact 
1152: $a,b \geq 2$, otherwise some constraint in ${\cal C}$ would strongly 
1153: depend on one variable. 
1154: 
1155: Just as in the base case, condition (i) in the Friedgut-Bourgain result 
1156: is eliminated by the result of Creignou and Daud\'{e}, and  the formula 
1157: $F_{0}$ in condition 
1158: (ii) can be assumed to consist of a conjunction of unit clauses.
1159: Reflecting this fact, the geometric lemma needed for the
1160: general case has the same hypothesis as 
1161: the one of Lemma~\ref{geometric}: the set $A$ can be covered by a union 
1162: of random hyperplanes of codimension 1. 
1163: However, the covering desired in the conclusion no longer consists 
1164: of hyperplanes, but of (general) sets of points in the hyperplane, 
1165: corresponding to sets of assignments forbidden by a certain constraint 
1166: $C$. 
1167: 
1168: The critical observation (easiest to make in the case when $C_{1}\neq 
1169: C_{2}$ )
1170: is that {\bf Lemma~\ref{geometric} implies the geometric result we need 
1171: for this case}: 
1172: since $C_{1}\models x_{1}\OR \ldots x_{a}$, the ``forbidden set'' 
1173: associated to  $C_{1}$ contains the ``forbidden set'' associated to 
1174: constraint 
1175: $x_{1}\OR \ldots x_{a}$ in Lemma~\ref{geometric}, the hyperplane 
1176: $x_{1}= \ldots = x_{a}=0$. A similar result holds for $C_{2}$ 
1177: and the positive hyperplanes. 
1178: 
1179: Thus each ``covering set'' in a conclusion of the (general case of 
1180: the ) geometric lemma contains a corresponding ``covering set'' from 
1181: the conclusion of Lemma~\ref{geometric}. In other words {\bf the 
1182: geometric lemma for $SAT({\cal C})$ follows from the geometric lemma for the 
1183: base case by monotonicity}.  
1184: 
1185: All is left to show is that the analog of Lemma 3.1 from
1186: \cite{achlioptas:friedgut:kcol} also 
1187: works in this general case. We have previously observed that this 
1188: amounts 
1189: to showing that the expected number of copies of $C_{1}$, $C_{2}$ in 
1190: a random formula is unbounded. But Proposition 3.5 in 
1191: \cite{creignou:daude:sat2002} (slightly generalized to probability distributions ${\cal 
1192: P}$ that are not uniform) and the details of the random model imply 
1193: that in fact this number is linear. 
1194: 
1195: A simple modification of this argument holds when $C_{1}=C_{2}$. To see 
1196: this, note that in the proof of the fact that 
1197: Lemma~\ref{geometric} holds for {\em some} small enough (but unbounded) 
1198: $f(n),g(n)$ is enough to obtain a contradiction. 
1199:  
1200: The expected number of copies of $C_{1}$ in a random formula is linear, 
1201: denote it by $h(n)$. Dividing the set of such copies into two (random) 
1202: sets of cardinality $h(n)/2$ yields 
1203: infinitely many random copies of $C_{1}$ 
1204: used to imply  $x_{1}\OR \ldots \OR x_{a}$ in the previous argument
1205: and infinitely many  copies used to imply   
1206: $\overline{x_{1}}\OR \ldots \OR \overline{x_{b}}$. We then apply the 
1207: same strategy as in the first case. 
1208: 
1209: To summarize: the proof follows from the corresponding 
1210: argument for the base case by monotonicity. It 
1211: critically uses the fact that we are {\em not} in cases (i) or (ii) of 
1212: the Theorem, since it is only under these conditions when the first 
1213: alternative in the 
1214: Friedgut-Bourgain argument can be eliminated. 
1215: 
1216: 
1217: \end{enumerate} 
1218: \end{proof}
1219: \qed
1220: 
1221: \section{3-SAT has a first-order phase transition}
1222: 
1223: \begin{theorem}\label{3sat:first-order}
1224: $k$-SAT, $k\geq 3$ has a first-order phase transition. In other words
1225: there exists  
1226: $\eta > 0 $ such that  for every sequence $p = p(n)$
1227:  we have 
1228: \begin{equation}\label{jump} 
1229: \lim_{n\goesto \infty} \Pr_{p=p(n)}[\Phi \in SAT] = 0 \implies
1230: \lim_{n\goesto \infty}\Pr_{p=p(n)}[ \frac{|Spine(\Phi)|}{n}\geq \eta]= 
1231: 1. 
1232: \end{equation}  
1233: \end{theorem}
1234: 
1235: \begin{proof}
1236: 
1237: We start by giving a simple sufficient condition for a literal to
1238: belong to the spine of the formula: 
1239: 
1240: \begin{claim}\label{spine:unsat}
1241: Let $\Phi$ be a minimally unsatisfiable formula, and let $x$ be a
1242: literal that appears in $\Phi$. Then $x\in Spine(\Phi)$. 
1243: \end{claim}
1244: \begin{proof}
1245: Let $C$ be a clause that contains $x$. By the minimal unsatisfiability
1246: of $\Phi$, $\Phi \setminus \{C\} \in SAT$. On the other hand $\Phi 
1247: \setminus \{C\}
1248: \AND \{x\} \in \overline{SAT}$, otherwise $\Phi$ would also be
1249: satisfiable. 
1250: Thus $x \in Spine(\Phi \setminus \{C\})$. 
1251: \end{proof}
1252: \qedbox
1253: 
1254: Thus, to show that 3-SAT has a first-order phase transition it is 
1255: enough to show that a random unsatisfiable formula contains w.h.p. 
1256: a minimally unsatisfiable subformula containing a linear number of 
1257: literals. A way to accomplish this is by using the two ingredients of 
1258: the Chv\'{a}tal-Szemer\'{e}di proof \cite{chvatal:szemeredi:resolution} 
1259: that random 3-SAT has exponential
1260: resolution size w.h.p. They are explicitly stated to make the 
1261: argument self-contained:   
1262: 
1263: \begin{claim} 
1264: There exists a constant $\delta>0$ such that for every constant $c>c_{3 
1265: SAT}$ with high 
1266: probability (as $n\goesto \infty$) a random formula $\Phi$ with $n$ 
1267: variables and $cn$ 
1268: clauses has no minimally unsatisfiable subformula of size less than 
1269: $\delta \cdot n$.  
1270: \end{claim}
1271: 
1272: \begin{claim}
1273: There exists $\eta >0$ so that w.h.p. for every $c>c_{3-SAT}$ all 
1274: subformulas 
1275: of a random formula $\Phi$ having between $(\delta/2)\cdot n$ and 
1276: $\delta\cdot n$
1277: clauses contain at least $\eta\cdot n$ (pure) literals (corresponding 
1278: to different variables).  
1279: \end{claim}
1280: 
1281: The argument is now transparent: if $\Phi$ is unsatisfiable then
1282: w.h.p. a minimally unsatisfiable subformula $\Xi$ of $\Phi$ has 
1283: size at least $\delta n$. By the second claim, applied to an arbitrary 
1284: subformula of $\Xi$ of size $(3\delta n)/4$, we infer that w.h.p. $\Xi$ 
1285: contains at least many $\eta\cdot n$ different variables.  
1286: 
1287: \end{proof}
1288: 
1289: 
1290: 
1291: \section{First-order phase transitions and resolution complexity 
1292: of  random generalized satisfiability problems} 
1293: 
1294: In this section we extend the previous result (and the connection 
1295: between first-order phase transitions and resolution complexity) to 
1296: other 
1297: classes of satisfiability problems. Interestingly, we find that a 
1298: condition 
1299: Molloy investigated in \cite{molloy-stoc2002} is a sufficient condition 
1300: for the 
1301: existence of a first-order phase transition. 
1302: 
1303: It turns out that there are differences between the case of random 
1304: $k$-SAT and the general case that force us to employ an 
1305: alternative definition of the spine. The most obvious one is that 
1306: formula (~\ref{spine:initial}) involves 
1307: negations of variables, whereas Molloy's model does not. But it has 
1308: more serious problems: consider for example {\em $k$-uniform hypergraph 
1309: 2-coloring}
1310: specified as $SAT(\{C_{0}\})$, where $C_{0}(x_{1}, \ldots, x_{k})$ has 
1311: the interpretation ``not all of $x_{1}, \ldots x_{k}$ are equal''. 
1312: Because of the built-in symmetry to permuting colors 0 and 1,  the 
1313: spine of {\em any} instance is empty (under 
1314: definition~\ref{spine:initial}). 
1315: Similar phenomena have appeared before (and forced a different 
1316: definition of the backbone/spine) 
1317: in  {\em k-coloring \cite{frozen:development}} (symmetry = permutation 
1318: of colors) or {\em graph partition} \cite{graph-partition:transition} 
1319: (symmetry = permutation of sides). 
1320: There are other ways (to be detailed in the journal version of the 
1321: paper) in which the original definition of the 
1322: spine behaves differently in the general case than in that of random 
1323: $k$-SAT. Our solution is to define the concept of spine of a 
1324: random instance of a satisfiability problem $SAT(\cal P)$ in a slightly 
1325: different way. The definition  is consistent with those in 
1326: \cite{frozen:development}, \cite{graph-partition:transition}. 
1327: 
1328: \begin{definition}$
1329: Spine(\Phi) = \{ x\in Var | (\exists) \Xi \subseteq \Phi, \Xi \in SAT, 
1330: (\exists) C\in {\cal C}, x\in C,\mbox{ such that } 
1331: \Xi \AND C  \in \overline{SAT}\}$. 
1332: 
1333: \end{definition} 
1334: 
1335: It is easy to see that, for $k$-CNF formulas whose (original) spine 
1336: contains at 
1337: least three literals a variable $x$ is in the (new version of the) 
1338: spine if and only if either $x$ or $\overline{x}$ 
1339: were present in the old version. In particular the new definition does 
1340: not change the order of the 
1341: phase transition of random $k$-SAT. Moreover the proof of 
1342: Claim~\ref{spine:unsat} carries over to the general case.  
1343: The {\em resolution complexity} of an instance $\Phi$ of $SAT({\cal 
1344: P})$ is defined as the resolution complexity of the formula obtained by 
1345: converting each constraint of $\Phi$ to CNF-form. 
1346: 
1347: 
1348: \begin{definition} Let ${\cal P}$ be such that $SAT({\cal P})$ has a 
1349: sharp 
1350: threshold.
1351: Problem $SAT({\cal P})$ has a {\em first-order phase transition} if
1352: there exists  
1353: $\eta > 0 \mbox{ such that  for every sequence }  p = p(n)$
1354:  we have 
1355: \begin{equation}\label{jump:general} 
1356: \lim_{n\goesto \infty} \Pr_{p=p(n)}[\Phi \in SAT] = 0 \implies
1357: \lim_{n\goesto \infty}\Pr_{p=p(n)}[ \frac{|Spine(\Phi)|}{n}\geq \eta]= 
1358: 1. 
1359: \end{equation}  
1360: If, on the other hand, for every $epsilon >0$ there exists $p^{\epsilon}(n)$ 
1361: with 
1362: \begin{equation}\label{jump:continuous} 
1363: \lim_{n\goesto \infty} \Pr_{p=p^{\epsilon}(n)}[\Phi \in SAT] = 0 \mbox{ and }
1364: \lim_{n\goesto \infty}\Pr_{p=p(n)}[ \frac{|Spine(\Phi)|}{n}\geq \epsilon]= 
1365: 0 
1366: \end{equation} 
1367: we say that $SAT({\cal P})$ has a {\em second-order phase 
1368: transition}\footnote{strictly speaking the order of the 
1369: phase transition is {\em at least two}.}.   
1370: \end{definition} 
1371: 
1372: A first observation is that a second-order phase transition 
1373: has computational implications: 
1374: 
1375: \begin{theorem}\label{second:order} 
1376: Let ${\cal P}$ be such $SAT({\cal P})$ has a second-order phase 
1377: transition. Then for every constant
1378: $c>\overline{lim}_{n\goesto \infty} c_{SAT({\cal P})}(n)$, and {\em 
1379: every $\alpha>0$}, random formulas of constraint density $c$ 
1380: have w.h.p. resolution complexity $O(2^{k\cdot \alpha \cdot n})$.  
1381: \end{theorem} 
1382: 
1383: \begin{proof} 
1384: 
1385: By the analog of Claim~\ref{spine:unsat} for the general case, if 
1386: $SAT({\cal P})$ has a second-order phase transition) 
1387: then for every $c>c_{SAT({\cal P})}$ and for every $\alpha>0$, 
1388: minimally unsatisfiable subformulas of 
1389: a random formula $\Phi$ with constraint density $c$ have w.h.p. size at 
1390: most $\alpha \cdot n$.  
1391: Consider 
1392: the backtrack tree of the natural DPLL algorithm (that tries to 
1393: satisfies clauses one at a time) 
1394: on such a minimally unsatisfiable subformula $F$. By the usual 
1395: correspondence between DPLL trees and resolution 
1396: complexity (e.g. \cite{beame:dp}, pp. 1) it yields a resolution proof 
1397: of the unsatisfiability of $\Phi$
1398: having size at most $2^{k\cdot \alpha \cdot n+1}$.   
1399: 
1400: \end{proof}
1401: \qed
1402: 
1403: \begin{definition} 
1404: For a formula $F$ define $
1405: c^{*}(F)= \max\{ \frac{|Constraints(G)|}{|Var(G)|}: \emptyset \neq G 
1406: \subseteq F\}$. 
1407: \end{definition} 
1408: 
1409: The next result gives a sufficient condition for a generalized 
1410: satisfiability problem 
1411: to have a first-order phase transition. 
1412: 
1413: \begin{theorem} \label{sufficient:first-order} 
1414: Let $C$ be a set of constraints such that $SAT({\cal C})$ has a sharp 
1415: threshold. If 
1416: there exists $\epsilon > 0$ such that for every minimally unsatisfiable 
1417: formula $F$ it holds 
1418: that 
1419: \[
1420: c^{*}(F) > \frac{1+\epsilon}{k-1}
1421: \] 
1422: then for every ${\cal P}$ with $supp({\cal P})=C$  
1423: 
1424: $SAT({\cal P})$ has a first-order phase transition.  
1425: 
1426: \end{theorem} 
1427: \begin{proof} 
1428: 
1429: We first recall the following concept from \cite{chvatal:szemeredi:resolution}: 
1430: 
1431: \begin{definition} 
1432: Let $x,y>0$. A $k$-uniform hypergraph with $n$ vertices is {\em ($x$,$y$)-sparse} if every set of $s\leq xn$ vertices contains at most $ys$ edges.  
1433: \end{definition} 
1434: 
1435: We also recall Lemma 1 from the same paper.  
1436: \begin{lemma}\label{sparsity:hypergraph}  
1437: Let $k,c>0$ and $y$ be such that $(k-1)y>1$. Then w.h.p. the random $k$-uniform hypegraph with $n$ vertices and $cn$ edges is $(x,y)$-sparse, where 
1438: \begin{eqnarray*}
1439: \epsilon = y-1/(k-1), \\
1440: x = (\frac{1}{2e}(\frac{y}{ce})^{y})^{1/\epsilon}, \\
1441: \end{eqnarray*} 
1442: \end{lemma} 
1443: 
1444: The critical observation is that the existence of a minimally 
1445: unsatisfiable formula of size $xn$ and with $c^{*}(F) > \frac{1+\epsilon}{k-1}$ 
1446: implies that the 
1447: $k$-uniform hypergraph associated to the given formula is {\em  not}  
1448: $(x,y)$-sparse, for $y= 
1449: \frac{\epsilon}{k-1}$. 
1450: 
1451: But, according to Lemma~\ref{sparsity:hypergraph}, w.h.p. a random $k$-uniform hypergraph with $cn$ edges is 
1452: $(x_{0},y)$ 
1453: sparse, for $x_{0}=(\frac{1}{2e}(\frac{y}{ce})^{y})^{1/\epsilon}$ (a 
1454: dirrect application of Lemma 1 in their paper). We infer that any formula 
1455: with less than $x_{0}\cdot n /K$ 
1456: constraints is satisfiable, therefore the same is true for any formula 
1457: with $x_{0}\cdot n/K$ clauses picked up from the clausal representation 
1458: of constraints in $\Phi$.  
1459: 
1460: The second condition (expansion of the formula hypergraph) can be 
1461: proved similarly. 
1462: 
1463: \end{proof} 
1464: \qedbox
1465: 
1466: One can give an explicitly defined class of satisfiability 
1467: problems for which the previous result applies: 
1468: 
1469: \begin{theorem}\label{implicates:first-order}
1470: Let ${\cal P}$ be such that $SAT({\cal P})$ has a 
1471: sharp 
1472: threshold. If  {\em no} clause $C\in {\cal C}=supp({\cal P})$ has an 
1473: implicate of 
1474: length 
1475: at most 2 then 
1476: \begin{enumerate}
1477: \item for every minimally unsatisfiable formula $F$
1478: \[
1479: c^{*}(F)\geq \frac{2}{2k-3}. 
1480: \]
1481: Therefore $SAT({\cal P})$ satisfies the conditions of the previous 
1482: theorem, i.e. it has a first-order 
1483: phase transition.  
1484: \item Moreover $SAT({\cal P})$ also has $2^{\Omega(n)}$ 
1485: resolution complexity\footnote{this result subsumes some of the recent results 
1486: in \cite{mitchell:cp02}}.    
1487: \end{enumerate} 
1488: \end{theorem} 
1489: 
1490: \begin{proof}
1491: 
1492: \begin{enumerate}
1493: \item 
1494: For any real $r \geq 1$, formula $F$ and set of clauses $G\subseteq F$, 
1495:  define the {\em $r$-deficiency of $G$}, $\delta_{r}(G)= 
1496: r|Clauses(G)|-|Vars(G)|$. 
1497: 
1498: Also define 
1499: \begin{equation} \label{max}
1500: \delta^{*}_{r}(F)= \max\{\delta_{r}(G): \emptyset \neq G \subseteq F\}
1501: \end{equation} 
1502: 
1503: We claim that for any minimally unsatisfiable $F$, 
1504: $\delta^{*}_{2k-3}(F)\geq 0$. Indeed, 
1505: assume  
1506: this was not true. Then there exists such $F$ such that: 
1507: \begin{equation}\label{deff}  
1508: \delta_{2k-3}(G)\leq 
1509: -1\mbox{ for all }\emptyset \neq G\subseteq F. 
1510: \end{equation} 
1511: \begin{proposition} \label{1-transversal}
1512: Let $F$ be a formula for which condition~\ref{deff} holds. Then there 
1513: exists 
1514: an ordering $C_{1}, \ldots, C_{|F|}$ of constraints in $F$ 
1515: such that each constraint $C_{i}$ contains 
1516: at least $k-2$ variables that appear in {\em no} $C_{j}$, $j<i$. 
1517: \end{proposition} 
1518: 
1519: \begin{proof}
1520: Denote by $v_{i}$ the number of variables that appear in {\em exactly} $i$ constraints of $F$. We have 
1521: \[
1522: \sum_{i\geq 1} i\cdot v_{i} = k\cdot |Constraints(F)|.
1523: \]
1524: 
1525: therefore $2|Var(F)|-v_{1}\leq k\cdot |Constraints(F)|$. This can be 
1526: rewritten as $v_{1}\geq 2|Var(F)|-k|Constraints(F)|> |Constraints(F)|\cdot 
1527: (2k-3 - k)= (k-3)\cdot |Constraints(F)|$ (we have used the upper bound on $c^{*}(F)$. Therefore there exists at least one constraint in $F$  with at least $k-2$ variables that are free in $F$. We set $C_{|F|}=C$ and 
1528: apply this argument recursively to $F\setminus C$.  
1529: \end{proof} 
1530: \qed
1531: 
1532: Call the $k-2$ new variables of $C_{i}$ {\em free in $C_{i}$}. Call the 
1533: other 
1534: two variables {\em bound in $C_{i}$}. Let us show now that $F$ cannot 
1535: be 
1536: minimally unsatisfiable. Construct a satisfying assignment for $F$ 
1537: incrementally: Consider constraint $C_{j}$. At most two of the 
1538: variables in 
1539: $C_{j}$ are bound for $C_{j}$. Since $C$ has no implicates of size at 
1540: most two, 
1541: one can set the remaining variables in a way that satisfies $C_{j}$. 
1542: This yields a satisfying 
1543: assignment for $F$, a contradiction with our assumption that $F$ was 
1544: minimally unsatisfiable. 
1545: 
1546: Therefore $\delta^{*}_{2k-3}(F)\geq 0$, a 
1547: statement equivalent to our conclusion.  
1548: 
1549: \item To prove the resolution complexity lower bound we use the size-width 
1550: connection for resolution complexity obtained in 
1551: \cite{ben-sasson:resolution:width}: 
1552: we prove that there exists $\eta >0$ such that w.h.p. random instances 
1553: of $SAT({\cal P})$ having constraint density $c$ have resolution 
1554: width at least $\eta \cdot n$. 
1555: 
1556: We use the same strategy as in \cite{ben-sasson:resolution:width} 
1557: \begin{enumerate}
1558: \item prove that w.h.p. minimally unsatisfiable subformulas are ``large''. 
1559: \item prove that any clause implied by a satisfiable formula of ``intermediate'' size will contain ``many'' literals.   
1560: \end{enumerate} 
1561: 
1562: Indeed, define for a unsatisfiable formula $\Phi$ and (possibly empty) clause 
1563: $C$
1564: \[
1565: \mu(C)=\min\{|\Xi|: \Xi\subseteq  \Phi, \Xi \models C\}. 
1566: \]
1567: 
1568: \begin{claim}
1569: There exists $\eta_{1}>0$ such that for any $c>0$, w.h.p. 
1570: $\mu(\Box)\geq \eta_{1}\cdot n$ (where  $\Phi$ is a random instance 
1571: of $SAT({\cal P})$ having constraint density $c$). 
1572: \end{claim} 
1573: 
1574: \begin{proof}
1575: In the proof of Theorem~\ref{sufficient:first-order} we have shown that there 
1576: exists $\eta_{0}>0$ such that w.h.p. any unsatisfiable subformula of a given 
1577: formula has at least $\eta_{0} \cdot n$ constraints. Therefore {\em any} 
1578: formula made of {\em clauses} in the CNF-representation of constraints in 
1579: $\Phi$, and which has less than $\eta_{0} \cdot n$ clauses is satisfiable, 
1580: and the claim follows, by taking $\eta_{1} = \eta_{0}$. 
1581: \end{proof} 
1582: \qed 
1583: 
1584: The only (slightly) nontrivial step of the proof, which critically uses the 
1585: fact that constraints in ${\cal P}$ do not have implicates of length at most 
1586: two, is to prove that clause implicates of subformulas of ``medium'' size 
1587: have ``many'' variables. Formally: 
1588: 
1589: \begin{claim}\label{expansion} 
1590: There exists $d>0$ and $\eta_{2}>0$ such that w.h.p., for every clause 
1591: $C$ such that $d/2\cdot n <\mu(C)<=dn$, $|C|\geq \eta_{2}\cdot n$.   
1592: \end{claim} 
1593: 
1594: \begin{proof} 
1595: Take $0<\epsilon$. It is easy to see that if $c^{*}(F)<\frac{2}{2k-3+\epsilon}$ then w.h.p. for every subformula $G$ of $F$, at least $\frac{\epsilon}{3} \cdot |Constraints(G)|$ have at least $k-2$ private variables: Indeed, since 
1596: $c^{*}(G)<\frac{2}{2k-3+\epsilon}$, by a reasoning similar to the one we made 
1597: previously $v_{1}(G)\geq (k-3+\epsilon)|Constraints(G)|$. Since constraints in $G$ have arity $k$, at least $\epsilon/3 \cdot |Constraints(G)|$ have at 
1598: least $k-2$ ``private'' variables. 
1599: 
1600: Choose $y=\frac{2}{2k-3+\epsilon}$ in Lemma~\ref{sparsity:hypergraph} for $\epsilon>0$ a small 
1601: enough constant. Since the problem has a sharp threshold in the region where 
1602: the number of clauses is linear, 
1603: \[
1604: d=\inf\{x(y,c): c>= c_{SAT({\cal P})}\} >0.  
1605: \]
1606: 
1607: W.h.p. all subformulas of $\Phi$ having size less than  
1608: $d/k \cdot n$ have a formula hypergraph that is $(x,y)$-sparse, 
1609: therefore fall under the scope of the previous argument. 
1610: 
1611: Let $\Xi$ be a subformula of $\Phi$, having minimal size, such that 
1612: $\Xi \models C$. We claim: 
1613: 
1614: \begin{claim}
1615: For every clause $P$ of $\Xi$ with $k-2$ ``private'' variables, (i.e. one that does not appear in any other clause), at least one of these ``private'' variables appears in $C$. 
1616: \end{claim} 
1617: 
1618: Indeed, supppose there exists a clause $D$ of $\Xi$ such that none of its 
1619: private variables appears in $C$. 
1620: 
1621: Because of the minimality of $\Xi$ there exists an assignment $F$ that satisfies $\Xi \setminus \{D\}$ but does not satisfy $D$ or $C$. Since $D$ has no implicates 
1622: of size two, there exists an assignment $G$, that differs from $F$ only on 
1623: the private variables of $D$, that satisfies $\Xi$. But since $C$ does not 
1624: contain any of the private variables of $D$, $F$ coincides with $G$ on variables in $C$. The conclusion is that $G$ does not satisfy $C$, which contradicts 
1625: the fact that $\Xi\models C$. 
1626: 
1627: The proof of Claim~\ref{expansion} (and of item 2. of Theorem~\ref{implicates:first-order}) follows: since for any clause $K$ of one of the original constraints $\mu(K)=1$, since $\mu(\Box)>\eta_{1}\cdot n$ and since w.l.o.g. $0<d<\eta_{1}$ (otherwise replace $d$ with the smaller value) 
1628: there exists a clause $C$ such that 
1629: \begin{equation}\label{intermediate}
1630: \mu(C)\in [d/2k \cdot n, d/k \cdot n].
1631: \end{equation}
1632: 
1633: Indeed, 
1634:  let $C^{\prime}$ be a clause in the resolution refutation of $\Phi$ minimal with 
1635: the property that $\mu(C^{\prime})> dn$. Then at least one clause $C$, of the 
1636: two involved in deriving $C^{\prime}$ satisfies equation~\ref{intermediate}.  
1637: 
1638: By the previous claim it 
1639: $C$ contains at least one ``private'' variable from each clause of $\Xi$. Therefore $|C|\geq \eta_{2}\cdot n$, with $\eta_{2}=d/2k\cdot \epsilon$. 
1640:  
1641: \end{proof}
1642: 
1643: \end{enumerate} 
1644:    
1645: \end{proof} 
1646: \qedbox
1647: 
1648: It is instructive to note that the condition in the theorem is violated 
1649: (as expected) by random 2-SAT, as well as by random 1-in-$k$ SAT: the 
1650: formula $C(x_{1}, x_{2}, \ldots, x_{k-1}, x_{k})\AND 
1651: C(\overline{x_{k}}, 
1652: x_{k+1},  \ldots , x_{2k-2}, x_{1})\AND C(\overline{x_{1}}, x_{2k-1}, 
1653: \ldots, x_{3k-3}, \overline{x_{k}})
1654: \AND C(x_{k}, x_{3k-2}, \ldots, x_{4k-4}, x_{1})$ (where $C$ is the 
1655: constraint ``1-in-$k$'') is minimally unsatisfiable, but 
1656: has clause/variable ratio $1/(k-1)$ and implicates 
1657: $\overline{x_{1}} \OR \overline{x_{k}}$ and $x_{1}\OR x_{k}$. 
1658: 
1659: 
1660: It would be tempting to speculate that whenever both $x\OR y$ and 
1661: $\overline{x}\OR \overline{y}$ are implicates of clauses in  ${\cal C}$ 
1662: then 
1663: $SAT({\cal P})$ has a second-order phase transitions for every 
1664: distribution ${\cal P}$ with $supp({\cal P})= {\cal C}$. That is, 
1665: however, {\em 
1666: not} true, at least 
1667: for some distributions ${\cal P}$. Consider the random $(2+p)$-SAT 
1668: model of Monasson et al. 
1669: \cite{2+p:nature}. In this model $p$ is a fixed real in $[0,1]$. A 
1670: random instances of $(2+p)$-SAT with $n$ 
1671: variables and $c\cdot n$ clauses is obtained by choosing $pcn$ random 
1672: clauses of length 3 and $(1-p)cn$ random clauses of length 2. 
1673: It was shown in \cite{2+p:nature} (using the nonrigorous replica 
1674: method) that
1675: \begin{enumerate}
1676: \item $(2+p)$-SAT has a second-order phase transition for $0\leq p \leq 
1677: p_{0}\sim 0.413...$. 
1678: \item the transition becomes first-order for $p > p_{0}$. 
1679: \item when the transition changes from second-order to first-order the 
1680: complexity 
1681: of a certain DPLL algorithm changes from polynomial to exponential. 
1682: \end{enumerate}
1683: 
1684: 
1685: 
1686: Several rigorous results have complemented these findings. Achlioptas 
1687: et al. \cite{2+psat:ralcom97} 
1688: have shown that for $0\leq p \leq 0.4$ the phase transition in 
1689: $(2+p)$-SAT only 
1690: depends on the ``2-SAT part''. One can perhaps use the techniques 
1691: of \cite{scaling:window:2sat} to confirm statement (i). 
1692: 
1693: 
1694: 
1695: It is easily seen that 
1696: $(2+p)$-SAT can be represented in Molloy's framework. 
1697: On the other hand Achlioptas, Beame and Molloy 
1698: \cite{achlioptas:beame:molloy} have shown that for 
1699: those $p$ for 
1700: which $(2+p)$-SAT does {\em not} behave like 2-SAT the resolution 
1701: complexity of 
1702: the problem is exponential. Using Theorem~\ref{second:order} and 
1703: and results in \cite{achlioptas:beame:molloy} we get: 
1704: 
1705: \begin{theorem}\label{2+p-sat:first-order}
1706: Let $p\in [0,1]$ be s.t. there exist $\epsilon >0$ and 
1707: $c<\frac{(1-\epsilon)}{1-p}$ s.t.  
1708: random instances of $(2+p)$-SAT with $n$ variables and $c\cdot n$ 
1709: clauses are w.h.p. unsatisfiable. Then $(2+p)$-SAT 
1710: has a first-order phase transition. 
1711: \end{theorem} 
1712: 
1713: 
1714: It would be interesting to obtain a complete characterization of the 
1715: order of the phase 
1716: transition in an arbitrary problem $SAT({\cal P})$. Such a 
1717: characterization, however, requires substantial advances: Exactly locating the 
1718: ``tricritical point'' $p_{0}$ in random $2+p$-SAT (or merely deciding 
1719: whether it is equal or not to 0.4) is an open problem. A complete 
1720: characterization would yield a solution to this problem as a byproduct. 
1721: 
1722: On the other hand Theorem~\ref{2+p-sat:first-order} suggests an 
1723: interesting conjecture: whenever the location of the phase transition  
1724: is {\em not} determined by the implicates of size at most two in the 
1725: given formula, the phase transition in $SAT({\cal P})$ is first-order. 
1726: Perhaps techniques in \cite{achlioptas:beame:molloy} can help settle 
1727: this question.   
1728: 
1729: 
1730: 
1731: \section{Discussion} 
1732: 
1733: We have shown that the existence of a first-order phase transition 
1734: in a random satisfiability problem is often correlated with a 
1735: $2^{\Omega(n)}$ peak in the complexity of resolution/DPLL algorithms at 
1736: the 
1737: transition point. 
1738: 
1739: As for the extent of the connection it is easy to see that it does not 
1740: extend to a substantially larger class of algorithms: consider random 
1741: $k$-XOR-SAT, the problem of testing the satisfiability of random 
1742: systems of linear equations of size $k$ over ${\bf Z}_{2}$.  
1743: $k$-XOR-SAT is a version of XOR-SAT, one of the polynomial time 
1744: cases of satisfiability from Schaefer's Dichotomy Theorem 
1745: \cite{schaefer-dich}. 
1746: Indeed, 
1747: it is easily solved by Gaussian elimination. But Ricci-Tersenghi et al. 
1748: \cite{zecchina:kxorsat} have presented 
1749: a non-rigorous argument using the replica method that supports 
1750: the existence of a first-order phase transition, and we can show 
1751: this formally (as a direct consequence of 
1752: Theorem~\ref{implicates:first-order}):   
1753: 
1754: \begin{proposition}
1755: Random $k$-XOR-SAT, $k\geq 3$, has a first-order phase transition. 
1756: \end{proposition} 
1757: 
1758: To sum up: {\bf the intuitive argument states that a first-order phase 
1759: transition correlates with a $2^{\Omega(n)}$ lower bound of the 
1760: complexity 
1761: of DPLL algorithms at the transition. This is true in many cases, and 
1762: the underlying reason is that the two phenomena (the jump in the 
1763: order parameter and the resolution complexity lower bound) have 
1764: common causes. However, at least for satisfiability problems, 
1765: this connection does not extend substantially beyond the class of 
1766: DPLL algorithms.} 
1767: 
1768: \section*{Acknowledgments} 
1769: 
1770: I thank Madhav Marathe, Anil Kumar and Cris Moore for useful comments. 
1771: In particular Cris made the observation that led to the realization 
1772: that 
1773: my previous results implied Theorem~\ref{second:order}. 
1774: 
1775: This work has been supported by the Department of Energy under contract 
1776: W-705-ENG-36.
1777: 
1778: %\bibliography{/u1/gistrate/bib/bibtheory}
1779: \bibliography{bibtheory}
1780: 
1781: 
1782: 
1783: \end{document}
1784: 
1785: 
1786: