0504:cond-mat0504070/prl9.tex

1: %\documentclass[a4paper,10pt,superscriptaddress,showpacs]{revtex4}

2:

3: \documentclass[aps, prl, twocolumn, superscriptaddress, showpacs]{revtex4}

4:

5: \usepackage{amsfonts,amssymb,amsmath,amsbsy}

6: \usepackage{epsfig}

7:

8: \newcommand{\ud}{\,\mathrm{d}}

9: \newcommand{\e}{\mathrm{e}}

10:

11: \newtheorem{theo}{Theorem}

12: \newtheorem{lem}{Lemma}

13: \newtheorem{conj}{Conjecture}

14: \newtheorem{fact}{Fact}

15: \newtheorem{claim}{Claim}

16: \newtheorem{nul}{}

17: \newtheorem{defi}{Definition}

18: \newtheorem{corr}{Corrolary}

19:

20: \vglue .7cm

21: \begin{document}

22: \title{Clustering of solutions in the random satisfiability problem.}

23:

24: % Evidence of clustering in spin glasses and combinatorial problems

25:

26: \author{M. M\'ezard}

27: \affiliation{Laboratoire de Physique Th\'eorique et Mod\`eles Statistiques,

28: b\^atiment 100, Universit\'e Paris-Sud, F--91405 Orsay, France.}

29: \author{T. Mora}

30: \affiliation{Laboratoire de Physique Th\'eorique et Mod\`eles Statistiques,

31: b\^atiment 100, Universit\'e Paris-Sud, F--91405 Orsay, France.}

32: \author{R. Zecchina}

33: \affiliation{Abdus Salam International Center for Theoretical Physics,

34: Strada Costiera 11, 34100 Trieste, Italy}

35:

36:

37: \date{\today}

38:

39: \begin{abstract}

40:

41: Using elementary rigorous methods we prove the existence of a clustered phase

42: in the random $K$-SAT problem, for $K\geq 8$. In this phase the solutions are grouped into

43: clusters which are far away from each other. The results are in agreement with

44: previous predictions of the cavity method and give a rigorous confirmation to

45: one of its main building blocks. It can be generalized to other systems of

46: both physical and computational interest.

47:

48:

49: \end{abstract}

50: %\pacs{75.10.Nr, 75.40.-s, 75.40.Mg}

51: \pacs{02.50.-r, 75.10.Nr, 89.70.+c, 05.20.-y}

52:

53: \maketitle

54:

55: Constraint satisfaction problems (CSPs) provide one of the main building blocks

56: for complex systems studied in computer science, information theory and

57: statistical physics, and may even turn out to be important in the

58: statistical studies of biological networks. Typically, they involve a large

59: number of discrete variables, each one taking a finite number of values, and a

60: set of constraints: each constraint involves a few variables, and forbids some

61: of their joint assignments. A simple example is the $q$-coloring of a graph,

62: where one should assign to each vertex of the graph a color in

63: $\{1,\dots,q\}$, in such a way that two vertices related by an edge have

64: different colors. In the case $q=2$, this is nothing but the zero temperature

65: limit of an antiferromagnetic problem, which is known to display a spin-glass

66: behaviour when the graph is frustrated and disordered. CSPs also appear naturally in the studies

67: of structural glasses \cite{sellittoetal} and rigidity percolation \cite{barreetal}.

68:

69: Given an instance of a CSP, one wants to know whether there exists a solution,

70: that is an assignment of the variables which satisfies all the constraints

71: (e.g. a proper coloring). When it exists the instance is called SAT, and one

72: wants to find a solution. Most of the interesting CSPs are NP-complete: in the

73: worst case the number of operations needed to decide whether an instance is

74: SAT or not is expected to grow exponentially with the number of variables. But

75: recent years have seen an upsurge of interest in the theory of typical-case

76: complexity, where one tries to identify random ensembles of CSPs which are hard

77: to solve, and the reason for this difficulty. Random ensembles of CSPs are

78: also of great theoretical and practical importance in communication theory:

79: some of the best error correcting codes (the so-called low density parity

80: check codes) are based on such constructions \cite{gallagher,mckaybook}.

81:

82: The archetypical example of CSP is Satisfiability (SAT). This is a core

83: problem in computational complexity: it is the first one to have been shown

84: NP-complete \cite{cook}, and since then thousands of problems have been shown

85: to be computationally equivalent to it. Yet it is not so easy to find

86: difficult instances. The main ensemble which has been used for this goal is

87: the random $K$-satisfiability ($K$-SAT) ensemble. The variables are $N$ binary

88: variables ---\,Ising spins\,--- $\vec\sigma=\{\sigma_i\}\in\{-1,1\}^{N}$. The

89: constraints are called $K$-clauses. Each of them involves $K$ distinct spin

90: variables, randomly chosen with uniform distribution, and it forbids one

91: configuration of these spins, randomly chosen among the $2^K$ possible ones. A

92: set of $M$ clauses defines the problem. This corresponds to generating a

93: random logical formula in conjunctive normal form, which is a very generic

94: problem appearing in logic. $K$-SAT can also be written as the problem of

95: minimizing a spin-glass-like energy function which counts the number of

96: violated clauses and in this respect random $K$-SAT is seen as a prototypical

97: diluted spin-glass \cite{Monasson-Zecchina}. Here we shall keep to the most

98: interesting case $K\ge 3$ (for $K=2$ the problem is polynomial).

99:

100: In the recent years random $K$-SAT has attracted much interest in computer

101: science and in statistical physics

102: \cite{AI,TCS,Selman-Kirkpatrick,Nature}. The interesting limit is

103:  the thermodynamic limit $N \to\infty,\  M \to \infty$

104: at fixed clause density $\alpha=M/N$. Its most striking feature is

105: certainly its sharp threshold.

106:  It is strongly believed that there exists a phase transition for this

107: problem:  Numerical and heuristic analytical

108: arguments are in support of the so-called

109:  \emph{Satisfiability Threshold Conjecture}:\\

110: {\it  There exists $\alpha_c(K)$ such that with high probability:\\

111: -- if $\alpha<\alpha_c(K)$, a random instance is satisfiable ; \\

112: -- if $\alpha>\alpha_c(K)$, a random instance  is unsatisfiable. \\}

113: In all this paper, `with high probability' (w.h.p.) means with a probability

114: going to one in the $N \to \infty$ limit. Although this conjecture remains

115: unproven, Friedgut has come close to it by establishing

116:  the existence of a non-uniform sharp threshold

117: \cite{Friedgut}. A lot of efforts have been devoted to understanding this

118: phase transition. This is interesting both from the physics point of view,

119: but also from the computer science one, because the random instances with $\alpha$

120: close to $\alpha_c$ are the hardest to solve. The most important rigorous

121: results so far are bounds for the threshold $\alpha_c(K)$. The best upper

122: bounds  were derived using first moment methods

123: \cite{kirousis,dubois}. Lower bounds can be found by

124: analyzing some algorithms which find SAT assignments \cite{franco,frieze-suen}, but recently a

125: new method, based on second moment methods, has found better and

126: algorithm-independent lower bounds \cite{achliomoore,achlioperes}. Using these bounds, it

127: was shown that $\alpha_c(K)$ scales as $2^K \ln(2)$ when $K\to\infty$.

128:

129:

130: On the other hand, the cavity method, which is a powerful tool from the

131: statistical physics of disordered systems \cite{Cavity}, is claimed to be able

132: to compute the exact value of the threshold \cite{MPZ,MZ,MMZ-RSA}, giving for

133: instance $\alpha_c(3)\simeq 4.2667...$ It is a non-rigorous method but the

134: self-consistency of its results have been checked by a `stability analysis'

135: \cite{montanariricci,monparric,MMZ-RSA}, and it also leads to the development

136: of a new algorithmic strategy, `survey propagation', which can solve very

137: large instances at clause densities which are very close to the threshold

138: (e.g. $N=10^6$ and $\alpha=4.25$).

139:

140: The main hypothesis on which the cavity analysis of random $K$-satisfiability

141: relies is the existence, in a region of clause density $[\alpha_d,\alpha_c]$

142: close to the threshold, of an intermediate phase called the `hard-SAT' phase.

143: In this phase the set $\cal S$ of

144: solutions (a subset of the vertices in the$N$-dimensional hypercube)

145:  is supposed to split into many disconnected

146: \emph{clusters} ${\cal S} = {\cal S}_1 \cup

147: {\cal S}_2\cup\dots$. If one considers two solutions $X,Y$ in the same

148: cluster ${\cal S}_j$, it is possible to walk from $X$ to $Y$ (staying in $\cal

149: S$) by flipping at each step a finite numbers of spins. If on the other hand

150: $X$ and $Y$ are in different clusters, in order to walk from $X$ to $Y$

151: (staying in $\cal S$), at least one step will involve an extensive number

152: (\emph{i.e.} $\propto N$) of spin flips.

153: This clustered  phase is held responsible for entrapping many local search

154: algorithms into non-optimal metastable states \cite{montanarisemerjian}. This

155: phenomenon is not exclusive to random $K$-SAT. It is also predicted to appear in

156: many other hard satisfiability and optimization

157: problems such as Coloring \cite{mulet,braunstein} or the Multi-Index Matching

158: Problem \cite{martinmezardrivoire}, and corresponds to a `one step replica

159: symmetry breaking' (1RSB) phase in the language of statistical physics. It is

160: also a crucial limiting feature for decoding algorithms in some  error correcting

161: codes~\cite{montanari}.

162: So far, the only CSP

163: for which the existence of the clustering phase has been established rigorously  is the simple

164: polynomial problem

165: of random XOR-SAT \cite{XORSAT-MRZ,XORSAT-CDMM}. In other cases it is an hypothesis,

166: the self-consistency of which is checked by the cavity method.

167:

168: In this paper we provide rigorous arguments which show the existence

169: of the clustering phenomenon in random $K$-SAT, for large

170: enough $K$, in some region of $\alpha$ included in the interval

171: $[\alpha_d(K),\alpha_c(K)]$ predicted by the statistical physics

172: analysis. Our result is not able to confirm all the details of this

173: analysis but it provides strong evidence in favour of its validity.

174:

175: Given an instance $F$ of random $K$-satisfiability, we define a SAT-$x$-pair as  a pair of

176: assignments $(\vec \sigma,\vec \tau)\in\{-1,1\}^{2N}$, which both

177: satisfy $F$, and which are at a Hamming distance

178: $ d_{\sigma\tau}\equiv\sum_{i=1}^{N}(1-\sigma_i\tau_i)/2$ specified by $x$ as follows:

179: \begin{equation}

180: d_{\sigma\tau}\in [Nx-\epsilon(N),Nx+\epsilon(N)]

181: %\nonumber

182: \end{equation}

183: Here $x$ is the normalized distance between the two

184: configurations, which we keep fixed as $N$ and $d$ go to infinity.

185: The  resolution $\epsilon(N)$ must be such that $\lim_{N \to \infty} \epsilon(N)/N=0$,

186: but its precise form is

187:  unimportant for our large $N$ analysis. One can choose for instance $\epsilon(N)=\sqrt{N}$.

188:

189: We call \emph{$x$-satisfiable} a formula for which such a pair of

190: solutions exists.

191: Our study mimicks the usual steps which are taken in rigorous studies of

192: $K$-SAT, but taking pairs of assignments at a fixed distance

193: instead of single assignments.

194:

195: We first formulate the \emph{$x$-Satisfiability Threshold Conjecture}:\\

196: {\it For all $K\geq 2$ and for all $x$, $0<x<1$, there exists an

197: $\alpha_c(K,x)$ such that w.h.p.:\\ -- if $\alpha<\alpha_c(K,x)$, a

198: random $K$-CNF is $x$-satisfiable; \\ -- if $\alpha>\alpha_c(K,x)$, a

199: random $K$-CNF is $x$-unsatisfiable,\\} which generalizes the usual

200: satisfiability threshold conjecture (obtained for $x=0$).  We shall find

201: explicitly below two functions, $\alpha_{LB}(K,x)$ and $\alpha_{UB}(K,x)$

202: which give lower and upper bounds for $\alpha$ for $x$-satisfiability at

203: a given value of $K$.

204: Numerical computations of

205: these bounds show that $\alpha(K,x)$ is \emph{non monotonous} as a function of

206: $x$ for $K\geq 8$, as illustrated in Fig.\ref{alpha8}.  This in turn

207: shows that, for $K$ large enough and in some well chosen interval of

208: $\alpha$ below the satisfiability threshold, SAT-$x$-pairs exist for $x$

209: close to $0$ ($\vec\sigma$ and $\vec\tau$ in the same cluster) and $x$ close to $.5$ ($\vec\sigma$ and $\vec\tau$ in different clusters), but there is an intermediate $x$

210: region where they do not exist. Fig.\ref{alpha8} shows an explicit example of this scenario for a particular value of $\alpha$.

211:

212:

213:

214: \begin{figure}

215: \begin{center}

216: \epsfig{file=curveK8.eps,width=8cm,angle=-0} \caption{\label{alpha8} \small

217: Lower and Upper Bounds for the $x$-satisfiability threshold $\alpha_c(K=8,x)$.

218: The upper curve is obtained by the first moment method. Above this curve there

219: exists no SAT-$x$-pair, w.h.p.. The lower curve is obtained by the second

220: moment method. Below this curve there exists a SAT-$x$-pair w.h.p.. For values

221: of $\alpha$ lying between $164.735$ and $170.657$, these bounds guarantee the

222: existence of a clustering phenomenon.

223: The horizontal line gives an example of this phenomenon for $\alpha=166.1$.

224:  We exhibit

225: the successive phases as one varies $x$: $x$-SAT regions

226: are represented by a thick solid line, $x$-UNSAT regions by a wavy line, and

227: ``don't know'' regions by a dotted line. The $x$-SAT region near $x=0$

228: corresponds to intra-cluster pairs, whereas the $x$-SAT region around $x=.5$

229: corresponds to inter-cluster pairs. In this example, the intermediate $x$-UNSAT region

230: around $x \sim .13$ shows

231:  the existence of a ``gap'' between clusters. We recall that the

232: best refined lower and upper bounds for the satisfiability threshold

233: $\alpha_c(K=8)$ from \cite{dubois,achlioperes} are respectively $173.253$ and

234: $176.596$. The cavity prediction is $176.543$ \cite{MMZ-RSA}. }

235: \end{center}

236:

237: \end{figure}

238:

239: In what follows we first establish a rigorous and explicit upper bound

240: using a simple first moment method.  Subsequently, we provide a

241: (numerical) lower bound using a second moment method

242: \cite{achliomoore,achlioperes}.  Both results are based on elementary

243: probabilistic techniques which could be generalized to other physical

244: systems or random combinatorial problems.

245:

246: {\it Upper bound: the first moment method.}  We use the fact that,

247: when $Z$ is a non-negative random variable:

248: \begin{equation}

249: \mathbf{P}(Z \geq 1)\leq \mathbf{E}(Z)\ .

250: \label{lemfirst}

251: %\nonumber

252: \end{equation}

253: Given a formula $F$, we take $Z(F)$ to be the number of pairs of solutions at fixed distance (with

254: resolution $\epsilon(N)$):

255: \begin{equation}\label{Z1}

256: Z(F)=\sum_{\vec \sigma,\vec \tau}\delta\left(\frac{d_{\sigma\tau}}{N}\simeq x \right)

257: \;\delta(\vec\sigma, \vec\tau \in S(F)) \; \ ,

258: \end{equation}

259: where

260: $S(F)$ is the set of solutions to $F$.  Throughout this

261: paper $\delta(A)$ is an indicator function, equal to $1$ if the

262: statement $A$ is true, and to $0$ otherwise.  Since $Z(F)\geq 1$

263: is equivalent to ``$F$ is $x$-satisfiable'', (\ref{Z1}) gives an upper

264: bound for the probability of $x$-satisfiability.

265: The expected value of the double sum over the  choice of a random $F$ is:

266: \begin{equation}

267: \mathbf{E}(Z(F))=2^N\binom{N}{Nx}{\mathbf{E}\left[\delta(\vec\sigma,\vec\tau\in S(c))\right]}^{M}.

268: %\nonumber

269: \end{equation}

270: We have used  $\delta(\vec\sigma,\vec\tau\in S(F))=\prod_c

271: \delta(\vec\sigma,\vec\tau\in S(c))$, where $c$ denotes  the

272:  clauses, and the fact that  clauses are drawn independently.

273: The expectation

274: $\mathbf{E}\left[\delta(\vec\sigma,\vec\tau\in S(c))\right]$ is equal to: $1-2^{1-K}+2^{-K}(1-x)^{K}$ (there

275: are only two realizations of the clause among $2^K$ that do not

276: satisfy $c$ unless the two configurations overlap exactly on the

277: domain of $c$).

278:

279: In the thermodynamic limit, $\ln \mathbf{E}(Z(F))/N \to \Phi_1(x,\alpha) $,

280: where:

281: \begin{eqnarray}

282: \Phi_1(x,\alpha)

283: =\ln 2+H_2(x) +\alpha\ln\left[1-2^{-K}(2-(1-x)^{K})\right],

284: \nonumber

285: \end{eqnarray}

286: where $H_2(x)=-x\ln x-(1-x)\ln(1-x)$ is the two-state entropy

287: function. This gives the upper bound:

288: \begin{equation}

289: \alpha_{UB}(K,x)=-\frac{\ln 2+H_2(x)}{\ln(1-2^{1-K}+2^{-K}(1-x)^{K})}.

290: \end{equation}

291:

292: {\it Lower bound: the second moment method. } We use the

293: fact that, when $Z$ is a non-negative random variable:

294: \begin{equation}

295: \mathbf{P}(Z>0)\geq \frac{\mathbf{E}(Z)^2}{\mathbf{E}(Z^2)}.

296: %\nonumber

297: \end{equation}

298: However using this formula with $Z$ equal to the number of solutions

299: fails, and one must instead use a weighted sum \cite{achliomoore}.

300: We follow the strategy recently developed in \cite{achlioperes}, which we generalize

301:  to SAT-$x$-pairs by taking:

302: \begin{equation}\label{Zw}

303: Z(F)=\sum_{\vec\sigma,\vec\tau}\delta\left(\frac{d_{\sigma\tau}}{N}\simeq x\right)

304:  \prod_c W(\vec\sigma,\vec\tau,c)\ .

305: %\nonumber

306: \end{equation}

307: $W(\vec\sigma,\vec\tau,c)$ is a weight associated with the clause $c$, given the couple $(\vec\sigma,\vec \tau)$, and is defined as follows: Suppose that $c$ is satisfied by $n_\sigma$ among the $K$ $\vec\sigma$-variables involved in $c$, and by $n_\tau$ among the $K$ $\vec\tau$-variables. Call $n_0$ the number of common values between the $\vec\sigma$- and $\vec\tau$-variables involved in $c$. Then define:

308: \begin{equation}

309: W(\vec\sigma,\vec\tau,c)=\left \{\begin{array}{ll}\lambda^{n_\sigma+n_\tau}\nu^{n_0}&\textrm{if }n_\sigma>0\textrm{ and }n_\tau>0,\\

310: 0& \textrm{otherwise.}\end{array}\right.

311: \label{Wdef}

312: \end{equation}

313: Note that with this definition of $Z$, the choice $\lambda=1,\nu=1$ simply yields the number of solutions (\ref{Z1}).

314:

315: Let us now compute the first two moments of $Z$ (\cite{MMZ_inprep}):

316: \begin{equation}

317: \mathbf{E}(Z)=2^N\binom{N}{Nx}\left[f_1^{(\lambda,\nu)}(x)\right]^M \ ,

318: \end{equation}

319: where $f_1^{(\lambda,\nu)}(x)=\mathbf{E} (W(\vec\sigma,\vec\tau,c))$ can be calculated by simple combinatorics (via multinomial sums). To compute $\mathbf{E}(Z^2)$, we sum over  four spin configurations $\vec\sigma,\vec\tau,\vec\sigma',\vec\tau'$. Symmetry allows to fix $\sigma_i=1$. Let $Na(t,s,t')$ be the number of sites $i$ such that $\tau_i=t$, $\sigma_i'=s'$ and $\tau_i'=t'$ (where $t,s,t' \in\{\pm 1\}$). It turns out that the term of the sum depends only on these 8 numbers $a(\pm 1,\pm 1,\pm 1)$. We collect them into a vector $\mathbf{a}$ and get:

320: \begin{equation}\label{E2}

321: \mathbf{E}(Z^2)=2^N \int_V d \mathbf{a} \; \frac{N!}{\prod_{t,s',t'}(Na(t,s',t'))!}

322: \left[f_2^{(\lambda,\nu)}(\mathbf{a})\right]^M \ ,

323: \end{equation}

324: where $f_2^{(\lambda,\nu)}(\mathbf{a})=\mathbf{E}(W(\vec\sigma,\vec\tau,c)W(\vec\sigma',\vec\tau',c))$ can

325: be calculated by simple combinatorics

326: in the same way as $f_1$. The integration set $V$ is a 5-dimensional simplex taking into account the normalization $\sum_{t,s',t'} a(t,s',t')=1$ and the two constraints: $d_{\sigma\tau}/N\simeq x$, $d_{\sigma'\tau'}/N\simeq x$.

327:

328: A saddle point evaluation of eq.(\ref{E2}) gives, for $N \to \infty$:

329: \begin{equation}\label{cond}

330: \frac{\mathbf{E}(Z)^2}{\mathbf{E}(Z^2)}\geq C_0\exp(-N\max_{\mathbf{a}\in V}\Phi_2(\mathbf{a})),

331: %\nonumber

332: \end{equation}

333: where $C_0$ is a constant depending on $K$ and $x$, and:

334: \begin{equation}\label{defphi}

335: \Phi(\mathbf{a})=H_8(\mathbf{a})-\ln 2-2H_2(x)+\alpha  \ln f_2^{(\lambda,\nu)}(\mathbf{a})-2 \alpha \ln f_1^{(\lambda,\nu)}(x),

336: %\nonumber

337: \end{equation}

338: with $H_8(\mathbf{a})=-\sum_{t,s',t'} a(t,s',t') \ln a(t,s',t')$. In general $\max_{\mathbf{a}\in V}\Phi(\mathbf{a})$ is non-negative

339: and one must choose appropriate weights

340:  $W(\vec \sigma,\vec \tau,c)$ in such a way that

341: $\max_{\mathbf{a}\in V}\Phi(\mathbf{a})= 0$.

342: We notice that at the particular point $\mathbf{a}^*$ where $(\vec\sigma,\vec\tau)$ is uncorrelated with $(\vec\sigma',\vec\tau')$, we have $\Phi(\mathbf{a}^*)=0$.

343: We  fix the  parameters $\lambda$ and $\mu$ defining the weights (\ref{Wdef})

344: in such a way that $\mathbf{a}^*$

345: be a local maximum of $\Phi$. This gives two algebraic equations in $\lambda$ and $\nu$

346: which  have a unique

347: solution $\lambda>0,$ $\nu>0$. Fixing

348: $\lambda$ and $\nu$ to these values, $\alpha_{LB}$ is the largest value of

349: $\alpha$ such that the local maximum at $\mathbf{a}^*$ is a \emph{global} maximum,

350: i.e. such that there exists no $\mathbf{a}\in V$ with $\Phi(\mathbf{a})>0$:

351: \begin{equation}\label{alpha2}

352: \alpha_{LB}(K,x)=\inf_{\mathbf{a}\in V}\frac{\ln

353:   2+2H_2(x)-H_8(\mathbf{a})}{\ln f_2^{(\lambda,\nu)}(\mathbf{a})-2\ln f_1^{(\lambda,\nu)}(x)},

354: \end{equation}

355:

356: We devised several numerical

357: strategies to evaluate

358: $\alpha_{LB}(K,x)$. The implementation of Powell's method starting from each

359: point of a grid of size $\mathcal{N}^5$ ($\mathcal{N}=10,15,20$) on

360: $V$ turned out to be the most efficient and reliable.  The results are

361: given by Fig.\ref{alpha8} for $K=8$, the smallest $K$ such that the

362: clustering conjecture is confirmed. We found a clustering phenomenon

363: for all the values of $K \ge 8$ that we checked, and in fact the

364: relative difference $[\alpha_{UB}(K,x)-\alpha_{LB}(K,x)]/ \alpha_{LB}(K,x)$

365: seems to go to zero at large $K$.

366:

367: We have shown a simple probabilistic argument which shows rigorously the existence of

368: a clustered `hard-SAT' phase. The prediction from the cavity method

369: is in fact a weaker statement. It can be stated

370:  in terms of the overlap distribution function $P(x)$, which is the

371: probability, when two SAT-assignments are taken randomly (with uniform

372: distribution), that their distance is given by $x$. The cavity method finds that

373: this distribution has a support concentrated on two values: a large value

374: $x_1$, close to one, gives the characteristic `radius' of a cluster, a smaller

375: value $x_0$ gives the characteristic distance between clusters. This

376:  does not imply that there exists no pair of solution for values of

377: $x$ distinct from $x_0,x_1$: it just means that such pairs are exponentially

378: less numerous than the typical ones. Our rigorous result shows that in fact

379: there exists a true gap in $x$, with no SAT-$x$-pairs, at least for $K \ge 8$.

380: More sophisticated moment computations might allow to get some results for

381: smaller values of $K$. Still the conceptual simplicity of our computation

382: makes it a useful tool for proving similar phenomena in other systems of

383: physical or computational interests, like for instance the graph-coloring

384: (antiferromagnetic Potts) problem.

385:

386: This work has been supported in part by the EC through the network

387: MTR 2002-00319 `STIPCO' and the FP6 IST consortium `EVERGROW'.

388:

389: \begin{thebibliography}{99}

390:

391: \bibitem{sellittoetal} M. Sellitto, G. Biroli and C. Toninelli, Europhys. Lett. {\bf 69}, 496 (2005).

392: \bibitem{barreetal} J. Barr\'e et al., cond-mat/0408385

393:

394:

395: \bibitem{gallagher} Robert G. Gallagher. {\it Information Theory and

396:   Reliable Communication.} Wiley, New York, 1968.

397:

398: \bibitem{mckaybook} David J.C. MacKay. {\it Information Theory,

399:   Inference \& Learning Algorithms.} Cambridge University Press,

400:   Cambridge, 2002.

401:

402: \bibitem{cook} Stephen Cook. The complexity of theorem proving

403:   procedures. In {\it Proceedings of the Third Annual ACM Symposium on

404:   Theory of Computing}, pages 151-158, 1971.

405:

406: \bibitem{Monasson-Zecchina} R. Monasson, R. Zecchina, Phys. Rev. E {\bf 56},

407: 1357 (1997).

408:

409: \bibitem{AI} T. Hogg, B.A. Huberman, C. Williams, C. (eds), Artificial

410: Intelligence {\bf 81} I \& II (1996).

411:

412: \bibitem{TCS} Special Issue on {\it NP-hardness and Phase

413: transitionls}, edited by O. Dubois, R. Monasson, B. Selman and

414: R. Zecchina, {Theor. Comp. Sci.} {\bf 265}, Issue: 1-2 (2001).

415:

416: \bibitem{Selman-Kirkpatrick}S. Kirkpatrick, B. Selman, {Science} {\bf

417:   264}, 1297 (1994).

418:

419: \bibitem{Nature} R. Monasson, R. Zecchina, S. Kirkpatrick, B. Selman,

420: and L. Troyanski, {Nature} {\bf 400}, 133 (1999).

421:

422: \bibitem{Friedgut} E. Friedgut, {Journal of the A.M.S.} {\bf 12}, 1017 (1999).

423:

424: \bibitem{kirousis}L. M. Kirousis, E. Kranakis, D. Krizanc. Technical report TR-96-09, School of Computer Science, Carleton University, 1996.

425:

426: \bibitem{dubois} O. Dubois, Y. Boufkhad, {J. Algorithms} {\bf

427:   24}, 395 (1997).

428:

429: \bibitem{franco} M.-T. Chao, J. Franco, {Inform. Sci.} {\bf

430:   51}(3), 289 (1990).

431:

432: \bibitem{frieze-suen} A. M. Frieze, S. Suen, {J. Algorithms} {\bf

433:   20}, 312 (1996).

434:

435: \bibitem{achliomoore} D. Achlioptas, C. Moore, Proc. Foundations of Computer Science (2002).

436:

437: \bibitem{achlioperes} D. Achlioptas, Y. Peres, {Journal of the

438:   AMS} {\bf 17} 947 (2004).

439:

440: \bibitem{Cavity}{M. M{\'e}zard, G. Parisi, G. , J. Stat. Phys. {\bf 111}

441:   (2003).}

442:

443: \bibitem{MZ} M. M\'ezard, R. Zecchina, Phys. Rev. E {\bf 66}, 056126

444: (2002).

445:

446: \bibitem{MPZ} M. M\'ezard, G. Parisi, R. Zecchina, Science {\bf 297}, 812

447: (2002).

448:

449: \bibitem{MMZ-RSA} S. Mertens, M. M\'ezard, R. Zecchina, Threshold

450: values of Random K-SAT from the cavity method, cs.CC/0309020 (2003),

451: to appear in Random Structure and Algorithms.

452:

453: \bibitem{montanariricci} A. Montanari, F. Ricci-Tersenghi, {Eur. Phys. J.} {\bf B 33}, 339 (2003).

454:

455: \bibitem{monparric} A. Montanari, G. Parisi, F. Ricci-Tersenghi, {J. Phys. A} {\bf 37}, 2073 (2004).

456:

457: \bibitem{montanarisemerjian} G. Semerjian, R. Monasson,

458: Proceedings of the SAT 2003 conference, E. Giunchiglia and A. Tacchella eds., Lecture Notes in Computer Science (Springer) 2919, 120 (2004).

459:

460: \bibitem{mulet} R. Mulet, A. Pagnani, M. Weigt, R. Zecchina, {Phys. Rev. Lett.} {\bf 89}, 268701 (2002).

461:

462: \bibitem{braunstein} A. Braunstein, R. Mulet, A. Pagnani, M. Weigt,

463:   R. Zecchina, {Phys. Rev. E} {\bf 68}, 036702 (2003).

464:

465: \bibitem{martinmezardrivoire} O. C. Martin, M. M\'ezard, O. Rivoire,

466: {Phys. Rev. Lett.} {\bf 93}, 217205 (2004).

467:

468: \bibitem{montanari}

469: A. Montanari,

470: {Eur. Phys. J.} {\bf B 23}, 121 (2001).

471:

472: \bibitem{XORSAT-MRZ} M. M\'ezard, F. Ricci-Tersenghi, R. Zecchina,

473: J. Stat. Phys. {\bf 111}, 505 (2003).

474:

475: \bibitem{XORSAT-CDMM} S. Cocco, O. Dubois, J. Mandler, R. Monasson,

476: Phys. Rev. Lett. {\bf 90}, 047205 (2003).

477:

478: \bibitem{MMZ_inprep} M. M\'ezard, T. Mora, R. Zecchina, in preparation.

479:

480:

481: \end{thebibliography}

482:

483:

484: \end{document}

485: