0305:math0305151/max.tex

1: \documentclass{article}

2:

3: \usepackage{verbatim,amsfonts,amsthm,epsfig,fullpage,amsmath,psfig,color}

4:

5: \begin{document}

6:

7:

8: \newcommand{\chek}[1]{{{#1}}}

9: \newcommand{\dima}[1]{{{#1}}}

10: %\newcommand{\dima}[1]{{\color{blue}{#1}}}

11: %\newcommand{\chek}[1]{{\color{red}{#1}}}

12:

13: \newtheorem{definition}{Definition}

14: \newtheorem{lemma}{Lemma}

15: \newtheorem{claim}{Claim}

16: \newtheorem{theorem}{Theorem}

17: \newtheorem{corollary}{Corollary}

18: \newtheorem{conjecture}{Conjecture}

19: \newtheorem{fact}{Fact}

20: \newtheorem{proposition}[lemma]{Proposition}

21:

22: \def\whp{w.h.p.}

23: \def\wupp{with uniformly positive probability}

24: \def\Whp{W.h.p.}

25: \def\eg{e.g.\ }

26: \def\ie{i.e.\ }

27:

28: \newcommand{\p}{p}

29: \newcommand{\q}{q}

30: \renewcommand{\log}{\ln}

31:

32: \newcommand{\viol}{\not\models}

33: \newcommand{\ra}{\rightarrow}

34:

35: \newcommand{\e}{\epsilon}

36: \newcommand{\dd}{\delta}

37:

38: \newcommand{\rc}{{c}}

39: \newcommand{\fc}{{\omega}}

40:

41: \newcommand{\Z}{Z}

42:

43: \newcommand{\up}{*}

44: \newcommand{\rr}{T}

45: \newcommand{\ttk}{t_k}

46:

47:

48: \newcommand{\amax}{\alpha_{\max}}

49: \newcommand{\gmax}{g_{\max}}

50: \def\a{\alpha}

51:

52: \def\e{\varepsilon}

53: \def\FF{F}

54:

55: \def\ex{{\mathbf E}}

56: \def\E{{\mathbf E}}

57: \newcommand{\one}{{\mathbf{1}}}

58: \newcommand{\PS}{\mathbf{P}}

59: \newcommand{\Pa}{{\mathbf P_{\sigma}}}

60: \newcommand{\Ea}{{\mathbf E_{\sigma}}}

61:

62: \title{On the Maximum Satisfiability of Random Formulas}

63:

64: \author{

65:     Dimitris Achlioptas\thanks{Part of this work was done while visiting UC Berkeley.} \\

66:     Microsoft Research, Redmond, Washington \\

67:     {\tt optas@microsoft.com}

68:     \and

69:     Assaf Naor \\

70:     Microsoft Research, Redmond, Washington \\

71:     {\tt anaor@microsoft.com}

72:     \and Yuval Peres\thanks{Research supported by NSF Grant

73: DMS-0104073 and a Miller Professorship at UC Berkeley.}\\

74: Departments of Statistics and Mathematics, University of California, Berkeley\\

75: {\tt peres@stat.berkeley.edu} }

76:

77: \date{\empty}

78:

79: \maketitle

80:

81: \begin{abstract}

82: Maximum satisfiability is a canonical NP-hard optimization problem

83: that \dima{appears empirically hard for random instances}. In

84: particular, its apparent hardness on random $k$-CNF formulas

85: \dima{of certain densities} was recently suggested by Feige as a

86: starting point for studying inapproximability. At the same time,

87: it is rapidly becoming a canonical problem for statistical

88: physics. In both of these realms, evaluating new ideas relies

89: crucially on knowing the maximum number of clauses one can

90: typically satisfy in a random $k$-CNF formula. In this paper we

91: give asymptotically tight estimates for this quantity.

92: Specifically, let us say that a $k$-CNF  is {\em $\p$-satisfiable}

93: if there exists a truth assignment satisfying $1-2^{-k}+\p 2^{-k}$

94: of all clauses (observe that every $k$-CNF is 0-satisfiable).

95: Also, let $F_k(n,m)$ denote a random \mbox{$k$-CNF} on $n$

96: variables formed by selecting uniformly and independently $m$ out

97: of all $2^k \binom{n}{k}$ possible $k$-clauses.

98:

99: Let \dima{$\tau(p) = 2^k \ln 2/({\p+(1-\p)\log (1-\p)})$}. It is

100: easy to prove that for every $k \geq 2$ and every $\p \in (0,1]$,

101: if $r \geq \tau(\p)$ then the probability that $F_k(n,rn)$ is

102: $\p$-satisfiable tends to 0 as $n \to \infty$. We prove that

103: \dima{there exists a sequence $\delta_k \to 0$ such that if $r

104: \leq (1-\delta_k) \tau(\p)$ then the probability that $F_k(n,rn)$

105: is $\p$-satisfiable tends to 1 as $n \to \infty$. The sequence

106: $\delta_k$ tends to 0 exponentially fast in $k$. Indeed, even for

107: moderate values of $k$, \eg $k=10$, our result gives very tight

108: bounds for the number of satisfiable clauses in a random $k$-CNF.

109: In particular, for $k>2$ it improves upon all previously known

110: such bounds.}

111: \end{abstract}

112:

113: \newpage

114:

115: \section{Introduction}

116: Given a Boolean CNF formula $F$, the Satisfiability problem is to

117: determine whether there exists a truth assignment that satisfies

118: $F$. When $F$ has exactly $k$ literals in each clause,

119: Satisfiability is known as $k$-SAT and is NP-complete~\cite{cook}

120: for all $k \geq 3$. A natural generalization of satisfiability is

121: determining whether there exists a truth assignment that satisfies

122: a given number of clauses in $F$. For $k$-CNF this problem is

123: known as Max $k$-SAT and is NP-complete for all $k\geq 2$

124: (see~\cite{GaJo79}).

125:

126: Optimization problems with random inputs are pervasive in

127: operations research (e.g., the travelling salesman problem and

128: variants), in statistical physics (determining ground states of

129: spin glasses) and in computer science. An interesting source of

130: Max $k$-SAT instances comes from considering $k$-CNF chosen

131: uniformly at random (see below). Historically, the motivation for

132: studying such formulas has been the desire to understand the

133: hardness of ``typical'' instances.

134:  Random $k$-CNF are

135: by now the most studied generative model for random formulas and

136: have been a very popular benchmark for testing and tuning

137: satisfiability algorithms. In fact, some of the better practical

138: ideas in use today come from insights gained by studying the

139: performance of algorithms on random

140: $k$-CNF~\cite{slm:gsat,sk:walksat,heavytail}.

141:

142: A natural starting point for considering Max $k$-SAT is the

143: observation that for every $k$-CNF formula there exists a truth

144: assignment satisfying at least $(1-2^{-k})$ of all clauses.

145: Indeed, if such a formula has $m$ clauses, the average over all

146: $2^n$ truth assignments of the number of satisfied clauses is

147: precisely $(1-2^{-k})m$. With this in mind, we will say that a

148: $k$-CNF formula is {\em $\p$-satisfiable}, where $p \in [0,1]$, if

149: there exists a truth assignment satisfying $1-2^{-k}+\p 2^{-k}$ of

150: all clauses.

151:

152: \dima{To consider random $k$-CNF formulas, let $C_k$ denote the

153: set of all $(2n)^k$ possible disjunctions of $k$ literals on some

154: canonical set of $n$ Boolean variables. To form a random $k$-CNF

155: formula $F_k(n,m)$ with $m$ clauses we select uniformly,

156: independently and with replacement $m$ clauses from $C_k$ and take

157: their conjunction\footnotemark[2].} We will say that a sequence of

158: random events ${\mathcal E}_n$ occurs {\em with high probability\/

159: } (w.h.p.) if $\lim_{n \ra \infty} \Pr[{\mathcal E}_n]=1$ and {\em

160: with uniformly positive probability\/ } if $\liminf_{n \ra \infty}

161: \Pr[{\mathcal E}_n]>0$. We emphasize that throughout the paper $k$

162: is arbitrarily large but fixed, while $n \rightarrow \infty$. For

163: every $k \geq 2$ and $\p \in (0,1]$, let

164: \begin{eqnarray*}

165: r_k(p)  & \equiv &

166:         \sup\{r : F_k(n,rn)\mbox{ is $p$-satisfiable    \whp}\} \\

167:         & \leq &

168:         \inf\{r : F_k(n,rn)\mbox{ is {\em not\/} $p$-satisfiable  \whp}\}

169:         \; \equiv \; r_k^\up(p) \enspace .

170: \end{eqnarray*}

171:

172: \footnotetext[2]{Our discussion and results hold in all common

173: models for random $k$-CNF, \eg when clause replacement is not

174: allowed and/or when each $k$-clause is formed by selecting $k$

175: distinct, non-complementary literals with/without ordering. The

176: model defined here is best suited for our calculations. We further

177: comment on its relationship to other models in the end of

178: Section~\ref{outline}.}

179:

180: One of the most intriguing aspects of random formulas is the {\em

181: Satisfiability Threshold Conjecture} which asserts that

182: $r_k(1)=r_k^\up(1)$ for every $k \geq 3$.  Much work has been done

183: to bound $r_k(1)$ and $r_k^\up(1)$. Currently, the best rigorous

184: bounds for general $k \geq 3$, from~\cite{yuval,DuBo}

185: respectively, are: $2^k \ln 2 - O(k) < r_k \le r_k^\up< 2^k \ln 2

186: - O(1)$. \dima{For $p<1$, the bounds for $r_k(\p), r_k^\up(\p)$

187: were much further apart.}

188:

189: The state of the art for general $k$ was presented in an important

190: recent paper by Coppersmith, Gamarnik, Hajiaghayi, and

191: Sorkin~\cite{sork}, where it was proved \dima{(see~\eqref{sor_or}

192: for a more precise formulation)} that there exists an absolute

193: constant $c>0$ such that for all $k$ and all $p \in (0, p_0(k)]$,

194: \begin{equation} \label{sor2}

195: \frac{c}{k} \,  \frac{2^{k+1} \ln 2}{\p^{2}} \le r_k(\p) \le

196: r_k^\up(\p) \leq \frac{2^{k+1} \ln 2}{\p^{2}(1+o(1))} \enspace .

197: \end{equation}

198:

199: % sufficiently large $r$, we have as $n \to \infty$,

200: %\begin{equation} \label{sor1}

201: %\frac{1}{\sqrt{k}} \times \sqrt{\frac{A_k}{r}} \le \Phi_k(n,rn)

202: %\le \sqrt{\frac{B_k}{r}} \enspace ,

203: %\end{equation}

204: %where $A_k/B_k \to 2/(\pi \ln 2)$. % and $A_k=\Theta(2^{-k/2})$. %(Explicitly,

205: %%$A_k=\frac{4k^2}{\pi (k+1)^2 2^k}-O(\sqrt{k/r})$ and

206: %%$B_k=\frac{(2^k-1) \ln 2}{2^{2k-1}}$.)

207: %These estimates are equivalent to the assertion that

208:

209: The upper bound in~\eqref{sor2} was proved via the first moment

210: method, while the lower bound is algorithmic. For small $k$ the

211: two are reasonably close, but the ratio between them tends to

212: infinity as $k$ grows; this naturally raises the question which

213: bound is closer to the truth. %Fortunately, for moderate and large values of

214: %$k$ where algorithms have the most difficulty, our result gives a

215: %satisfying answer.

216: Our main result \dima{resolves this question by pinpointing the

217: values of $r_k(\p)$ and $r_k^\up(\p)$ with relative error that

218: tends to zero exponentially fast in $k$.} For every $\p \in (0,1)$

219: denote

220: \begin{equation}\label{defrr}

221: \rr_k(\p)=\frac{2^k\ln 2}{\p+(1-\p)\log (1-\p)} \,,

222: \end{equation}

223: %we use the convention dictated by continuity that

224: %$0\log 0=0$

225: and let $\rr_k(1)=2^k\ln 2$ so that $\rr_k(\cdot)$ is continuous

226: on $(0,1]$.

227: \begin{theorem}\label{main}

228: There exists a  sequence $\delta_k=O(k 2^{-k/2})$, such that for

229: all $k \geq 2$ and $p \in (0,1]$,

230: \begin{equation}\label{eq:main}

231:  (1-\delta_k) \, {\rr}_k(\p) < r_k(\p)\le r_k^{\up}(\p)\le {\rr}_k(\p) \enspace .

232: \end{equation}

233: \end{theorem}

234: \dima{The upper bound in (\ref{eq:main}) follows from well-known

235: tail estimates}. Taylor expansion gives that as $p \to 0$,

236: $$

237: \rr_k(\p)=\frac{2^k\ln 2}{\p^2/2+O(\p^3)} \enspace ,

238: $$

239: %so the upper bound in Theorem~\ref{main} is consistent with (and

240: %slightly sharper than) the upper bound in~

241: so as $p \to 0$, we can sharpen \eqref{sor2} to

242: \begin{equation} \label{nsor2}

243:  (1-\delta_k) \frac{2^{k+1} \ln 2}{\p^{2}+O(p^3)} \le r_k(\p) \le

244: r_k^\up(\p) \leq \frac{2^{k+1} \ln 2}{\p^{2}+O(p^3)} \enspace .

245: \end{equation}

246:

247:

248: Our proof  of Theorem~\ref{main} actually yields an explicit lower

249: bound for $r_k(p)$ for each $k \geq 2$.

250: %%\chek{Also, one can

251: %``invert" the bounds to get upper and lower bounds on the fraction

252: %of satisfiable clauses as a function of density.} \marginpar{\tiny

253: %This is where our ``fraction satisfied" version goes.}

254: For $k =2$, \ie Max 2-SAT, the algorithm presented in~\cite{sork}

255: dominates our lower bound uniformly, \ie for every density it

256: satisfies a greater fraction of all clauses. Already for $k \geq

257: 3$, though, our methods yield a better bound, as indicated by the

258: following plots.

259:

260:

261:

262: \vspace*{0.7cm}

263:

264: \begin{minipage}{3in}

265: \begin{center}

266:         \centerline{\hbox{

267:         \psfig{figure=upper_lower_3sat.eps,height=1.5in,width=2in}

268:         }}

269:         \centerline{$k=3$}

270: \end{center}

271: \end{minipage}\    \

272: \begin{minipage}{3in}

273: \begin{center}

274:         \centerline{\hbox{

275:         \psfig{figure=upper_lower_4sat.eps,height=1.5in,width=2in}

276:         }}

277:         \centerline{$k=4$}

278: \end{center}

279: \end{minipage}

280:

281: \vspace*{0.6cm}

282:

283: \begin{minipage}{3in}

284: \begin{center}

285:         \centerline{\hbox{

286:         \psfig{figure=upper_lower_7sat.eps,height=1.5in,width=2in}

287:         }}

288:         \centerline{$k=7$}

289: \end{center}

290: \end{minipage}\    \

291: \begin{minipage}{3in}

292: \begin{center}

293:         \centerline{\hbox{

294:         \psfig{figure=upper_lower_10sat.eps,height=1.5in,width=2in}

295:         }}

296:         \centerline{$k=10$}

297: \end{center}

298: \end{minipage}

299:

300: \vspace*{-0.2cm}

301:

302:

303: \begin{center}

304: Figure 1. Upper and lower bounds for the density $r$ as a function

305: of $\q=1-p$.

306: \end{center}

307:

308:

309: Our approach in proving Theorem~\ref{main} is non-algorithmic,

310: based instead on a delicate application of the second moment

311: method to a random generating function in two variables. It is

312: notoriously difficult to obtain precise asymptotics from such

313: random multivariable generating functions; the fact that this is

314: possible for random Max $k$-SAT is technically due to the

315: surprising cancellation of four terms of equal magnitude in our

316: analysis, leaving only lower order terms. This cancellation hints

317: at the existence of some unexpected hidden structure in random Max

318: $k$-SAT; characterizing this structure combinatorially (rather

319: than just analytically) appears to us  worthy of further study.

320:

321: \subsection{Background}

322:

323: For a random formula $F_k(n,m)$, denote by $s_k(n,m)$ the random

324: variable equal to the maximum (over all truth assignments

325: $\sigma$) of the number of clauses satisfied by $\sigma$. Perhaps

326: the first rigorous study of random Max $k$-SAT appeared in the

327: work of Frieze, Broder and Upfal~\cite{BFU} where  it was shown

328: that $s_k(n,m)$ is sharply concentrated

329: around its mean. Specifically, %using standard concentration

330: %inequalities, \eg Azuma's inequality, it follows that

331: \begin{theorem}[{\cite{BFU}}]\label{conc}

332: $\displaystyle{ \Pr\biggl[\bigl|s_k(n,m) - \ex[s_k(n,m)]\bigr| > t

333: \biggr] < 2 \exp(-2t^2/m)}$.

334: \end{theorem}

335:

336:

337: The following corollary allows us to infer high probability

338: results from positive probability results.

339: \begin{corollary}\label{booster}

340: If $F_k(n,rn)$ is $\p_0$-satisfiable \wupp, then for every

341: constant $\p<\p_0$, $F_k(n,rn)$ is $\p$-satisfiable \whp\

342: \end{corollary}

343: \begin{proof}

344: Let $S \equiv (1-2^{-k}+\p_0 2^{-k})rn$. Since $F_k(n,rn)$ is

345: $\p_0$-satisfiable \wupp, $\ex[s_k(n,rn)] > S - n^{2/3}$. For,

346: otherwise, Theorem~\ref{conc} would imply that the probability of

347: $\p_0$-satisfiability is exponentially small. By the same token,

348: $\Pr[s_k(n,rn) < S- 2  n^{2/3}] =o(1)$, implying the claim.

349: %

350: % Moreover,

351: %observe that for any constants $r < r_0$ we have $\ex[s_k(n,rn)] =

352: %\ex[s_k(n,r_0 n)] -\Omega(n)$. To see this assume, without loss of

353: %generality, that $F_k(n,r_0 n)$ is generated by first generating

354: %$F_k(n, r n)$ and then adding $(r_0-r)n$ random clauses. Clearly,

355: %any fixed assignment \whp\ will satisfy $\Omega(n)$ of the

356: %$(r_0-r)n$ random clauses added in the second phase; and this is

357: %true, in particular, for any assignment that satisfies a maximum

358: %number of clauses in $F_k(n,r n)$. Thus, by invoking

359: %Theorem~\ref{conc} again, we conclude that if $F_k(n,rn)$ is

360: %$\p$-satisfiable \whp, for otherwise the probability that

361: %

362: \end{proof}

363:

364: Thus it will suffice to find, for every $\p \in (0,1]$, a value

365: $r=r(\p)$ such that $F_k(n,rn)$ is $\p$-satisfiable with uniformly

366: positive probability and rely on Corollary~\ref{booster} to get a

367: high probability result.

368:

369: \dima{Regarding the mean in Theorem~\ref{conc},} in view of the a

370: priori bound $s_k(n,m) \ge (1-2^{-k})m$, it is natural to consider

371: $\Phi_k(n,m)=\E[s_k(n,m)]-(1-2^{-k})m$, measuring how much the

372: optimum truth assignment does better than the a priori bound in

373: expectation (over random $k$-CNF). In~\cite{sork} it was shown

374: that for all $k$, for sufficiently large $r$, as $n \to \infty$

375: one has in $F_k(n,rn)$

376: \begin{equation} \label{sor1}

377: \frac{2}{k+1}\sqrt{\frac{k}{\pi 2^k}}\times {\sqrt{r}}-O(1) \le

378: \frac{\Phi_k(n,rn)}{n} \le \sqrt{\frac{(2^k-1) \ln 2}{2^{2k-1}}}

379: \times \sqrt{r}  \enspace .

380: \end{equation}

381: This is equivalent to the assertion that for $\p$ sufficiently

382: small,

383: \begin{equation}\label{sor_or}

384: \frac{k2^{k+2}}{\pi (k+1)^2}\times p^{-2} -O(\p^{-1}) \le r_k(\p)

385: \le r_k^\up(\p) \le 2(2^k-1) \ln 2

386:  \times \p^{-2} \enspace ,

387: \end{equation}

388: \dima{which is a more precise formulation of~\eqref{sor2}.}

389:

390: Since for $k=2$, the threshold for satisfiability is known, namely

391: $r_2(1)=r_2^{\up}(1)$=1, in~\cite{sork} very fine results were

392: derived for $s_k(n,rn)$ when $r \approx 1$. In particular, when

393: $r=1+\varepsilon$ one has $\E[s_2(n,m)] =

394: (1+\varepsilon-O(\varepsilon^3))n$, while for large $r>1$ the

395: bound in~\eqref{sor1} can be improved to

396: \[

397: \frac{\sqrt{8}-1}{3\sqrt{\pi}} \times {\sqrt{r}} -O(1) \le

398: \frac{\Phi_2(n,rn)}{n} \le \sqrt{\frac{3 \ln 2}{8}} \times

399: {\sqrt{r}} \enspace .

400: \]

401:

402:

403:

404:

405: \dima{Another intriguing aspect of random $k$-CNF formulas is

406: their proof complexity. In a seminal paper, Chv\'{a}tal and

407: Szemer\'{e}di~\cite{ChvSz} proved that for all $k \geq 3$ and

408: $r>2^k \ln 2$ there exists $\e=\e(r)$ such that \whp\ every

409: resolution refutation of $F_k(n,rn)$ contains at least $(1+\e)^n$

410: clauses. Since then there have been a number of extensions of this

411: result~\cite{bp:resolve,BKPS} and it is widely believed that

412: random $k$-CNF are hard for much stronger proof systems than

413: resolution. Indeed, recently, Feige~\cite{feiger} showed that a

414: hypothesis asserting that proving unsatisfiability of random

415: $k$-CNF with $r \gg 2^k \ln 2$ is hard, implies a number of strong

416: inapproximability results. A closely related hypothesis is that

417: approximating Max $k$-SAT for such formulas is also hard for all

418: $k \geq 2$. Recent work by Fernandez De la Vega and

419: Karpinski~\cite{karpinski} proves that one can approximate Max

420: 3-SAT on $F_3(n,rn)$ within $9/8$ which is better than the trivial

421: $8/7$ bound.}

422:

423:

424:

425: \section{Outline}\label{outline}

426:

427: \subsection{Understanding correlation sources in MAX $k$-SAT}

428: The following easy consequence of the Cauchy-Schwarz inequality

429: underlies the second moment method.

430: \begin{lemma}\label{lemma:sec}

431: For any non-negative random variable $X$,

432: \begin{equation}\label{eq:smm}

433: \Pr[X > 0] \,\ge\, \frac{\ex[X]^2}{\ex[X^2]} \enspace .

434: \end{equation}

435: \end{lemma}

436:

437: Thus, for any fixed $\p \in (0,1]$ one can let $X$ denote the

438: number of $\p$-satisfying assignments and

439: apply~\eqref{lemma:sec}  to bound \mbox{$\Pr[X>0]$} from below.

440: Unfortunately, it turns out that for every

441: $r>0$, there exists a constant $\beta=\beta(k,r)>1$ such that $\ex[X^2]

442: > \beta^n \ex[X]^2$. As a result, this straightforward approach

443: only gives a trivial lower bound on the probability of

444: $\p$-satisfiability.

445:

446: In~\cite{yuval}, it was shown that in the case $\p=1$ a major

447: factor in the excessive correlations behind the above failure is

448: the following form of populism: leaning toward the majority vote

449: truth assignment. To see this, first observe that truth

450: assignments that satisfy more literal occurrences than average,

451: have higher probability of being satisfying. At the same time, in

452: order to satisfy many literal occurrences such assignments tend to

453: agree with each other (and the majority truth assignment) on more

454: than half the variables. As a result, the successes of such

455: assignments tend to be highly correlated, thus dominating

456: $\ex[X^2]$. In order to avoid this pitfall, we would like, as in

457: \cite{yuval}, to apply the second moment method to truth

458: assignments that satisfy, approximately, half of all literal

459: occurrences; we call such truth assignments  {\em ``balanced''}.

460: In the context of $\p$-satisfiability, however, there are new

461: obstacles \chek{to overcome before obtaining} a lower bound for

462: $r_k(p)$ that asymptotically matches the upper bound.

463: %\subsection{Weighted second moments: a transform}

464: %Recall that our plan for establishing $\p$-satisfiability for

465: %$F_k(n,m)$ is to prove that the number of balanced

466: %\footnote{I

467: %think that talking about non-populism solves a lot of semantic

468: %problems in the context of a weighting scheme. For one thing,

469: %formally, it is on such assignments that we actually focus. That

470: %said, a better name is definitely needed? ``Contrarian" sounds too

471: %negative. ``Left wing"? ;-)}  $\p$-satisfying assignments has small

472: %variance.

473: To capture the behavior of balanced truth assignments we begin by

474: defining two ``fitness'' gauges.\medskip

475:

476: Given any $k$-CNF formula ${\FF}$ on $n$ variables

477: % and a fixed value of $\p$, recall that ${\cal S}({\FF})$ is the set of

478: %$\p$-satisfying truth assignments of ${\FF}$. Given

479: and any truth assignment $\sigma \in \{0,1\}^n$ let

480: \begin{enumerate}

481: \item

482: $H= H(\sigma,{\FF})$ be the number of satisfied {literal

483: occurrences} in ${\FF}$ under $\sigma$, minus the number of

484: unsatisfied literal occurrences in ${\FF}$ under $\sigma$.

485: \item

486: $U= U(\sigma,{\FF})$ be the number of unsatisfied clauses in

487: ${\FF}$ under $\sigma$.

488: \end{enumerate}

489:

490: We would like to focus on truth assignments that are

491: balanced and $p$-satisfying, up to fluctuations

492: one would expect from a central limit theorem, i.e.,  truth assignments

493: $\sigma$ such that

494: \begin{align}

495: |H(\sigma,F)| & \leq A\sqrt{m} \label{iopu}\\

496: |U(\sigma,{\FF}) (1-\p)2^{-k} m| & \leq A\sqrt{m} \label{colopu} \enspace .

497: \end{align}

498:

499: \chek{To do this let us write} $u_0 \equiv (1-\p)2^{-k}$ and fix

500: some $\gamma,\eta < 1$. \chek{Now,} for a random $k$-CNF formula

501: $\FF$, consider the weighted sum $F$

502: \[

503:     X = X(\gamma,\eta) = \sum_{\sigma} \gamma^{H(\sigma,{\FF})} \eta^{U(\sigma,{\FF})-u_0 m}

504: \enspace .

505: \]

506: Since $\gamma,\eta<1$ we see that in $X$ \chek{the} truth

507: assignments $\sigma$ for which $H(\sigma,F)>0$ or

508: $U(\sigma,{\FF})>u_0m$ are suppressed exponentially, whereas

509: \chek{the rest} are rewarded exponentially. Decreasing

510: $\gamma,\eta \in [0,1)$ makes this phenomenon more and more acute,

511: with the limiting case $\gamma, \eta = 0$ corresponding to a 0-1

512: weighting scheme (we adopt the convention $0^0\equiv 1$). Indeed,

513: applying the second moment method to $X$ with $\eta =0$

514: corresponds to the approach of~\cite{yuval} for the random $k$-SAT

515: threshold, where only satisfying assignments receive non-zero

516: weight $\gamma^{H(\sigma,F)}$. A key step in our analysis,

517: presented in Subsection \ref{tune}, is the tuning of the

518: parameters $\gamma,\eta$ to focus on truth assignments $\sigma$

519: for which (\ref{iopu}) and (\ref{colopu}) hold. Before doing that,

520: we establish the upper bound in Theorem~\ref{main}.

521:

522: \subsection{The upper bound in Theorem \ref{main}}

523: This upper bound can be \chek{readily established by using the

524: entropic-form Chernoff bound for the Binomial}

525: (see Lemma A.10 in \cite{AS} or Lemma 3.8 in \cite{DM}),

526: but it is more

527: informative to give a self-contained argument.

528: % for the following, more precise, statement.

529: Recall the definition of $\rr_k(\cdot)$

530: from (\ref{defrr}).

531:

532: \begin{lemma}\label{lemma:upper}

533: For all $k \geq 2$ and $\p \in (0,1]$, if $\q=1-p$ then

534: \begin{equation}\label{eq:upper}

535: r_k^{\up}(\p)\le \frac{2^k \log 2}

536: {\q \log \q -(2^{k}-\q)\log\Big(\frac{2^k-1}{2^k-q}\Big)}

537: \le {\rr}_k(\p) \enspace .

538: \end{equation}

539: \end{lemma}

540: \begin{proof}

541: The right hand inequality of (\ref{eq:upper}) follows from the

542: inequality $\log t \le t-1$ applied to $t=\frac{2^k-1}{2^k-q}$,

543: so we just need to verify the  left hand inequality. To do that,

544: write $u_0=2^k \q$. Let $\eta \in (0,1)$, and

545: observe that if  $\FF$ is $\p$-satisfiable, then $U(\sigma,{\FF})

546: \le u_0 m$ for some $\sigma$, whence

547: $$

548: X(1,\eta)=  \sum_{\sigma} \eta^{U(\sigma,{\FF})-u_0 m} \ge 1 \,.

549: $$

550: From (\ref{eq:mean}) in the next section we have that

551: \begin{equation}\label{eq:up2}

552: \PS[X(1,\eta) \ge 1] \le \ex[X(1,\eta)]=2^n \eta^{-\q rn 2^{-k}}

553:   \Big(1- (1-\eta)2^{-k}\Big)^{rn} \,.

554: \end{equation}

555: Thus, the probability of $\p$-satisfiability decays exponentially

556: in $n$ if the \chek{the $n$-th root of the RHS of} \eqref{eq:up2}

557: is strictly smaller than $1$.

558: %So if  $\p$-satisfiability holds with uniformly positive probability,

559: %we must have

560: %By the definition of $r_k^*(\p)$, we must have

561: %\begin{equation}\label{eq:up3}

562: %\log 2 \ge r_k^*(\p) \Big[\q 2^{-k} \log(\eta)-\log\Big(1-

563: %(1-\eta)2^{-k}\Big)\Big] \,.

564: %\end{equation}

565: Taking $\eta=\q (2^k-1)/(2^k-\q)$

566: %\chek{minimizes that last quantity} and

567: yields the lemma.

568: \end{proof}

569:

570:

571: \subsection{Tuning parameters and truncation} \label{tune}

572:

573: When $\eta>0$, attempting to apply the

574: second moment method to $X$ we encounter two major problems.

575:

576: The first problem is that while $X>0$ implies satisfiability when

577: $\eta=0$, when $\eta >0$ having $X>0$ does not imply

578: $\p_0$-satisfiability:

579: %even when $r$ is such that most of the contribution to $\ex[X]$ comes

580: %from assignments falsifying about $u_0m$ clauses:

581: in principle, $X$ could be positive due to the

582: contribution of assignments falsifying many more clauses than $u_0

583: m$. This necessitates restricting the sum defining $X$ to truth

584: assignments falsifying at most $u_0m+O(\sqrt{m})$ clauses, \ie

585: truncating $X$.

586:

587: The second, more severe, problem is that with or without this

588: truncation, $\ex[X]^2/\ex[X^2]$ becomes exponentially small when

589: $r$ is only, roughly, half the (asymptotically optimal) lower

590: bound of Theorem~\ref{main}.  Rather counterintuitively, we will

591: be able to delay this explosion until $r$ is within $1-o(1)$ of

592: the upper bound by also removing from the sum those ``heroic"

593: truth assignments falsifying {\em fewer\/} than $u_0 m$ clauses.

594: This affords us much tighter control of pairs of assignments that

595: agree on nearly all variables, which turn out to be the dominant

596: contributors to $\ex[X^2]$ as we approach the upper bound. The

597: idea behind this sacrifice is motivated by Cramer's classical

598: ``change of measure'' technique in large deviation theory. The

599: corresponding ``adaptive weighting" scheme requires an extremely

600: sharp asymptotic analysis, involving a number of rather miraculous

601: cancellations. Due to space limitations this analysis appears

602: entirely in the Appendix.

603:

604: Specifically, for some fixed $A>0$ let

605: $${\cal S^*}= \{ \sigma \in \{0,1\}^n :

606: H(\sigma, {\FF}) \geq 0 \mbox{ \rm and } \, U(\sigma,\FF) \in [u_0

607: m ,u_0 m +A\sqrt{m}] \}  \, .

608: $$

609: Moreover, given $u_0$, let $\gamma_0,\eta_0$ be defined by

610: \begin{eqnarray}

611: 1-\eta_0  & = & (1-\gamma_0^2)(1+\gamma_0^2)^{k-1} \nonumber\\

612: \label{eq:defzero} \\

613: u_0  & = & \frac{\eta_0}{(1+\gamma_0^2)^k-(1-\eta_0)} \nonumber

614: \enspace .

615: \end{eqnarray}

616: These two equations are designed so that the main contribution in

617: the sum defining $X$ comes from truth assignments  for which

618: (\ref{iopu}) and (\ref{colopu}) holds. The connection is made in

619: equations (\ref{EH}) and (\ref{EU}) in Section \ref{doobya}.

620:

621: \chek{We define}

622: \[

623: X_*  = \sum_{\sigma \in {\cal S^*}}

624: \gamma_0^{H(\sigma,F)}\eta_0^{U(\sigma,F)-u_0 m} \enspace .

625: \]

626: Note that, by definition, when $X_*>0$ at least one truth

627: assignment must falsify at most $u_0 m + A\sqrt{m}$ clauses. Thus,

628: if for a given $\p_0$ we can prove that there exists a constant

629: $D>0$ such that $\ex[X_*^2] < D \times \ex[X_*]^2$ then, by

630: Corollary~\ref{booster}, it follows that $F_k(n,rn)$ is \whp\

631: $\p$-satisfiable for all $\p<\p_0$.

632:

633: Bounding the second moment of $X_*$ will be accomplished in the

634: following lemmata. For $\alpha \in [0,1]$, let

635: \begin{flalign}\label{eq:deff}

636: \mbox{} \!\!\!\!\! f(\alpha,\gamma,\eta) =\eta^{-2u_0}\left[

637:     \left(\a \left(\frac{\gamma^2+\gamma^{-2}}{2}\right)+1-\a

638:     \right)^k

639:     - 2 (1-\eta)\left(\frac{\a\gamma^{-2}+(1-\a)}{2}\right)^k

640:     + (1-\eta)^2 \left(\frac{\a\gamma^{-2}}{2}\right)^k\right]

641: \end{flalign}

642: and

643: \begin{equation}

644:  g_r(\alpha,\gamma,\eta) = \frac{f(\alpha,\gamma,\eta)^r}{\alpha^{\alpha}(1-\alpha)^{1-\alpha}}\enspace .

645: \end{equation}\medskip

646: In all of the following lemmata $k \geq 2$ is a fixed integer and

647: $r>0$.

648: \begin{itemize}

649: \item

650: Lemma~\ref{middle} with $\gamma=\gamma_0$ and $\eta=\eta_0$ gives

651: us $\ex[X(\gamma_0,\eta_0)]^2$ which is $\ex[X_*]^2$ but for the

652: truncation.

653: \item

654: Lemma~\ref{doobya2} asserts that for every value of $u_0$,

655: $\ex[X_*]$ is a constant fraction of $\ex[X(\gamma_0,\eta_0)]$.

656: Thus, combined with Lemma~\ref{middle}, it gives us $\ex[X_*]^2$

657: up to a constant factor (which is all we need).

658: \item

659: Lemma~\ref{ola} expresses $\ex[X_*^2]$ as a sum with $n+1$ terms,

660: the $z$-th term capturing the contribution of the $2^n

661: \binom{n}{z}$ pairs of truth assignments with overlap $z$. The

662: contribution of each such pair is then bounded by

663: $f(z/n,\gamma,\eta)^{rn}$ where $\gamma,\eta$ are allowed to {\em

664: depend on $z$}, subject only to $\gamma \geq \gamma_0$ and $\eta

665: \geq \eta_0$ respectively. In other words, Lemma~\ref{ola} allows

666: us to adapt $\gamma$ and $\eta$ to $\alpha$, which is crucial when

667: $\p <1$.

668: \item

669: Lemma~\ref{ksekola} is based on the fact that for any ``smooth"

670: choice of sequences $\gamma(z),\eta(z)$, the sum in

671: Lemma~\ref{ola} will be dominated by the contribution of the

672: $\Theta(n^{1/2})$ terms around the maximum term. Specifically, if

673: $\chi,\omega$ express our adaptive scheme for $\gamma,\eta$, then

674: we can use the Laplace method to get that the maximum of

675: $g_r(\alpha,\chi(\alpha),\omega(\alpha))$ over $\alpha \in (0,1)$,

676: characterizes the sum in Lemma~\ref{ola} up to a constant factor.

677: \end{itemize}

678:

679:

680: \begin{lemma}\label{middle}

681: For every $u_0,\gamma,\eta \in [0,1)$,

682: \[

683: \ex[X]^2 = \Bigl(2\,g_r(1/2,\gamma,\eta)\Bigr)^n \enspace .

684: \]

685: \end{lemma}

686:

687: \begin{lemma}\label{doobya2}

688: For every $u_0$, there exists $\theta = \theta(k,A)>0$ such that

689: as $n \to \infty$,

690: \[

691: \frac{\ex[X_*]}{\ex [X(\gamma_0,\eta_0)]} \to \theta \enspace .

692: \]

693: \end{lemma}

694:

695:

696: \begin{lemma}\label{ola}

697: Let $\gamma(z),\eta(z)$ be arbitrary sequences such that

698: $\gamma(z) \geq \gamma_0$ and $\eta(z) \geq \eta(0)$ for every $0

699: \leq z \leq n$. Then, for every $u_0$,

700: \[

701: \ex[X_*^2] \leq 2^n \sum_{z=0}^n \binom{n}{z}

702: f(z/n,\gamma(z),\eta(z))^{rn} \enspace .

703: \]

704: \end{lemma}

705:

706: \begin{lemma}\label{ksekola}

707: Let $\chi : [0,1] \rightarrow [\gamma_0,1)$ and $\omega : [0,1]

708: \rightarrow [\eta_0,1)$ be arbitrary piecewise-smooth functions

709: and let $g_r(\alpha) = g_r(\alpha,\chi(\alpha),\omega(\alpha))$.

710: If there exists $\amax \in (0,1)$ such that $g_r(\amax) \equiv

711: \gmax > g_r(\alpha)$ for all $\alpha \neq \amax$, and

712: $g''_r(\amax)< 0$, then there exists a constant $D =

713: D_{\chi,\omega}(k,r,u_0)>0$ such that for all sufficiently large

714: $n$

715: \[

716: \ex[X_*^2] < D \times {

717:     \Bigl(2 \,

718:             \gmax

719:     \Bigr)^n}\enspace .

720: \]

721: \end{lemma}

722:

723: Combining Lemmata~\ref{middle}--\ref{ksekola} we  see that if for

724: a given $u_0$ and $r$ there exist $\chi,\omega$ such that for all

725: $\alpha \neq 1/2$

726: \begin{equation}\label{masterp}

727: g_{r}\Bigl(1/2,\gamma_0,\eta_0\Bigr) >

728: g_{r}\Bigl(\alpha,\chi(\alpha),\omega(\alpha)\Bigr)

729: \end{equation}

730: then $\ex[X_*^2] < D \theta^{-2} \times \ex[X_*]^2$, yielding the

731: desired conclusion $\ex[X_*^2] = O(\ex[X_*]^2)$.

732: \bigskip

733:

734: Indeed, to prove  Theorem~\ref{main} we will show that for every

735: $\p \in (0,1]$ and for the stated $r = r(\p)$, there exist

736: functions $\chi,\omega$ for which~\eqref{masterp} holds. To

737: simplify the asymptotic analysis, we use the crudest possible such

738: functions, paying the price of this simplicity in the value of

739: $k_0$ in Proposition~\ref{prop:main} below. We note that by

740: choosing a more refined (and more cumbersome) adaptation of

741: $\gamma,\eta$ to $\alpha$ this value can be improved greatly.

742: Moreover, we emphasize that for any fixed value of $k$, one can

743: get a sharper lower bound (such as those reported in the

744: Introduction) by partitioning $[0,1]$ to a large number of

745: intervals and numerically finding a good value of $\gamma,\eta$

746: for each one. We discuss this point further in

747: Section~\ref{finitek}. Finally, we note that general large

748: deviations considerations imply that for every $k$ and $\p$, the

749: condition~\eqref{masterp} is sharp for our method. That is, no

750: better lower bound can be derived by considering balanced

751: assignments and, in fact, by any argument that classifies

752: assignments according to their number of satisfied literal

753: occurrences in the formula.

754: \begin{definition}\label{skolio}

755: Let $\q = 1-\p_0 = u_0 2^k$ and let

756: \begin{eqnarray}\label{eq:defrk}

757: \ttk=\frac{2^k\ln 2}{1-\q+\q\log

758: \q}\left(1-20k2^{-k\varphi(\q)}\right)\quad \mathrm{where}\quad

759: \varphi(\q)=\frac{(1-\sqrt{\q})^2}{1-\q+\q\log \q} \enspace .

760: \end{eqnarray}

761: \end{definition}

762: Theorem~\ref{main} will follow from the following

763: Proposition.

764: \begin{proposition}\label{prop:main}

765: Let

766: \begin{eqnarray}\label{eq:defE}

767: G_r(\alpha)=\left\{\begin{array}{ll} g_r(\alpha,\gamma_0,\eta_0)

768: &\mbox{ if }

769: \alpha \in \left[\frac{3\log k}{k}, 1-\frac{3\log k}{k}\right]\\

770: \\

771: g_r(\alpha, \sqrt{\gamma_0},\sqrt{\eta_0}) & \mbox{ otherwise.}

772: \end{array}\right.

773: \end{eqnarray}

774: For all $k\ge k_0$, if\/ $r\le \ttk$ then $G_r''(1/2)<0$ and

775: $G_r(1/2)>G_r(\alpha)$ for all $\alpha\neq 1/2$.

776: \end{proposition}

777:

778:

779: \bigskip

780:

781: The proof of Proposition~\ref{prop:main}, itself, will be

782: decomposed into three lemmata of increasing difficulty. The first

783: lemma holds for any $\gamma,\eta$ and reduces the proof to the

784: case $\alpha \geq 1/2$. The second lemma reflects the behavior of

785: $f$ (and thus $g_r$) around $\a = 1/2$, motivating the judicious

786: choice $\eta=\eta_0$ and $\gamma=\gamma_0$ for $G_r$. The third

787: lemma deals with $\alpha$ near 1. That case needs a lot more work

788: in order to handle the unique local maximum of $g_r$ in that

789: region. The condition $r \leq \ttk$ and the change to

790: $\gamma=\sqrt{\gamma_0}, \eta=\sqrt{\eta_0}$ aims precisely at

791: keeping the value of $g_r$ at this other local maximum smaller

792: than $g_r(1/2,\gamma_0,\eta_0)$.

793:

794:

795: \begin{lemma}%[Reduction to the interval ${ \bf[1/2,1]}$]

796: \label{lem:biggerthanhalf} For every $0< x\le \frac12$,

797: $G_r(1/2+x)>G_r(1/2-x)$.

798: \end{lemma}

799:

800: \begin{lemma}%[Reduction to large ${\bf \alpha}$]

801: \label{lem:decreasing} For all $k\ge k_0$, if\/ $r\le

802: \frac{2^k\log 2}{1-\q+\q\log \q}$ then $G_r''(1/2)<0$ and $G_r$ is

803: strictly decreasing on $\left[\frac{1}{2},1-\frac{3\log

804: k}{k}\right]$.

805: \end{lemma}

806:

807: \begin{lemma}%[Dealing with ${\bf \alpha}$ near $1$]

808: \label{lem:bigalpha} For all $k \geq k_0$, if  $r\le \ttk$ then

809: for every $\alpha\in \left[1-\frac{3\log k}{k},1\right]$,

810: $G_r(1/2)>G_r(\alpha)$.

811: \end{lemma}

812:

813: In the following sections we prove

814: Lemmata~\ref{middle}--\ref{ksekola}, while

815: Lemmata~\ref{lem:biggerthanhalf}--\ref{lem:bigalpha} are proven in

816: the appendix. Before delving into the probabilistic calculations

817: involved in proving Lemmata~\ref{middle}--\ref{ola} a couple of

818: remarks are in order.\medskip

819:

820:

821:

822: \noindent {\bf Relationship to other $k$-CNF models:} Recall that

823: the $m$ clauses of $F_k(n,m)$ are chosen independently with

824: replacement among the $(2n)^k$ possibilities. Thus, the $m$

825: clauses $\{c_i\}_{i=1}^m$ are i.i.d.\ random variables, each $c_i$

826: being the conjunction of $k$ i.i.d.\ random variables

827: $\{\ell_{ij}\}_{j=1}^k$, each $\ell_{ij}$ being a uniformly random

828: literal. This viewpoint of the formula as a sequence of $km$

829: i.i.d.\ random literals will be very handy for our calculations.

830:

831: Clearly, in this model some clauses might be improper, \ie they

832: might contain repeated and/or contradictory literals. At the same

833: time, though, observe that the probability that any given clause

834: is improper is smaller than $k^2/n$ and, moreover, the proper

835: clauses are uniformly selected among all such clauses. Therefore

836: \whp\ the number of improper clauses is $o(n)$ implying that if

837: for a given $r$, $F_k(n,rn)$ is $\p$-satisfiable \whp\ then for

838: $m=rn-o(n)$, the same is true in the model where we only select

839: among proper clauses. The issue of selecting clauses without

840: replacement is completely analogous as \whp\ there are $o(n)$

841: clauses that contain the same $k$ variables as some other clause.

842: \medskip

843:

844:

845: \noindent {\bf Notation:} In the ensuing probabilistic

846: calculations it will be convenient to write $\sigma \viol F$ to

847: denote that the truth assignment $\sigma$ violates the formula $F$

848: where $F$ can be a literal, a clause, or an entire CNF.

849:

850:

851: \section{The first moment and proof of Lemma~\ref{middle}}

852:

853: By linearity of expectation and since the $m = rn$ clauses

854: $c_1,c_2,\ldots,c_m$ are chosen independently we have

855: \begin{eqnarray}

856: \eta^{u_0 m}  \, \ex[X]   & = &  \ex \left[\sum_{\sigma}  \gamma^{H(\sigma,{\FF})}  \eta^{U(\sigma,{\FF})}\right] \nonumber \\

857: %            & = & \sum_{\sigma}  \ex[\gamma^{H(\sigma,{\FF})} \one_{\sigma \in {\cal S(\FF)}}] \\

858:             & = &  \sum_{\sigma}   \ex\left[\prod_{c_i}

859:                             \gamma^{H(\sigma,c_i)}  \eta^{U(\sigma,c_i)}

860:                     \right] \nonumber \\

861:             & = & \sum_{\sigma}   \prod_{c_i}\ex

862:                     \left[

863:                             \gamma^{H(\sigma,c_i)}  \eta^{U(\sigma,c_i)}

864:                     \right] \enspace . \label{eq:sumonday}

865: \end{eqnarray}

866: %

867: %

868: %                    \nonumber \\

869: %            & = & \eta^{-u_0 r n} \, 2^n \,

870: %\psi(\gamma,\eta)^{rn} \enspace . \label{eq:sq_exp_psi}

871:

872: Observe now that since the clauses are identically distributed, by

873: symmetry, it suffices to consider the expectation

874: in~\eqref{eq:sumonday} for a single random clause $c= \ell_1 \vee

875: \cdots \vee \ell_k$ and a fixed truth assignment $\sigma$.

876: Moreover, observe that if we write $\gamma^H \eta^U$ as $\gamma^H

877: + \gamma^H(\eta^U-1)$ we see that the second expression is

878: non-zero only when $U>0$, \ie when $c$ is violated by $\sigma$.

879: So, since the literals $\ell_1,\ldots,\ell_k$ are i.i.d.\ we get

880: \begin{eqnarray}

881:     \ex\left[\gamma^{H(\sigma,c)} \eta^{U(\sigma,c)}\right] & = &

882:     \ex\left[\gamma^{H(\sigma,c)} - \gamma^{H(\sigma,c)}

883: \left(1-\eta^{U(\sigma,c)}\right)\right]\nonumber  \\

884:     & = & \ex\left[ \gamma^{H(\sigma,c)}\right] -

885:  \ex\left[\gamma^{-k} (1-\eta) \, \one_{\sigma \viol c}\right] \nonumber \\

886: & = &     \ex\left[\prod_{\ell_i}\gamma^{H(\sigma,\ell_i)}\right]

887: - 2^{-k} \gamma^{-k} (1-\eta) \nonumber \\

888: & = &  \prod_{\ell_i}\ex\left[\gamma^{H(\sigma,\ell_i)}\right]

889: - 2^{-k} \gamma^{-k} (1-\eta) \nonumber \\

890: & = & \left(\frac{\gamma+\gamma^{-1}}{2}\right)^k - (2\gamma)^{-k}

891: (1-\eta)

892: \nonumber \\

893: & \equiv & \Z(\gamma,\eta) \label{eq:defZ} \enspace .

894: \end{eqnarray}

895: Thus,

896: \begin{equation} \label{eq:mean}

897: \ex[X] = \eta^{-u_0 rn} \, 2^n \, \Z(\gamma,\eta)^{rn} \,.

898: \end{equation}

899: Observe now that

900: \[

901: \left(\eta^{-u_0}\Z(\gamma,\eta)\right)^2 = f(1/2,\gamma,\eta)

902: \enspace .

903: \]

904: Therefore,

905: \[

906: \ex[X]^2 = \left(\eta^{-u_0 rn} \, 2^n \,

907: \Z(\gamma,\eta)^{rn}\right)^2 =

908:  \left[\left(\eta^{-u_0 r} \, 2 \,

909: \Z(\gamma,\eta)^r\right)^2\right]^n = \left[4 \,

910: f(1/2,\gamma,\eta)^r\right]^{n} = \left[2 \,

911: g_r(1/2,\gamma,\eta)\right]^n \enspace .

912: \]

913:

914:

915:

916: \section{Proof of Lemma~\ref{doobya2}} \label{doobya}

917:

918: By linearity of expectation, it suffices to prove that there

919: exists some $\theta=\theta(k,A)>0$ such that for the values of

920: $\gamma_0, \eta_0$ satisfying \eqref{eq:defzero} and every truth

921: assignment $\sigma$, we have

922: \begin{equation} \label{favor}

923:     \frac{\ex\left[\gamma_0^{H(\sigma,\FF)}\eta_0^{U(\sigma,\FF)}

924: \one_{\sigma \in {\cal S^*({\FF})}}\right ]}

925:          {\ex \left[\gamma_0^{H(\sigma,\FF)}\eta_0^{U(\sigma,\FF)}\right]}

926:  \rightarrow \theta \enspace .

927: \end{equation}

928:

929: Recalling that formulas in our model are sequences of i.i.d.\

930: random literals $\ell_1,\ldots,\ell_{km}$, let $\PS(\cdot)$ denote

931: the probability assigned by our distribution to any such sequence,

932: \ie $(2n)^{-km}$. Now, fix any truth assignment $\sigma$ and

933: consider an auxiliary distribution $\Pa$ on $k$-CNF formulas where

934: the $m$ clauses $c_1,\ldots,c_m$ are again i.i.d.\ among all

935: $(2n)^k$ clauses, but where now for any fixed clause $\fc$

936: \begin{equation} \label{doob2}

937: \Pa(\rc_i=\fc) =

938: \frac{\gamma_0^{H(\sigma,\fc)}\eta_0^{U(\sigma,\fc)}\PS(\fc)}

939: {\Z(\gamma_0,\eta_0)} \enspace ,

940: \end{equation}

941: where

942: \begin{equation}\label{defZ2}

943: \Z(\gamma_0,\eta_0) =

944: \ex\left[\gamma_0^{H(\sigma,\rc)}\eta_0^{U(\sigma,\rc)} \right]

945: %=\left(\frac{\gamma_0+\gamma_0^{-1}}{2}\right)^k -

946: %(2\gamma_0)^{-k} (1-\eta_0)

947: \enspace ,

948: \end{equation}

949: was defined in~(\ref{eq:defZ}). (Since each fixed clause $\fc$

950: receives probability proportional to

951: $\gamma_0^{H(\sigma,\fc)}\eta_0^{U(\sigma,\fc)}$, indeed

952: $Z(\gamma_0,\eta_0)$ provides the correct normalization to a

953: probability distribution.) So, whereas under $\PS(\cdot)$ every

954: $k$-CNF formula ${\FF}$ with $m$ clauses had the same probability

955: $\PS(F)=(2n)^{-km}$, under $\Pa$ its probability is

956: \begin{equation} \label{doob3} \Pa({\FF}) =

957: \frac{\gamma_0^{H(\sigma,{\FF})}\eta_0^{U(\sigma,{\FF})}

958: \PS({\FF})} {\Z(\gamma_0,\eta_0)^m} \enspace .

959: \end{equation}

960:

961: Let $\Ea$ be the expectation operator corresponding to $\Pa$. A

962: calculation similar to that leading to (\ref{eq:defZ}), adding the

963: equal contributions from the $k$ literals, gives that for a single

964: random clause $c$

965: \begin{equation} \label{EH}

966: \Z(\gamma_0,\eta_0)\Ea[H(\sigma,\rc)]= k(\gamma_0-\gamma_0^{-1})

967: \left(\frac{\gamma_0+\gamma_0^{-1}}{2}\right)^{k-1}

968: +k(2\gamma_0)^{-k} (1-\eta_0) \enspace .

969: \end{equation}

970: Moreover,

971: \begin{equation} \label{EU}

972: \Z(\gamma_0,\eta_0)\Ea[U(\sigma,\rc)]= (2\gamma_0)^k \eta_0 \,.

973: \end{equation}

974: Thus~\eqref{eq:defzero} ensures that $\Ea[H(\sigma,\rc)] = 0$ and

975: also that $\Ea[U(\sigma,\rc)-u_0] = 0$.\medskip

976:

977: Next, we apply the multivariate central limit theorem (see, e.g.\

978: \cite{pollard}, page 182) to the i.i.d.\ mean-zero random vectors

979: $\Big(H(\sigma,\rc_i),U(\sigma,\rc_i)-u_0\Big)$ for

980: $i=1,\ldots,m$. Observe that, since $k \ge 2$,  the common law of

981: these random vectors is not supported on a line. We deduce that as

982: $n \rightarrow \infty$

983: $$

984: \Pa[\sigma \in {\cal S}^*(\FF)]=\Pa\Big[H(\sigma,{\FF}) \geq 0 \mbox{ \rm

985: and } U(\sigma,\FF) \in [m u_0 ,m u_0+A\sqrt{m}]\Big]

986:  \rightarrow \theta(k,A)>0 \enspace .

987: $$

988: Here, the right hand side is the probability that a certain

989: nondegenerate bivariate normal law assigns to a certain open set.

990: Its exact value is unimportant for our purpose. By (\ref{doob3}),

991: this is equivalent to (\ref{favor}).

992:

993: \section{Proof of Lemma~\ref{ola}}

994:

995: Linearity of expectation implies

996: \begin{eqnarray}

997: \eta_0^{2u_0 m} \, \ex [X_*^2]     & = & \ex \left[\left(\sum_{\sigma}  \gamma_0^{H(\sigma,{\FF})}  \eta_0^{U(\sigma,{\FF})} \, \one_{\sigma \in {\cal S^*}({\FF})}\right)^2 \right] \nonumber \\

998:             & = &   \sum_{\sigma,\tau} \,\ex \left[\gamma_0^{H(\sigma,{\FF})+H(\tau,{\FF})}

999:             \eta_0^{U(\sigma,{\FF})+U(\tau,{\FF})}\, \one_{\sigma,\tau \in {\cal S^*}({\FF})}

1000:         \right] \label{eq:doldrum} \enspace .

1001: \end{eqnarray}

1002: Observe now that since $\sigma \in {\cal S}^*$ implies

1003: $H(\sigma,F) \geq 0$ and $U(\sigma,F) \geq u_0 m$, we get that for

1004: every pair $\sigma,\tau$ and any $\gamma \ge \gamma_0$ and $\eta

1005: \ge \eta_0$,

1006: \begin{eqnarray}

1007:     \ex \left[\gamma_0^{H(\sigma,{\FF})+H(\tau,{\FF})}

1008:             \eta_0^{U(\sigma,{\FF})+U(\tau,{\FF})}\, \one_{\sigma,\tau \in {\cal S^*}({\FF})}

1009:         \right]    &\le&

1010: \ex\left[\gamma^{H(\sigma,{\FF})+H(\tau,{\FF})}

1011: \eta^{U(\sigma,{\FF})+U(\tau,{\FF})} \, \one_{\sigma,\tau \in

1012: {\cal S^*}({\FF})}\right] \nonumber \\

1013:    &\le&

1014:  \ex\left[\gamma^{H(\sigma,{\FF})+H(\tau,{\FF})}

1015: \eta^{U(\sigma,{\FF})+U(\tau,{\FF})}\right] \enspace .

1016: \label{eq:choice}

1017: \end{eqnarray}

1018: In other words, when using the right hand side

1019: of~\eqref{eq:choice} to bound each term of the sum

1020: in~\eqref{eq:doldrum}, we are allowed to {\em adapt\/} the value

1021: of $\gamma$ and $\eta$ to the pair $\sigma,\tau$, the only

1022: restrictions being $\gamma \geq \gamma_0$ and $\eta \geq \eta_0$.

1023: This is a crucial point and we will exploit it heavily when

1024: bounding the contribution of pairs with large overlap.

1025: \medskip

1026:

1027: To estimate the right hand side of~\eqref{eq:choice} for any pair

1028: $\sigma,\tau$ we first observe that since the $m$ clauses

1029: $c_1,c_2,\ldots,c_m$ are i.i.d., letting $c$ be a single random

1030: clause we have

1031: \begin{eqnarray}

1032: \ex\left[\gamma^{H(\sigma,{\FF})+H(\tau,{\FF})}

1033: \eta^{U(\sigma,{\FF})+U(\tau,{\FF})}\right]        & = &

1034:         \ex\left[\prod_{c_i} {\gamma^{H(\sigma,c_i)+H(\tau,c_i)}

1035: \eta^{U(\sigma,c_i)+U(\tau,c_i)}}

1036:                     \right] \nonumber \\

1037:             & = &  \prod_{c_i}\ex

1038:                     \left[{\gamma^{H(\sigma,c_i)+H(\tau,c_i)}

1039: \eta^{U(\sigma,c_i)+U(\tau,c_i)}}

1040:                     \right]\nonumber \\

1041:                     & = &

1042:                     \biggl(\ex \left[{\gamma^{H(\sigma,c)+H(\tau,c)}

1043: \eta^{U(\sigma,c)+U(\tau,c)}}

1044:                     \right]\biggr)^m \label{eq:monday} \enspace .

1045: \end{eqnarray}

1046:

1047: Next, we observe that for every pair $\sigma,\tau$, by symmetry,

1048: the expectation in~\eqref{eq:monday} depends only on the number of

1049: variables to which $\sigma,\tau$ assign the same value. So, let

1050: $\sigma,\tau$ be any pair of truth assignments that agree on

1051: exactly $z = \alpha n$ variables, \ie have overlap $z$. By first

1052: rewriting (again) $\gamma^H \eta^U$ as $\gamma^H +

1053: \gamma^H(\eta^U-1)$ and then observing that $\eta^{U(\tau,c)}$ is

1054: distributed identically with $\eta^{U(\sigma,c)}$ we  get

1055: \begin{align}

1056: \lefteqn{\ex

1057:     \biggl[

1058:               \gamma^{H(\sigma,{c})}\eta^{H(\sigma,{c})} \gamma^{H(\tau,{c})}  \eta^{H(\tau,{c})}

1059:     \biggr]} \nonumber \\

1060: \mbox{} \quad \mbox{} & = \ex

1061:     \Biggl[

1062:               \biggr(\gamma^{H(\sigma,c)}-\gamma^{H(\sigma,c)}\left(1-\eta^{U(\sigma,c)}\right)\biggr)

1063:               \biggr(\gamma^{H(\tau,c)}-\gamma^{H(\tau,c)}\left(1-\eta^{U(\tau,c)}\right)\biggr)

1064:     \Biggr]

1065: \nonumber \\

1066: & =        \ex\biggl[\gamma^{H(\sigma,c)+H(\tau,c)}\biggr]

1067:             - 2 \,

1068:             \ex\biggl[\gamma^{H(\sigma,c)+H(\tau,c)}\left(1-\eta^{U(\sigma,c)}\right)\biggr]+

1069: \ex\biggl[\gamma^{H(\sigma,c)+H(\tau,c)}\left(1-\eta^{U(\sigma,c)}\right)\left(1-\eta^{U(\tau,c)}\right)\biggr]

1070: \nonumber

1071:             \\

1072: & =        \ex\left[\gamma^{H(\sigma,c)+H(\tau,c)}\right]

1073:             - 2(1-\eta)

1074:             \ex\left[\gamma^{H(\sigma,c)+H(\tau,c)} \one_{\sigma \viol c}\right]+

1075:             2^{-k}\alpha^k \gamma^{-2k} (1-\eta)^2   \enspace .

1076:             \label{eq:merge}

1077: \end{align}

1078:

1079: Now, to estimate~\eqref{eq:merge} we note that since the literals

1080: $\ell_1,\ell_2,\ldots \ell_k$ comprising $c$ are i.i.d.\ we have

1081: \begin{align*}

1082:     \ex\left[\gamma^{H(\sigma,c)+H(\tau,c)}

1083:     \right] & =

1084:     \ex\left[

1085:             \prod_i \gamma^{H(\sigma,\ell_i)+H(\tau,\ell_i)}

1086:         \right]

1087:       =

1088:         \prod_i \ex\left[\gamma^{H(\sigma,\ell_i)+H(\tau,\ell_i)}

1089:         \right]

1090:         = \left(\a \left(\frac{\gamma^2+\gamma^{-2}}{2}\right)+1-\a

1091:         \right)^k \nonumber \\

1092: \intertext{and, similarly,}

1093: \ex\left[\gamma^{H(\sigma,c)+H(\tau,c)} \one_{\sigma \viol c}

1094:     \right] & =

1095:     \ex\left[

1096:             \prod_i \gamma^{H(\sigma,\ell_i)+H(\tau,\ell_i)} \one_{\sigma

1097:             \viol \ell_i}

1098:         \right]

1099:      =

1100:         \prod_i \ex\left[\gamma^{H(\sigma,\ell_i)+H(\tau,\ell_i)} \one_{\sigma \viol \ell_i}

1101:         \right]  = \left(\frac{\a\gamma^{-2}+(1-\a)}{2}

1102:         \right)^k .

1103: \end{align*}

1104: Substituting these last two equations in~\eqref{eq:merge} we get

1105: \begin{align} \lefteqn{\eta ^{-2u_0}\,\ex

1106:     \left[

1107:               \gamma^{H(\sigma,{c})}\eta^{H(\sigma,{c})} \gamma^{H(\tau,{c})}  \eta^{H(\tau,{c})}

1108:     \right]} \nonumber \\

1109: & = \nonumber \eta^{-2u_0}\left[

1110:     \left(\a \left(\frac{\gamma^2+\gamma^{-2}}{2}\right)+1-\a

1111:     \right)^k

1112:     - 2 (1-\eta)\left(\frac{\a\gamma^{-2}+(1-\a)}{2}\right)^k

1113:     + (1-\eta)^2 \left(\frac{\a\gamma^{-2}}{2}\right)^k\right] \\

1114: & =  f(\alpha,\gamma,\eta) \enspace . \label{eq:gotcha}

1115: \end{align}

1116:

1117: So, in conclusion, since the number of ordered pairs with overlap

1118: $z$ is $ 2^n \,\binom{n}{z}$ we get that

1119: \begin{equation}

1120:  \ex[X_*^2] \leq 2^n \sum_{z=0}^n  \binom{n}{z} f(z/n,\gamma(z),\eta(z))^{m}

1121:  \enspace ,

1122: \label{eq:naesum}

1123: \end{equation}

1124: for any set of choices for $\gamma(z),\eta(z)$ such that

1125: $\gamma(z) \geq \gamma_0$ and $\eta(z)\geq \eta_0$ for all $0\leq

1126: z \leq n$.\bigskip

1127:

1128: \subsection{Proof of Lemma~\ref{ksekola}}

1129:

1130: If $\chi : [0,1] \rightarrow [\gamma_0,1]$ and $\omega : [0,1]

1131: \rightarrow [\gamma_0,1]$ are piecewise smooth, then from the

1132: definition of $f$ we see that

1133: $f(\alpha,\chi(\alpha),\omega(\alpha))$ is also piecewise smooth.

1134: Thus, we can decompose the sum in~\eqref{eq:naesum} into a fixed

1135: number of sums such that $f(\alpha,\chi(\alpha),\omega(\alpha))$

1136: is smooth in the range of each sum. To bound each such sum, then,

1137: we use the following lemma whose proof is implied by the proof of

1138: Lemma~2 in~\cite{naesat} (that lemma is stated with the

1139: requirement that $f$ is analytic, a condition not needed for the

1140: proof; in fact, it suffices for $f$ to only be twice

1141: differentiable.) The idea is that each of these sums is dominated

1142: by the contribution of $\Theta(n^{1/2})$ terms around the maximum

1143: term. Since the number of sums is finite the lemma follows.

1144: \begin{lemma} Let $\phi$ be any

1145: real, positive, twice-differentiable function on $[0,1]$ and let

1146: $$

1147: S_n = \sum_{z=0}^n \binom{n}{z} \,\phi(z/n)^n \enspace .

1148: $$

1149: Letting $0^0 \equiv 1$, define $g$ on $[0,1]$ as

1150: \[

1151: g(\alpha) = \frac{\phi(\alpha)}

1152:                  {\alpha^\alpha \,(1-\alpha)^{1-\alpha}} \enspace

1153:                  .

1154: \]

1155: If there exists $\amax \in (0,1)$ such that $g(\amax) \equiv \gmax

1156: > g(\alpha)$ for all $\alpha \neq \amax$, and $g''(\amax) < 0$,

1157: then there exist constants $B, C > 0$ such that for all

1158: sufficiently large $n$

1159: \[

1160: B \times \gmax^n \,\le\, S_n \,\le\, C \times \gmax^n \enspace .

1161: \]

1162: \end{lemma}

1163:

1164: \section{Bounds for finite {\large{$k$}}}\label{finitek}

1165:

1166: As mentioned in Section~\ref{outline}, for small values of $k$ the

1167: simple adaptation scheme of Proposition~\ref{prop:main} does not

1168: yield the best possible lower bound for $\p$-satisfiability

1169: afforded by our method. For that, one has to use a significantly

1170: more refined adaptation of $\gamma,\eta$ with respect to $\alpha$.

1171: Our lower bounds reported in Figure 1 are, indeed, the result of

1172: performing such optimization of $\gamma,\eta$ numerically (for

1173: both the upper bound plots and the plots of the lower bound

1174: from~\cite{sork} we used the explicit formulas).

1175:

1176: Specifically, to create the plots of the lower bounds we computed

1177: a lower bound for 100 equally spaced values of $p$ on the

1178: horizontal axis (and then had Maple's~\cite{maple} plotting

1179: function ``connect the dots"). For each of these values of $p$, to

1180: prove the corresponding lower bound for $r$ we had to establish

1181: that there exist a choice of functions $\chi,\omega$ as in

1182: Lemma~\ref{ksekola} such that for all $\alpha \in (1/2,1]$ we have

1183: $g_{r}(1/2,\gamma_0,\eta_0) >

1184: g_{r}(\alpha,\chi(\alpha),\omega(\alpha))$. To that end, we

1185: partitioned $(1/2,1]$ to 10,000 points and for each such point we

1186: searched for values of $\gamma \geq \gamma_0$ and $\eta \geq

1187: \eta_0$ such that this condition holds with a bit of room. (For

1188: $k>4$ we solved~\eqref{eq:defzero}, defining $\gamma_0$ and

1189: $\eta_0$, numerically to 10 digits of accuracy. For the

1190: optimization we exploited convexity to speed up the search.)

1191: Having determined such values, we (implicitly) extended the

1192: functions $\chi,\omega$ to all $(1/2,1]$ by assigning to every

1193: not-chosen point the value at the nearest chosen point. Finally,

1194: we computed a (crude) upper bound on the derivative of $g_r$ with

1195: respect to $\alpha$ in $(1/2,1]$. This bound on the derivative,

1196: along with our room factor, then implied that for every point that

1197: we did not check, the value of $g_r$ was sufficiently close to its

1198: value at the corresponding chosen point to also be dominated by

1199: $g_{r}(1/2,\gamma_0,\eta_0)$.

1200:

1201:

1202:

1203: \section*{Acknowledgements}

1204: We thank Cris Moore for helpful conversations in the early stages

1205: of this work.

1206:

1207: \bibliographystyle{amsalpha}

1208: \bibliography{stoc,extra,theory}

1209:

1210: \newpage

1211:

1212: \appendix

1213:

1214:

1215:

1216: \section{Building up an arsenal}

1217:

1218:

1219: In this section we collect some basic inequalities and identities

1220: that we will use in the proofs of Lemmas \ref{lem:biggerthanhalf},

1221: \ref{lem:decreasing} and \ref{lem:bigalpha}. For readability, in

1222: this Appendix we have replaced $q$ of Definition~\ref{skolio} with

1223: the letter $y$.

1224:

1225: Plugging in the definition of $\eta_0$ from (\ref{eq:defzero})

1226: into the definition of $f$ we get

1227: \begin{align*} \lefteqn{f(\alpha,\gamma_0,\eta_0)} \nonumber \\

1228: &

1229: =\eta_0^{-{y}/{2^{k-1}}}\left\{\left(1-\alpha+\alpha\frac{\gamma_0^2+\gamma_0^{-2}}{2}\right)^k-

1230: \frac{(1-\gamma_0^2)(1+\gamma_0^2)^{k-1}}{2^{k-1}}

1231: \left(\alpha\gamma_0^{-2}+1-\alpha\right)^k+

1232: \frac{\alpha^k(1-\gamma_0^2)^2(1+\gamma_0^2)^{2k-2}}{2^k\gamma_0^{2k}}\right\}.

1233: \end{align*}

1234: For some parts of the ensuing calculations, it will be to

1235: convenient to use the following normalizations of

1236: $f(\alpha,\gamma_0,\eta_0)$ and $g_r(\alpha,\gamma_0,\eta_0)$

1237: denoted as $f_0$ and $g_0$ respectively

1238: \begin{eqnarray}\label{eq:normalization}

1239: f_0(\alpha)=2^{2k}\gamma_0^{2k}\eta_0^{y/2^{k-1}}f(\alpha,\gamma_0,\eta_0)\quad

1240: \mathrm{and}\quad

1241: g_0(\alpha)=2^{2kr}\gamma_0^{2kr}\eta_0^{yr/2^{k-1}}g_r(\alpha,\gamma_0,\eta_0).

1242: \end{eqnarray}

1243: We will also write $\e_0=1-\gamma_0^2$. With this notation, we

1244: have the following formula for $f_0$, which holds for every

1245: $x\in[-1/2,1/2]$

1246: \begin{eqnarray}\label{eq:f0}

1247: f_0\left(\frac{1}{2}+x\right)=[2x{\e_0}^2+(2-{\e_0})^2]^k-2{\e_0}(2-{\e_0})^{k-1}[2-{\e_0}+2x{\e_0}]^k+{\e_0}^2(2-{\e_0})^{2k-2}(1+2x)^k

1248: \enspace ,

1249: \end{eqnarray}

1250: In particular,

1251: \begin{eqnarray}\label{eq:f1/2}

1252: f_0\left(\frac{1}{2}\right)=(2-{\e_0})^{2k}-2{\e_0}(2-{\e_0})^{2k-1}+{\e_0}^2(2-{\e_0})^{2k-2}

1253: =4(1-{\e_0})^2(2-{\e_0})^{2k-2} \enspace .

1254: \end{eqnarray}

1255:

1256: The function $y\mapsto 1-y+y\log y$, defined on $[0,1]$, appears

1257: throughout our analysis. The following inequalities, valid for all

1258: $y\in [0,1]$, will be used

1259: \begin{eqnarray}\label{eq:stupid}

1260: \frac{(1-y)^2}{2}\le 1-y+y\log y\le (1-y)^2.

1261: \end{eqnarray}

1262: The right-hand inequality follows from the estimate $\log y\le

1263: y-1$. The left-hand inequality follows from integrating this

1264: estimate as follows

1265: $$

1266: -1+y-y\log y=\int_y^1\log x dx\le

1267: \int_y^1(1-x)dx=-\frac{(1-y)^2}{2}.

1268: $$

1269:

1270: \medskip

1271: We end this section by providing some estimates for the values of

1272: ${\e_0}$ and $\eta_0$

1273: \begin{fact}\label{fact:etae} For all sufficiently large $k$ ,

1274: \begin{eqnarray}\label{eq:e}

1275: \frac{2(1-y)}{2^{k}-k-1}-\frac{4k(1-y)^2}{2^{2k}}\;\le\;

1276: {\e_0}\;\le \;\frac{2(1-y)}{2^{k}-k-1} \enspace ,

1277: \end{eqnarray}

1278: and

1279: \begin{eqnarray}\label{eq:eta}

1280: \eta_0 \;\le \;

1281: \min\left\{y,y-\frac{(k+1)(1-y)}{2^k-k-1}+\frac{4k(1-y)^2}{2^k}\right\}

1282: \enspace .

1283: \end{eqnarray}

1284: \end{fact}

1285:

1286: \subsection{Proof of Fact \ref{fact:etae}}

1287:

1288: By the first equation in (\ref{eq:defzero}) we have that

1289: $\eta_0=1-{\e_0}(2-{\e_0})^{k-1}$. Plugging this into the second

1290: equation in (\ref{eq:defzero}) we find that

1291: \begin{eqnarray}\label{eq:identityy}

1292: \frac{y}{2^k}=\frac{1-{\e_0}(2-{\e_0})^{k-1}}{(2-{\e_0})^k-{\e_0}(2-{\e_0})^{k-1}}

1293: = \frac{1-{\e_0}(2-{\e_0})^{k-1}}{2(1-{\e_0})(2-{\e_0})^{k-1}}

1294: =\frac{1}{2}\left[1-\sum_{j=1}^{k-1}\frac{1}{(2-{\e_0})^{j}}\right].

1295: \end{eqnarray}

1296: Hence, if we denote

1297: \begin{eqnarray}\label{eq:defpsi}

1298: \psi(t)=\sum_{j=1}^{k-1}\frac{1}{(2-t)^{j}}=\frac{(2-t)^{k-1}-1}{(1-t)(2-t)^{k-1}}

1299: \enspace ,

1300: \end{eqnarray}

1301: then we require that $\psi({\e_0})=1-\frac{y}{2^{k-1}}$. We

1302: collect below some useful properties of $\psi$.

1303:

1304: \begin{lemma}\label{lem:psi} $\psi$ is increasing on $[0,1]$.

1305: Furthermore, for $k$ large enough and every $t\le t\le 1/(2k)$,

1306: \begin{eqnarray}\label{eq:ineqpsi}

1307: 1-\frac{1}{2^{k-1}}+t-\frac{(k+1)t}{2^k}+\frac{t^2}{2}\le

1308: \psi(t)\le  1-\frac{1}{2^{k-1}}+t-\frac{(k+1)t}{2^k}+2t^2

1309: \enspace.

1310: \end{eqnarray}

1311:

1312:

1313: \end{lemma}

1314:

1315: \begin{proof} The fact that $\psi$ is increasing follows

1316: immediately from the first formula in (\ref{eq:defpsi}). To prove

1317: the inequalities in (\ref{eq:ineqpsi}), observe that

1318: $$

1319: \psi(t)=\frac{\left(1-\frac{t}{2}\right)^{k-1}-\frac{1}{2^{k-1}}}{(1-t)\left(1-\frac{t}{2}\right)^{k-1}}=

1320: \frac{1}{1-t}-\frac{1}{2^{k-1}(1-t)\left(1-\frac{t}{2}\right)^{k-1}}

1321: \enspace .

1322: $$

1323: To estimate $\psi$ from below, we use the inequalities $1+a+a^2\le

1324: 1/(1-a)\le 1+a+2a^2$ and $(1-a)^{k-1}\ge1-(k-1)a$, valid for all

1325: $0\le a\le 1/2$, to show that whenever $a\le 1/(2k)$

1326: \begin{eqnarray*}

1327: \psi(t)&\ge&

1328: 1+t+t^2-\frac{1}{2^{k-1}(1-t)\left(1-\frac{(k-1)t}{2}\right)}\\

1329: &\ge&

1330: 1+t^2-\frac{1}{2^{k-1}}(1+t+2t^2)\left(1+\frac{(k-1)t}{2}+2\frac{(k-1)^2t^2}{4}\right)\\

1331: &\ge& 1-\frac{1}{2^{k-1}}+t-\frac{(k+1)t}{2^k}+\frac{t^2}{2}

1332: \enspace,

1333: \end{eqnarray*}

1334: for all $k$ sufficiently large.

1335:

1336: The reverse inequality is just as simple

1337: \begin{eqnarray*}

1338: \psi(t)&\le&

1339: 1+t+2t^2-\frac{1}{2^{k-1}}(1+t)\left(1+\frac{t}{2}\right)^{k-1}\\

1340: &\le&

1341: 1+t+2t^2-\frac{1}{2^{k-1}}(1+t)\left(1+\frac{(k-1)t}{2}\right)\\

1342: &\le& 1-\frac{1}{2^{k-1}}+t-\frac{(k+1)t}{2^k}+2t^2 \enspace.

1343: \end{eqnarray*}

1344: \end{proof}

1345:

1346: We are now in position to conclude the proof of Fact

1347: \ref{fact:etae}. Since $\psi$ is increasing and

1348: $\psi({\e_0})=1-\frac{y}{2^{k-1}}$, the inequalities in

1349: (\ref{eq:e}) will be proved once we show that

1350: \begin{eqnarray}\label{eq:need}

1351: \psi\left(\frac{2(1-y)}{2^{k}-k-1}-\frac{16(1-y)^2}{2^{2k}}\right)\le

1352: 1-\frac{y}{2^{k-1}}\le \psi\left(\frac{2(1-y)}{2^k-k-1}\right)

1353: \enspace.

1354: \end{eqnarray}

1355: To prove the right-hand inequality in (\ref{eq:need}), set

1356: $t=\frac{2(1-y)}{2^k-k-1}$ and observe that for $k$ large enough,

1357: $t\le 1/(2k)$. Hence, by Lemma \ref{lem:psi},

1358: \begin{eqnarray*}

1359: \psi(t)\ge

1360: 1-\frac{1}{2^{k-1}}+t-\frac{(k+1)t}{2^k}=1-\frac{1}{2^{k-1}}+\frac{2(1-y)}{2^k-k-1}-\frac{(k+1)}{2^k}\cdot

1361: \frac{2(1-y)}{2^k-k-1}=1-\frac{y}{2^{k-1}} \enspace.

1362: \end{eqnarray*}

1363:

1364: The left-hand inequality in (\ref{eq:need}) is equally simple. In

1365: this case we apply Lemma \ref{lem:psi} with

1366: $t=\frac{2(1-y)}{2^{k}-k-1}-\frac{16(1-y)^2}{2^{2k}}$ and get that

1367: \begin{eqnarray*}

1368: \psi(t)&\le& 1-\frac{1}{2^{k-1}}+t-\frac{(k+1)t}{2^k}+2t^2\\

1369: &\le&

1370: 1-\frac{1}{2^{k-1}}+\frac{2(1-y)}{2^k-k-1}-\frac{(k+1)}{2^k}\cdot

1371: \frac{2(1-y)}{2^k-k-1}-\frac{16(1-y)^2}{2^{2k}}+2\cdot

1372: \frac{4(1-y)^2}{(2^k-k-1)^2}\\&\le& 1-\frac{y}{2^{k-1}} \enspace,

1373: \end{eqnarray*}

1374: as long as $k$ is sufficiently large.

1375:

1376: \smallskip

1377:

1378: To prove the estimate (\ref{eq:eta}) observe that the function

1379: $s\mapsto s(2-s)^{k-1}$ is increasing on $[0,2/k]$. Since we have

1380: shown that for sufficiently large $k$, ${\e_0}\le

1381: \frac{2(1-y)}{2^{k}-k-1}\le \frac{2}{k}$, the lower bound in

1382: (\ref{eq:e}) yields

1383: \begin{eqnarray*}

1384: \eta_0&=&1-{\e_0}(2-{\e_0})^{k-1}\\

1385: &\le&1-2^{k-1}\left(\frac{2(1-y)}{2^{k}-k-1}-\frac{16(1-y)^2}{2^{2k}}\right)

1386: \left(1-\frac{1-y}{2^{k}-k-1}-\frac{8(1-y)^2}{2^{2k}}

1387: \right)^{k-1}\\

1388: &\le& 1-\left(\frac{2^k(1-y)}{2^k-k-1}-\frac{8(1-y)^2}{2^k}\right)

1389: \left(1-\frac{(k-1)(1-y)}{2^{k}-k-1}-\frac{8(k-1)(1-y)}{2^{2k}}

1390: \right)\\

1391: &<& y-\frac{(k+1)(1-y)}{2^k-k-1}+\frac{4k(1-y)^2}{2^k} \enspace,

1392: \end{eqnarray*}

1393: provided $k$ is large enough. The inequality $\eta_0\le y$ is

1394: simpler. By (\ref{eq:identityy}),

1395: $$

1396: \frac{y}{2^k}=\frac{1-{\e_0}(2-{\e_0})^{k-1}}{2(1-{\e_0})(2-{\e_0})^{k-1}}=\frac{\eta_0}{2^k(1-\e_0)(1-\e_0/2)^{k-1}}\ge

1397: \frac{\eta_0}{2^k} \enspace .

1398: $$

1399:

1400: \section{Proof of Lemma~\ref{lem:biggerthanhalf}}

1401:

1402:

1403: Since the function $\alpha\mapsto

1404: \alpha^\alpha(1-\alpha)^{1-\alpha}$ is symmetric around $1/2$, it

1405: suffices to prove that for every $x\in (0,1/2]$,

1406: \begin{eqnarray}\label{eq:notsym}

1407: f\left(\frac12+x,\gamma,\eta\right)>f\left(\frac12-x,\gamma,\eta\right).

1408: \end{eqnarray}

1409: To this end, fix $x\in [-1/2,1/2]$ and $\gamma,\eta>0$. Denote

1410: $\e=1-\gamma^2$. Plugging this notation and $\alpha=1/2+x$ into

1411: (\ref{eq:deff}), we find that the following identity holds

1412: \begin{eqnarray}\label{eq:binom}

1413: \eta^{y/2^{k-1}}2^{2k}\gamma^{2k}f\left(\frac12+x,\gamma,\eta\right)&=&[2x\e^2+(2-\e)^2]^k-2(1-\eta)[(2-\e)+

1414: 2x\e]^k+(1-\eta)^2(1+2x)^k\nonumber\\

1415: &=&\sum_{j=0}^k\binom{k}{j}2^jx^{j}[\e^{2j}(2-\e)^{2(k-j)}-2(1-\eta)\e^j(2-\e)^{k-j}+(1-\eta)^2]\nonumber\\

1416: &=&\sum_{j=0}^k\binom{k}{j}2^jx^{j}[\e^j(2-\e)^{k-j}-(1-\eta)]^2.

1417: \end{eqnarray}

1418: This shows that we can write

1419: $f(1/2+x,\gamma,\eta)=\sum_{j=0}^ka_jx^j$ for some $a_j\ge 0$,

1420: such that at most one of the $a_j$'s is zero. Since for every

1421: $x>0$ and odd $j$, $x^j-(-x)^j>0$, (\ref{eq:notsym}) follows.

1422:

1423:

1424: \section{Proof of Lemma~\ref{lem:decreasing}}

1425:

1426: In this section we will use the normalization

1427: (\ref{eq:normalization}). To prove the first assertion of Lemma

1428: \ref{lem:decreasing}, our goal is to show that $g_0'(\alpha) < 0$

1429: for $\frac12 < \alpha \le 1-\frac{3\log k}{k}$. Observe that

1430: \begin{eqnarray}\label{eq:derg0}

1431: g'_0(\alpha)=\frac{f_0(\alpha)^{r-1}\Big\{rf'_0(\alpha)+

1432: f_0(\alpha)\big[\log(1-\alpha)-\log\alpha\big]\Big\}}{\alpha^\alpha(1-\alpha)^{1-\alpha}}

1433: \enspace .

1434: \end{eqnarray}

1435: Differentiating (\ref{eq:f0}) at $x=0$ we find that

1436: $$

1437: f_0'\left(\frac12\right)=2k{\e_0}^2(2-{\e_0})^{2k-2}-4{\e_0}^2(2-{\e_0})^{2k-2}+2{\e_0}^2(2-{\e_0})^{2k-2}=0\enspace.

1438: $$

1439: Since, by (\ref{eq:binom}), $f_0(\alpha)>0$ it is enough to show

1440: that the following function is decreasing on

1441: $\left[\frac12,1-\frac{3\log k}{k}\right]$

1442: $$

1443: \psi(\alpha)=rf_0'(\alpha)+f_0(\alpha)[\log (1-\alpha)-\log\alpha]

1444: \enspace .

1445: $$

1446: Now,

1447: $$

1448: \psi'(\alpha)= rf_0''(\alpha)+f_0'(\alpha)[\log

1449: (1-\alpha)-\log\alpha]-f_0(\alpha)\left(\frac{1}{\alpha}+\frac{1}{1-\alpha}\right)

1450: \enspace .

1451: $$

1452: Since for $1/2< \alpha\le 1$, $\log(1-\alpha)< \log \alpha$ and,

1453: by (\ref{eq:binom}), $f_0'>0$ on $(1/2,1]$, it is thus enough to

1454: prove that

1455: $$

1456: rf_0''(\alpha)\le

1457: f_0(\alpha)\left(\frac{1}{\alpha}+\frac{1}{1-\alpha}\right)

1458: \enspace.

1459: $$

1460: Now, $\frac{1}{\alpha}+\frac{1}{1-\alpha}\ge 4$ and, from

1461: (\ref{eq:f1/2}), we get that for $\alpha\ge 1/2$,

1462: $$

1463: f_0(\alpha)\ge

1464: f_0\left(\frac12\right)=4(1-{\e_0})^2(2-{\e_0})^{2k-2}\ge

1465: (2-{\e_0})^{2k-2} \enspace,

1466: $$

1467: where we also used that, by (\ref{eq:e}), ${\e_0}\le 1/2$ for $k$

1468: large enough. Thus, it suffices to prove

1469: \begin{eqnarray}\label{eq:goal}

1470: rf_0''\left(\frac{1}{2}+x\right)\le 4(2-{\e_0})^{2k-2}.

1471: \end{eqnarray}

1472:

1473: Now, using that $x\le\frac12- \frac{3\log k}{k}$, we differentiate

1474: (\ref{eq:f0}) twice to get

1475: \begin{align*}\label{eq:crude}

1476: \lefteqn{f_0''\left(\frac{1}{2}+x\right)} \nonumber\\

1477: &=4k(k-1)\left\{

1478: {\e_0}^4[2x{\e_0}^2+(2-{\e_0})^2]^{k-2}-2{\e_0}^3(2-{\e_0})^{k-1}[2-{\e_0}+2x{\e_0}]^{k-2}+{\e_0}^2(2-{\e_0})^{2k-2}(1+2x)^{k-2}\right\}\nonumber\\

1479: &\le 4k^2\left\{

1480: {\e_0}^4(2-{\e_0})^{2k-4}\left(1+\frac{2x{\e_0}^2}{(2-{\e_0})^2}\right)^{k-2}

1481: +{\e_0}^2(2-{\e_0})^{2k-2}(1+2x)^{k-2}\right\}\nonumber\\

1482: &\le 4k^2\left\{ {\e_0}^4(2-{\e_0})^{2k-4}\left(1+2x\right)^{k-2}

1483: +{\e_0}^2(2-{\e_0})^{2k-2}(1+2x)^{k-2}\right\}\\

1484: &\le 8k^2{\e_0}^2(2-{\e_0})^{2k-2}(1+2x)^k\\ &\le

1485: 8k^2{\e_0}^2(2-{\e_0})^{2k-2}\left[2-\frac{6\log k}{k}\right]^k\\

1486: &\le 8k^2{\e_0}^2(2-{\e_0})^{2k-2}2^k\frac{1}{k^3}\\

1487: &\le

1488: \frac{2^{k+3}}{k}\left(\frac{4(1-y)}{2^k}\right)^2(2-{\e_0})^{2k-2},

1489: \end{align*}

1490: where in the last line we used the fact that for $k$ large enough,

1491: (\ref{eq:e}) implies  ${\e_0}\le {4(1-y)}/{2^k}$.

1492:

1493: Combining this estimate with (\ref{eq:goal}), we see that we must

1494: show that for sufficiently large $k$

1495: $$

1496: \frac{128\log 2}{1-y+y\log y}\cdot \frac{(1-y)^2}{k}\le 4 \enspace

1497: ,

1498: $$

1499: and this is indeed the case by (\ref{eq:stupid}).

1500:

1501: \smallskip

1502:

1503: It remains to show that $g_0''(1/2)<0$. Denoting

1504: $\zeta(\alpha)=\alpha^{-\alpha}(1-\alpha)^{\alpha-1}$, we see from

1505: (\ref{eq:derg0}) that

1506: $g_0'(\alpha)=f_0(\alpha)^{r-1}\psi(\alpha)\zeta(\alpha)$. Since

1507: $f_0(1/2)=0$, $\zeta'(1/2)=0$, and we have just verified that

1508: $\psi'(1/2)<0$, the required result follows.

1509:

1510: \section{Proof of Lemma~\ref{lem:bigalpha}}

1511:

1512: Our goal is to show that for any $0<r\le \ttk$, and $1-\frac{3\log

1513: k}{k}< \alpha\le 1$,

1514: \begin{equation}\label{eq:maingoal}

1515: \left[\frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}\right]^r\le

1516: 2\alpha^\alpha(1-\alpha)^{1-\alpha} \enspace .

1517: \end{equation}

1518:

1519: The following lemma gives an upper bound for the left-hand side of

1520: (\ref{eq:maingoal}).

1521:

1522: \begin{lemma}\label{lem:ratio} For all

1523: sufficiently large $k$,

1524: \begin{eqnarray}\label{eq:final}

1525: \frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}\le

1526: y^{y/2^k}\left[1+\frac{2(1-y)-2(1-\sqrt{y})+(1-\sqrt{y})^2\alpha^k}{2^k}+\frac{60k(1-y)^2}{2^{2k}}\right]

1527: \enspace .

1528: \end{eqnarray}

1529: \end{lemma}

1530:

1531:

1532: \begin{proof}

1533:

1534:

1535: \begin{comment}

1536: To this end, we naturally require a lower bound for

1537: $f(1/2,\gamma_0,\eta_0)$ and an upper bound for

1538: $f(\alpha,\gamma_0,\sqrt{\eta_0})$. By (\ref{eq:f1/2}),

1539: \begin{eqnarray}\label{eq:lowerf1/2}

1540: \eta_0^{y/2^{k-1}}2^{2k}\gamma_0^{2k}f\left(\frac12,\gamma_0,\eta_0\right)&=&4(1-{\e_0})^2(2-{\e_0})^{2k-2}\nonumber\\&=&

1541: 2^{2k}(1-{\e_0})^2\left(1-\frac{{\e_0}}{2}\right)^{2k-2}\nonumber\\

1542: &\ge& 2^{2k}(1-2{\e_0})[1-(k-1){\e_0}]\nonumber\\&\ge&

1543: 2^{2k}[1-(k+1){\e_0}].

1544: \end{eqnarray}

1545: \end{comment}

1546:

1547:

1548: Denote $\e_1=1-(\sqrt{\gamma_0})^2=1-\sqrt{1-\e_0}$. For

1549: $1-\frac{4\log k}{k}< \alpha\le 1$ write $x=\alpha-1/2$.

1550: Analogously to (\ref{eq:binom}) we have we have the following

1551: identity

1552: \begin{eqnarray}\label{eq:identitye}

1553: &&\!\!\!\!\!\!\!\!\!\!\!\!\eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\frac12+x,\sqrt{\gamma_0},\sqrt{\eta_0}\right)\nonumber\\

1554: &=&

1555: [2x{\e_1}^2+(2-{\e_1})^2]^k-2(1-\sqrt{\eta_0})[(2-{\e_1})+2x{\e_1}]^k+(1-\sqrt{\eta_0})^2(1+2x)^k.

1556: \end{eqnarray}

1557: Our first goal is replace $\eta_0$ in the right-hand side of

1558: (\ref{eq:identitye}) by its upper bound from (\ref{eq:eta}). To

1559: this end consider the function

1560: \begin{eqnarray}\label{eq:rho}

1561: \rho(b)=[2x{\e_1}^2+(2-{\e_1})^2]^k-2b[(2-{\e_1})+2x{\e_1}]^k+b^2(1+2x)^k,

1562: \end{eqnarray}

1563: and observe the right-hand side of (\ref{eq:identitye}) equals

1564: $\rho\left(1-\sqrt{\eta_0}\right)$. So, that it is enough to show

1565: that $\rho$ is decreasing on $[0,1]$. Since $\rho$ is convex and

1566: quadratic, this would follow once we show that $\rho'(1)\le 0$.

1567: This is equivalent to $1+2x\le 2-{\e_1}+2x{\e_1}$, which is true

1568: since $x\le 1/2$. Hence

1569: \begin{comment}

1570: Now,

1571: $$

1572: \eta_0^{y/2^k}2^{2k}\gamma_0^{2k}f\left(\frac12+x,\gamma_0,\sqrt{\eta_0}\right)=\rho\left(1-\sqrt{\eta_0}\right).

1573: $$

1574: By Fact \ref{fact:etae}, as long as $k\ge 30$, $1-\sqrt{\eta_0}\ge

1575: 1-\sqrt{y}$, and since $\rho$ is decreasing on $[0,1]$, we can

1576: bound

1577: $\eta_0^{y/2^k}2^{2k}\gamma_0^{2k}f\left(\alpha,\gamma_0,\sqrt{\eta_0}\right)$

1578: from above as follows

1579: \end{comment}

1580: \begin{eqnarray}\label{eq:upperf}

1581: &&\!\!\!\!\!\!\!\!\!\!\!\!\eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0}\right)=

1582: \rho(1-\sqrt{\eta_0})

1583: \nonumber\\

1584: &\le& \rho(1-\sqrt{z})=

1585: [2x{\e_1}^2+(2-{\e_1})^2]^k-2(1-\sqrt{z})[(2-{\e_1})+2x{\e_1}]^k+(1-\sqrt{z})^2(1+2x)^k,

1586: \end{eqnarray}

1587: Where $z$ is the upper bound for $\eta_0$ from (\ref{eq:eta}),

1588: i.e.,

1589: \begin{eqnarray}\label{eq:defz}

1590: z=

1591: \min\left\{y,y-\frac{(k+1)(1-y)}{2^k-k-1}+\frac{4k(1-y)^2}{2^k}\right\}

1592: \enspace .

1593: \end{eqnarray}

1594:

1595: Hence using $\eta_0\le z$ and the identity (\ref{eq:f1/2}) we

1596: bound the ratio in (\ref{eq:maingoal}) as follows

1597: \begin{align}

1598: \lefteqn{\frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}} \nonumber \\

1599: & =\eta_0^{y/2^k}\cdot\gamma_0^k\cdot

1600: \frac{\eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0}\right)}

1601: {\eta_0^{y/2^{k-1}}2^{2k}\gamma_0^{2k}f\left(1/2,\gamma_0,\eta_0\right)}\nonumber\\

1602: &=\frac{\eta_0^{y/2^k}(1-\e_0)^{k/2}}{4(1-{\e_0})^2(2-{\e_0})^{2k-2}}\cdot

1603: \eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0}\right)\nonumber\\

1604: &\le

1605: \frac{z^{y/2^k}(1-\e_0)^{k/2}}{4(1-{\e_0})^2(2-{\e_0})^{2k-2}}\cdot

1606: \left\{[2x{\e_1}^2+(2-{\e_1})^2]^k-2(1-\sqrt{z})[(2-{\e_1})+2x{\e_1}]^k+(1-\sqrt{z})^2(1+2x)^k\right\}\nonumber\\

1607: &= z^{y/2^k}\left[1+\frac{{\e_0}}{2

1608: (1-{\e_0})}\right]^2\left[\frac{\sqrt{1-\e_0}}{(1-\e_0/2)^2}\right]^k\cdot\nonumber\\

1609: &\phantom{\le}

1610: \left\{\left[\frac{x\e_1^2}{4}+\left(1-\frac{\e_1}{2}\right)^2\right]^k-\frac{2(1-\sqrt{z})

1611: [1-{\e_1}(1-\alpha)]^k}{2^k}+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}\right\}

1612: \enspace . \label{eq:estimate}

1613: \end{align}

1614: %

1615: %\begin{eqnarray}\label{eq:estimate}

1616: %&&\!\!\!\!\!\!\!\!\!\!\!\!\frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}

1617: %=\eta_0^{y/2^k}\cdot\gamma_0^k\cdot

1618: %\frac{\eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0}\right)}

1619: %{\eta_0^{y/2^{k-1}}2^{2k}\gamma_0^{2k}f\left(1/2,\gamma_0,\eta_0\right)}\nonumber\\

1620: %&=&\frac{\eta_0^{y/2^k}(1-\e_0)^{k/2}}{4(1-{\e_0})^2(2-{\e_0})^{2k-2}}\cdot

1621: %\eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0}\right)\nonumber\\

1622: %&\le&

1623: %\frac{z^{y/2^k}(1-\e_0)^{k/2}}{4(1-{\e_0})^2(2-{\e_0})^{2k-2}}\cdot

1624: %\left\{[2x{\e_1}^2+(2-{\e_1})^2]^k-2(1-\sqrt{z})[(2-{\e_1})+2x{\e_1}]^k+(1-\sqrt{z})^2(1+2x)^k\right\}\nonumber\\

1625: %&=& z^{y/2^k}\left[1+\frac{{\e_0}}{2

1626: %(1-{\e_0})}\right]^2\left[\frac{\sqrt{1-\e_0}}{(1-\e_0/2)^2}\right]^k\cdot\nonumber\\

1627: %&\phantom{\le}&

1628: %\left\{\left[\frac{x\e_1^2}{4}+\left(1-\frac{\e_1}{2}\right)^2\right]^k-\frac{2(1-\sqrt{z})

1629: %[1-{\e_1}(1-\alpha)]^k}{2^k}+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}\right\}.

1630: %\end{eqnarray}

1631:

1632: We will bound the various terms in (\ref{eq:estimate}) separately.

1633: First of all, using (\ref{eq:defz}) and the inequality $e^a\le

1634: 1+a+a^2$, which is valid for $0\le a\le 1$, we get that

1635: \begin{eqnarray}\label{eq:zpart}

1636: z^{y/2^k}&\le&

1637: y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{y(2^k-k-1)}+\frac{4k(1-y)^2}{y2^k}\right]^{y/2^k}\nonumber\\

1638: &\le&

1639: y^{y/2^k}\exp\left[-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{4k(1-y)^2}{2^{2k}}\right]\nonumber\\

1640: &\le&y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{8k(1-y)^2}{2^{2k}}\right],

1641: \end{eqnarray}

1642: as long as $k$ is large enough.

1643:

1644: Next, using the inequality $1/(1-a)\le 1+2a$, valid for $0\le a\le

1645: 1/2$ we get

1646: \begin{eqnarray}\label{eq:thesquare}

1647: \left[1+\frac{{\e_0}}{2

1648: (1-{\e_0})}\right]^2\le\left[1+\frac{{\e_0}}{2}(1+2{\e_0})\right]^2\le

1649: 1+\e_0+5\e_0^2.

1650: \end{eqnarray}

1651: Next, using the inequality $\sqrt{1-x}\le 1-x/2$, the inequality

1652: $1/(1-a)\le 1+a+2a^2$, valid for $0\le a\le 1/2$, and the

1653: inequality $(1+a)^k\le 1+ka+k^2a^2/2$, which is valid for all

1654: $a\le 1/(4k^2)$, we get that since for $k$ large enough $\e_0\le

1655: 1/(4k^2)$,

1656: \begin{eqnarray}\label{eq:theroot}

1657: \left[\frac{\sqrt{1-\e_0}}{(1-\e_0/2)^2}\right]^k\le

1658: \frac{1}{(1-\e_0/2)^k}\le

1659: \left(1+\frac{\e_0}{2}+\frac{\e_0^2}{2}\right)^k\le

1660: 1+\frac{k\e_0}{2}+\frac{k^2\e_0^2}{8}+k\e_0^2.

1661: \end{eqnarray}

1662: Hence, for $k$ large enough

1663: \begin{eqnarray}\label{eq:theproduct}

1664: \left[1+\frac{{\e_0}}{2

1665: (1-{\e_0})}\right]^2\left[\frac{\sqrt{1-\e_0}}{(1-\e_0/2)^2}\right]^k\le

1666: 1+\left(1+\frac{k}{2}\right)\e_0+\frac{k^2\e_0^2}{4}+k\e_0^2.

1667: \end{eqnarray}

1668:

1669: Next, using the inequality $x/2\le 1-\sqrt{1-x}\le x/2+x^2$, which

1670: is valid for $0\le x\le 1/2$, we get that

1671: \begin{eqnarray}\label{eq:ineqe1}

1672: \frac{\e_0}{2}\le\e_1=1-\sqrt{1-\e_0}\le \frac{\e_0}{2}+\e_0^2\le

1673: \e_0.

1674: \end{eqnarray}

1675: Observe that since $0\le x\le 1/2$ and $\e_1<1/2$, the function

1676: $\e_1\mapsto \frac{x\e_1^2}{4}+\left(1-\frac{\e_1}{2}\right)^2$ is

1677: decreasing in $\e_1$. Hence, the lower bound in (\ref{eq:ineqe1}),

1678: together with another application of the inequality $(1+a)^k\le

1679: 1+ka+k^2a^2/2$, valid for all $a\le 1/(4k^2)$, implies that for

1680: sufficiently large $k$

1681: \begin{eqnarray}\label{eq:firstbracket}

1682: \left[\frac{x\e_1^2}{4}+\left(1-\frac{\e_1}{2}\right)^2\right]^k&\le&

1683: \left[\frac{\e_0^2}{32}+\left(1-\frac{\e_0}{4}\right)^2\right]^k\le

1684: \left[1-\frac{\e_0}{2}+\frac{\e_0^2}{10}\right]^k\le

1685: 1-\frac{k\e_0}{2}+\frac{k^2\e_0^2}{8}+k\e_0^2 \enspace .

1686: \end{eqnarray}

1687:

1688: The second term in the brackets of (\ref{eq:estimate}) appears

1689: with a minus sign, so we bound it from below, using the fact that

1690: $z\le y$ and $\e_1\le\e_0$.

1691: \begin{eqnarray}\label{eq:secondbracket}

1692: \frac{2(1-\sqrt{z}) [1-{\e_1}(1-\alpha)]^k}{2^k}&\ge&\nonumber

1693: \frac{2(1-\sqrt{z})}{2^k}-\frac{2(1-\sqrt{z})k\e_1(1-\alpha)}{2^k}\\&\ge&

1694: \frac{2(1-\sqrt{z})}{2^k}-\frac{2(1-\sqrt{y})k\e_0}{2^k}\nonumber\\

1695: &\ge&

1696: \frac{2(1-\sqrt{z})}{2^k}-\frac{2(1-y)k}{2^k}\cdot\frac{2(1-y)}{2^{k}-k-1}\nonumber\\

1697: &\ge& \frac{2(1-\sqrt{z})}{2^k}-\frac{8k(1-y)^2}{2^{2k}},

1698: \end{eqnarray}

1699: where we have used the upper bound in (\ref{eq:e}).

1700:

1701: Combining (\ref{eq:estimate}), (\ref{eq:zpart}),

1702: (\ref{eq:theproduct}), (\ref{eq:firstbracket}) and

1703: (\ref{eq:secondbracket}) we get

1704:

1705:

1706: \begin{eqnarray}\label{eq:almost}

1707: &&\!\!\!\!\!\!\!\!\!\!\!\!\!\!\frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}\nonumber\\

1708: &\le&y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{8k(1-y)^2}{2^{2k}}\right]

1709: \left[1+\left(1+\frac{k}{2}\right)\e_0+\frac{k^2\e_0^2}{8}+k\e_0^2\right]\cdot\nonumber\\

1710: &\phantom{\le}&\left\{1-\frac{k\e_0}{2}+\frac{k^2\e_0^2}{8}+k\e_0^2-

1711: \frac{2(1-\sqrt{z})}{2^k}+\frac{8k(1-y)^2}{2^{2k}}+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}\right\}\nonumber\\

1712: &\le&

1713: y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{8k(1-y)^2}{2^{2k}}\right]\cdot\nonumber\\

1714: &\phantom{\le}&\left\{1+\e_0+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}-\frac{2(1-\sqrt{z})}{2^k}

1715: +\frac{8k(1-y)^2}{2^{2k}}+\frac{k(1-\sqrt{z})^2\e_0}{2^k}+\frac{8k^2(1-y)^2\e_0}{2^{2k}}+k^3\e_0^3\right\}\nonumber\\

1716: &\le&y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{8k(1-y)^2}{2^{2k}}\right]\left\{1+\frac{2(1-y)}{2^k-k-1}+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}-\frac{2(1-\sqrt{z})}{2^k}+

1717: \frac{30k(1-y)^2}{2^{2k}}\right\}\nonumber\\

1718: &\le&y^{y/2^k}\left\{1+\frac{2(1-y)}{2^k-k-1}-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+

1719: \frac{(1-\sqrt{z})^2\alpha^k}{2^k}-\frac{2(1-\sqrt{z})}{2^k}+

1720: \frac{50k(1-y)^2}{2^{2k}}\right\},

1721: \end{eqnarray}

1722: where we have used the upper bound in (\ref{eq:e}),

1723: (\ref{eq:defz}) and the fact that $k$ is large enough.

1724:

1725: Now, we claim that for every $\alpha\in [0,1]$,

1726: \begin{eqnarray}\label{eq:trick}

1727: (1-\sqrt{z})^2\alpha^k -2(1-\sqrt{z})\le (1-\sqrt{y})^2\alpha^k

1728: -2(1-\sqrt{y})+z-y.

1729: \end{eqnarray}

1730: Indeed, since by (\ref{eq:defz}), $z\le y$, the left-hand side

1731: minus the right-hand side of (\ref{eq:trick}) is an increasing

1732: function, which vanishes at $1$. Moreover, by (\ref{eq:defz}),

1733: $$

1734: z-y\le -\frac{(k+1)(1-y)}{2^k-k-1}+\frac{4k(1-y)^2}{2^k},

1735: $$

1736: so that (\ref{eq:trick}) becomes

1737: $$ (1-\sqrt{z})^2\alpha^k

1738: -2(1-\sqrt{z})\le (1-\sqrt{y})^2\alpha^k

1739: -2(1-\sqrt{y})-\frac{(k+1)(1-y)}{2^k-k-1}+\frac{4k(1-y)^2}{2^k}.

1740: $$

1741: Plugging this into (\ref{eq:almost}) we get

1742: \begin{eqnarray}\label{eq:almostalmost}

1743: &&\!\!\!\!\!\!\!\!\!\!\!\!\!\!\frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}\le

1744: y^{y/2^k}

1745: \left\{1+\frac{2(1-y)}{2^k-k-1}-\frac{2(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{(1-\sqrt{y})^2\alpha^k}{2^k}-

1746: \frac{2(1-\sqrt{y})}{2^k}+\frac{60k(1-y)^2}{2^{2k}}\right\}\nonumber\\

1747: &=& y^{y/2^k}

1748: \left\{1+\frac{2(1-y)+(1-\sqrt{y})^2\alpha^k-2(1-\sqrt{y})}{2^k}+\frac{60k(1-y)^2}{2^{2k}}\right\}.

1749: \end{eqnarray}

1750:

1751: This concludes the proof of Lemma \ref{lem:ratio}.\bigskip

1752: \end{proof}

1753:

1754: \mbox{}

1755:

1756:  Denote $h(\alpha)=-\alpha\log

1757: \alpha-(1-\alpha)\log(1-\alpha)$. Taking logarithms of

1758: (\ref{eq:maingoal}), and using (\ref{eq:final}) and the inequality

1759: $\log (1+x)\le x$, we see that our goal is reduced to showing that

1760: for all $r\le \ttk$,

1761: \begin{eqnarray}\label{eq:newgoal}

1762: \frac{r}{2^k}&&\!\!\!\!\!\!\!\!\!\!\!\left[y\log

1763: y+2(1-y)-2(1-\sqrt{y})+(1-\sqrt{y})^2\alpha^k+\frac{60k(1-y)^2}{2^{k}}\right]

1764: \le \log 2-h(\alpha) \enspace .

1765: \end{eqnarray}

1766:

1767: For simplicity denote:

1768: \begin{eqnarray}\label{eq:abc}

1769: \left\{\begin{array}{ll} A=(1-\sqrt{y})^2 \\

1770: B=y\log

1771: y+2(1-y)-2(1-\sqrt{y})+\frac{60k(1-y)^2}{2^{k}}\end{array}\right.

1772: \end{eqnarray}

1773: With this notation (\ref{eq:newgoal}) becomes

1774: $$

1775: \frac{r}{2^k}\le \frac{\log2-h(\alpha)}{A\alpha^k+B}\equiv

1776: M(\alpha) \enspace ,

1777: $$

1778: and this should hold for all $\alpha \ge 1-\frac{3\log k}{k}$. We

1779: are therefore interested in the minimal value of $M$ on the

1780: interval $\left[1-\frac{3\log k}{k},1\right]$. The derivative of

1781: $M$ is

1782: \begin{eqnarray}\label{eq:derM}

1783: M'(\alpha)=\frac{(A\alpha^k+B)\cdot[\log

1784: \alpha-\log(1-\alpha)]-kA\alpha^{k-1}[\log2-h(\alpha)]}{(A\alpha^k+B)^2}.

1785: \end{eqnarray}

1786: In particular, $M'(1)=\infty$, so that the minimum of $M$ cannot

1787: occur at $\alpha=1$. We rule out the possibility of the minimum

1788: being at $1-\frac{3\log k}{k}$ in the following claim.

1789: \begin{claim}\label{claim:notleft} If $k$ is large enough

1790: then for every $1-\frac{3\log k}{k}\le \alpha\le 2^{-1/k}$,

1791: $M\left(\alpha\right)>M(1)$.

1792: \end{claim}

1793: \begin{proof} Observe that for every $\beta\in [0,1]$,

1794: \begin{eqnarray}\label{eq:ineqh}

1795: h(\beta)&=&\beta\log(1/\beta)-(1-\beta)\log(1-\beta)\nonumber\\&\le&

1796: \beta\left(\frac{1}{\beta}-1\right)-(1-\beta)\log(1-\beta)=1-\beta-(1-\beta)\log(1-\beta)

1797: \enspace .

1798: \end{eqnarray}

1799: Hence, since $\alpha\ge 1-\frac{3\log k}{k}$,

1800: \begin{eqnarray}\label{eq:hleft}

1801: h(\alpha)\le h\left(1-\frac{3\log k}{k}\right)\le \frac{3\log

1802: k}{k}+\frac{3\log k}{k}\log\left(\frac{k}{3\log k}\right)\le

1803: \frac{4(\log k)^2}{k} \enspace .

1804: \end{eqnarray}

1805: Using the fact that $\alpha\le 2^{-1/k}$, it follows that

1806: \begin{eqnarray*}

1807: M\left(\alpha\right)\ge \frac{\log 2-\frac{5(\log

1808: k)^2}{k}}{A\left(2^{-1/k}\right)^k+B}\ge \frac{\log

1809: 2\left(1-\frac{10(\log k)^2}{k}\right)}{\frac{A}{2}+B} \enspace .

1810: \end{eqnarray*}

1811: On the other hand, $M(1)=\log 2/(A+B)$, so that it is enough to

1812: show that

1813: $$

1814: 1-\frac{10(\log

1815: k)^2}{k}\ge\frac{\frac{A}{2}+B}{A+B}=1-\frac{\frac{1}{2}A}{A+B}

1816: \enspace ,

1817: $$

1818: which is equivalent to

1819: \begin{eqnarray}\label{eq:stupidgoal}

1820: \frac{A}{A+B}\ge \frac{20(\log k)^2}{k} \enspace .

1821: \end{eqnarray}

1822: Observe that since $1-\sqrt{y}\ge (1-y)/2$, $A\ge (1-y)^2/4$. On

1823: the other hand, using (\ref{eq:stupid}) we get that for

1824: sufficiently large $k$,

1825: \begin{eqnarray*}

1826: A+B=1-y+y\log y+\frac{60k(1-y)^2}{2^{k}} \le

1827: (1-y)^2+\frac{60k(1-y)^2}{2^k}\le 2(1-y)^2 \enspace ,

1828: \end{eqnarray*}

1829: It follows that the left-hand side in (\ref{eq:stupidgoal}) is at

1830: least $1/8$, so that (\ref{eq:stupidgoal}) provided $k$ large

1831: enough.

1832: \end{proof}

1833:

1834: By Claim \ref{claim:notleft} it remains to bound $M(\alpha)$ from

1835: below when $\alpha>2^{-1/k}$ and $M'(\alpha)=0$. In this case, by

1836: (\ref{eq:derM}),

1837: \begin{eqnarray}\label{eq:condition}

1838: -\log(1-\alpha)= -\log\alpha+

1839: \frac{kA\alpha^{k-1}}{A\alpha^k+B}[\log 2-h(\alpha)] \enspace .

1840: \end{eqnarray}

1841: From the lower bound $\alpha\ge 2^{-1/k}$ and (\ref{eq:ineqh}) it

1842: follows that $h(\alpha)\le \frac{4\log k}{k}$. Hence

1843: (\ref{eq:condition}), together with our assumption that $k$ is

1844: large, implies

1845: \begin{eqnarray}\label{eq:alphaisbig}

1846: -\log(1-\alpha)>\frac{kA/2}{A+B}\left[\log2-\frac{4\log

1847: k}{k}\right]\ge \frac{k}{5}\cdot\frac{A}{A+B} \enspace .

1848: \end{eqnarray}

1849: As we have seen in the proof of Claim \ref{claim:notleft}, $A\ge

1850: (1-y)^2/4$ and $A+B\le 2(1-y)^2$. Plugging these inequalities into

1851: (\ref{eq:alphaisbig}), we get that $-\log(1-\alpha)>k/40$, i.e.

1852: $\alpha\ge 1-e^{-k/40}$. Plugging this into (\ref{eq:condition})

1853: once more, we get that

1854: \begin{eqnarray*}

1855: -\log(1-\alpha)\ge \frac{kA[1-2k/(e^{k/40})]}{A+B}[\log

1856: 2-2/(e^{k/40})]\ge \frac{kA\log

1857: 2}{A+B}\left(1-\frac{6}{e^{k/40}}\right) \enspace .

1858: \end{eqnarray*}

1859: Finally, we have shown that

1860: \begin{eqnarray}\label{eq:maximizer}\alpha\ge 1-\exp\left[-\frac{kA\log

1861: 2}{A+B}\left(1-\frac{6}{e^{k/40}}\right)\right] \enspace .

1862: \end{eqnarray}

1863:

1864: We are now ready to bound $M(\alpha)$ from below. We start by

1865: recalling that

1866: \begin{eqnarray*}

1867: A+B= 1-y+y\log y+\frac{60k(1-y)^2}{2^{k}} \equiv 1-y+y\log y+P

1868: \enspace .

1869: \end{eqnarray*}

1870: Using the inequality $1/(1+x)\ge 1-x$, we get

1871: \begin{eqnarray}\label{eq:A+C}

1872: \frac{1}{A+B}\ge \frac{1}{1-y+y\log y}\left[1-\frac{P}{1-y+y\log

1873: y}\right] \ge\frac{1}{1-y+y\log y}\left[1-\frac{120k}{2^k}\right],

1874: \end{eqnarray}

1875: where the last inequality used (\ref{eq:stupid}). Of course, we

1876: also know that $A+B\ge 1-y+y\log y$.

1877:

1878: Now, using (\ref{eq:ineqh}), (\ref{eq:maximizer}), (\ref{eq:A+C}),

1879: and the fact that $(1-\sqrt{y})^2/(1-y+y\log y)\le 1$, we get

1880: \begin{eqnarray*}

1881: h(\alpha)&\le& \frac{2kA\log

1882: 2}{A+B}\left(1-\frac{6}{e^{k/40}}\right)\exp\left[-\frac{kA\log

1883: 2}{A+B}\left(1-\frac{6}{e^{k/40}}\right)\right]\\

1884: &\le& 2k\log2\exp\left[-\frac{k(1-\sqrt{y})^2\log 2}{1-y+y\log

1885: y}\left(1-\frac{120k}{2^k}\right)\left(1-\frac{6}{e^{k/40}}\right)\right]\\

1886: &\le&2k\log2\exp\left[-\frac{k(1-\sqrt{y})^2\log 2}{1-y+y\log

1887: y}\left(1-\frac{120k}{2^k}-\frac{6}{e^{k/40}}\right)\right]\\

1888: &\le& 10k\log 2 \cdot 2^{-k\varphi(y)},

1889: \end{eqnarray*}

1890: where  as in Proposition \ref{prop:main},

1891: $\varphi(y)=\frac{(1-\sqrt{y})^2}{1-y+y\log y}$.

1892:

1893: \smallskip

1894: So, using (\ref{eq:A+C}), we get

1895: \begin{eqnarray*}

1896: M(\alpha)&=&\frac{\log2-h(\alpha)}{A\alpha^k+B}\\

1897: &\ge& \frac{\log 2}{A+B}(1-10k2^{-k\varphi(y)})\\

1898: &\ge&\frac{\log 2}{1-y+y\log

1899: y}\left[1-\frac{120k}{2^k}\right](1-10k2^{-k\varphi(y)})\\

1900: &\ge&\frac{\log 2}{1-y+y\log

1901: y}\left[1-\frac{120k}{2^k}-10k2^{-k\varphi(y)}\right]\\

1902: &\ge& \frac{\log 2}{1-y+y\log

1903: y}\left[1-20k2^{-k\varphi(y)}\right],

1904: \end{eqnarray*}

1905: where we have used the fact that $\varphi(y)\ge 1/2$ and that $k$

1906: is sufficiently large.

1907:

1908: \medskip

1909:

1910: This concludes the proof of Lemma \ref{lem:bigalpha}.

1911:

1912: \end{document}

1913:

1914:

1915:

1916:

1917:

1918:

1919: \subsection{Proof of Lemma \ref{lem:ratio}}.

1920:

1921:

1922: \begin{comment}

1923: To this end, we naturally require a lower bound for

1924: $f(1/2,\gamma_0,\eta_0)$ and an upper bound for

1925: $f(\alpha,\gamma_0,\sqrt{\eta_0})$. By (\ref{eq:f1/2}),

1926: \begin{eqnarray}\label{eq:lowerf1/2}

1927: \eta_0^{y/2^{k-1}}2^{2k}\gamma_0^{2k}f\left(\frac12,\gamma_0,\eta_0\right)&=&4(1-{\e_0})^2(2-{\e_0})^{2k-2}\nonumber\\&=&

1928: 2^{2k}(1-{\e_0})^2\left(1-\frac{{\e_0}}{2}\right)^{2k-2}\nonumber\\

1929: &\ge& 2^{2k}(1-2{\e_0})[1-(k-1){\e_0}]\nonumber\\&\ge&

1930: 2^{2k}[1-(k+1){\e_0}].

1931: \end{eqnarray}

1932: \end{comment}

1933:

1934:

1935: Denote $\e_1=1-(\sqrt{\gamma_0})^2=1-\sqrt{1-\e_0}$. For

1936: $1-\frac{4\log k}{k}< \alpha\le 1$ write $x=\alpha-1/2$.

1937: Analogously to (\ref{eq:f0}) we have we have the following

1938: identity

1939: \begin{eqnarray}\label{eq:identitye}

1940: &&\!\!\!\!\!\!\!\!\!\!\!\!\eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\frac12+x,\sqrt{\gamma_0},\sqrt{\eta_0}\right)=

1941: \nonumber\\

1942: &=&

1943: [2x{\e_1}^2+(2-{\e_1})^2]^k-2(1-\sqrt{\eta_0})[(2-{\e_1})+2x{\e_1}]^k+(1-\sqrt{\eta_0})^2(1+2x)^k.

1944: \end{eqnarray}

1945: Our first goal is replace $\eta_0$ in the right-hand side of

1946: (\ref{eq:identitye}) by its upper bound from (\ref{eq:eta}). To

1947: this end consider the function

1948: \begin{eqnarray}\label{eq:rho}

1949: \rho(b)=[2x{\e_1}^2+(2-{\e_1})^2]^k-2b[(2-{\e_1})+2x{\e_1}]^k+b^2(1+2x)^k,

1950: \end{eqnarray}

1951: and observe the right-hand side of (\ref{eq:identitye}) equals

1952: $\rho\left(1-\sqrt{\eta_0}\right)$. So, that it is enough to show

1953: that $\rho$ is decreasing on $[0,1]$. Since $\rho$ is convex and

1954: quadratic, this would follow once we show that $\rho'(1)\le 0$.

1955: This is equivalent to $1+2x\le 2-{\e_1}+2x{\e_1}$, which is true

1956: since $x\le 1/2$.

1957:

1958: \begin{comment}

1959: Now,

1960: $$

1961: \eta_0^{y/2^k}2^{2k}\gamma_0^{2k}f\left(\frac12+x,\gamma_0,\sqrt{\eta_0}\right)=\rho\left(1-\sqrt{\eta_0}\right).

1962: $$

1963: By Fact \ref{fact:etae}, as long as $k\ge 30$, $1-\sqrt{\eta_0}\ge

1964: 1-\sqrt{y}$, and since $\rho$ is decreasing on $[0,1]$, we can

1965: bound

1966: $\eta_0^{y/2^k}2^{2k}\gamma_0^{2k}f\left(\alpha,\gamma_0,\sqrt{\eta_0}\right)$

1967: from above as follows

1968: \end{comment}

1969: \begin{eqnarray}\label{eq:upperf}

1970: &&\!\!\!\!\!\!\!\!\!\!\!\!\eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0}\right)\le

1971: \nonumber\\

1972: &\le&

1973: [2x{\e_1}^2+(2-{\e_1})^2]^k-2(1-\sqrt{z})[(2-{\e_1})+2x{\e_1}]^k+(1-\sqrt{z})^2(1+2x)^k,

1974: \end{eqnarray}

1975: Where $z$ is the upper bound for $\eta_0$ from (\ref{eq:eta}),

1976: i.e.,

1977: \begin{eqnarray}\label{eq:defz}

1978: z=

1979: \min\left\{y,y-\frac{(k+1)(1-y)}{2^k-k-1}+\frac{4k(1-y)^2}{2^k}\right\}.

1980: \end{eqnarray}

1981:

1982:  Hence using $\eta_0\le z$ and the identity (\ref{eq:f1/2})

1983: we bound the ratio in (\ref{eq:maingoal}) as follows

1984: \begin{eqnarray}\label{eq:estimate}

1985: \frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}&=&\nonumber\\

1986: &=&\eta_0^{y/2^k}\cdot\gamma_0^k\cdot

1987: \frac{\eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0}\right)}

1988: {\eta_0^{y/2^{k-1}}2^{2k}\gamma_0^{2k}f\left(1/2,\gamma_0,\eta_0\right)}\nonumber\\

1989: &=&\frac{\eta_0^{y/2^k}(1-\e_0)^{k/2}}{4(1-{\e_0})^2(2-{\e_0})^{2k-2}}\cdot

1990: \eta_0^{y/2^k}2^{2k}\gamma_0^{k}f\left(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0}\right)\nonumber\\

1991: &\le& \frac{z^{y/2^k}(1-\e_0)^{k/2}}{4(1-{\e_0})^2(2-{\e_0})^{2k-2}}\cdot\nonumber\\

1992: &\phantom{\le}&

1993: \left\{[2x{\e_1}^2+(2-{\e_1})^2]^k-2(1-\sqrt{z})[(2-{\e_1})+2x{\e_1}]^k+(1-\sqrt{z})^2(1+2x)^k\right\}\nonumber\\

1994: &=& z^{y/2^k}\left[1+\frac{{\e_0}}{2

1995: (1-{\e_0})}\right]^2\left[\frac{\sqrt{1-\e_0}}{(1-\e_0/2)^2}\right]^k\cdot\nonumber\\

1996: &\phantom{\le}&

1997: \left\{\left[\frac{x\e_1^2}{4}+\left(1-\frac{\e_1}{2}\right)^2\right]^k-\frac{2(1-\sqrt{z})

1998: [1-{\e_1}(1-\alpha)]^k}{2^k}+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}\right\}.

1999: \end{eqnarray}

2000:

2001: We will bound the various terms in (\ref{eq:estimate}) separately.

2002: First of all, using (\ref{eq:defz}) and the inequality $e^a\le

2003: 1+a+a^2$, which is valid foe $0\le a\le 1$, we get

2004: \begin{eqnarray}\label{eq:zpart}

2005: z^{y/2^k}&\le&

2006: y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{y(2^k-k-1)}+\frac{4k(1-y)^2}{y2^k}\right]^{y/2^k}\nonumber\\

2007: &\le&

2008: y^{y/2^k}\exp\left[-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{4k(1-y)^2}{2^{2k}}\right]\nonumber\\

2009: &\le&y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{8k(1-y)^2}{2^{2k}}\right],

2010: \end{eqnarray}

2011: where we have also used the fact that $k\ge 20000$.

2012:

2013: Next, using the inequality $1/(1-a)\le 1+2a$, valid for $0\le a\le

2014: 1/2$ we get

2015: \begin{eqnarray}\label{eq:thesquare}

2016: \left[1+\frac{{\e_0}}{2

2017: (1-{\e_0})}\right]^2\le\left[1+\frac{{\e_0}}{2}(1+2{\e_0})\right]^2\le

2018: 1+\e_0+5\e_0^2.

2019: \end{eqnarray}

2020: Next, using the inequality $\sqrt{1-x}\le 1-x/2$, and the

2021: inequality $1/(1-a)\le 1+a+2a^2$, valid for $0\le a\le 1/2$, we

2022: get that since $\e_0\le 1/k$,

2023: \begin{eqnarray}\label{eq:theroot}

2024: \left[\frac{\sqrt{1-\e_0}}{(1-\e_0/2)^2}\right]^k\le

2025: \frac{1}{(1-\e_0/2)^k}\le \frac{1}{1-k\e_0/2}\le

2026: 1+\frac{k\e_0}{2}+\frac{k^2\e_0^2}{2}.

2027: \end{eqnarray}

2028: Hence, since $k\ge 20000$:

2029: \begin{eqnarray}\label{eq:theproduct}

2030: \left[1+\frac{{\e_0}}{2

2031: (1-{\e_0})}\right]^2\left[\frac{\sqrt{1-\e_0}}{(1-\e_0/2)^2}\right]^k\le

2032: 1+(1+k/2)\e_0+2\e_0^2.

2033: \end{eqnarray}

2034:

2035: Next, using the inequality $x/2\le 1-\sqrt{1-x}\le x/2+x^2$, which

2036: is valid for $0\le x\le 1/2$, we get that:

2037: \begin{eqnarray}\label{eq:ineqe1}

2038: \frac{\e_0}{2}\le\e_1=1-\sqrt{1-\e_0}\le \frac{\e_0}{2}+\e_0^2\le

2039: \e_0.

2040: \end{eqnarray}

2041: Observe that since $0\le x\le 1/2$ and $\e_1<1/2$, the function

2042: $\e_1\mapsto \frac{x\e_1^2}{4}+\left(1-\frac{\e_1}{2}\right)^2$ is

2043: decreasing in $\e_1$. Hence, the lower bound in (\ref{eq:ineqe1}),

2044: together with the inequality $(1+a)^k\le 1+ka+k^2a^2/2$, which is

2045: valid for all $a\le 1/(4k^2)$, implies that since $k\ge 20000$:

2046: \begin{eqnarray}\label{eq:firstbracket}

2047: \left[\frac{x\e_1^2}{4}+\left(1-\frac{\e_1}{2}\right)^2\right]^k&\le&

2048: \left[\frac{\e_0^2}{32}+\left(1-\frac{\e_0}{4}\right)^2\right]^k\le

2049: \left[1-\frac{\e_0}{2}+\frac{\e_0^2}{10}\right]^k\le

2050: 1-\frac{k\e_0}{2}+\frac{k^2\e_0^2}{4}.

2051: \end{eqnarray}

2052:

2053: The second term in the brackets of (\ref{eq:estimate}) appears

2054: with a minus sign, so we bound it from below, using the fact that

2055: $z\le y$ and $\e_1\le\e_0$.

2056: \begin{eqnarray}\label{eq:secondbracket}

2057: \frac{2(1-\sqrt{z}) [1-{\e_1}(1-\alpha)]^k}{2^k}&\ge&\nonumber

2058: \frac{2(1-\sqrt{z})}{2^k}-\frac{2(1-\sqrt{z})k\e_1(1-\alpha)}{2^k}\\&\ge&

2059: \frac{2(1-\sqrt{z})}{2^k}-\frac{2(1-\sqrt{y})k\e_0}{2^k}\nonumber\\

2060: &\ge&

2061: \frac{2(1-\sqrt{z})}{2^k}-\frac{2(1-y)k}{2^k}\cdot\frac{2(1-y)}{2^{k}-k-1}\nonumber\\

2062: &\ge& \frac{2(1-\sqrt{z})}{2^k}-\frac{8k(1-y)^2}{2^{2k}},

2063: \end{eqnarray}

2064: where we have used the upper bound in (\ref{eq:e}).

2065:

2066: Combining (\ref{eq:estimate}), (\ref{eq:zpart}),

2067: (\ref{eq:theproduct}), (\ref{eq:firstbracket}) and

2068: (\ref{eq:secondbracket}) we get that:

2069:

2070:

2071: \begin{eqnarray}\label{eq:almost}

2072: &&\!\!\!\!\!\!\!\!\!\!\!\!\!\!\frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}\le\nonumber\\

2073: &\le&y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{8k(1-y)^2}{2^{2k}}\right]

2074: \left[1+\left(1+\frac{k}{2}\right)\e_0+2\e_0^2\right]\cdot\nonumber\\

2075: &\phantom{\le}&\left\{1-\frac{k\e_0}{2}+\frac{k^2\e_0^2}{4}-

2076: \frac{2(1-\sqrt{z})}{2^k}+\frac{8k(1-y)^2}{2^{2k}}+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}\right\}\nonumber\\

2077: &\le&

2078: y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{8k(1-y)^2}{2^{2k}}\right]\cdot\nonumber\\

2079: &\phantom{\le}&\left\{1+\e_0+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}-\frac{2(1-\sqrt{z})}{2^k}

2080: +\frac{8k(1-y)^2}{2^{2k}}+\frac{k(1-\sqrt{z})^2\e_0}{2^k}+\nonumber\right.\\

2081: &\phantom{\le}&\left.\frac{8k^2(1-y)^2\e_0}{2^{2k}}+k^3\e_0^3\right\}\nonumber\\

2082: &\le&y^{y/2^k}\left[1-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{8k(1-y)^2}{2^{2k}}\right]\cdot\nonumber\\

2083: &\phantom{\le}&\left\{1+\frac{2(1-y)}{2^k-k-1}+\frac{(1-\sqrt{z})^2\alpha^k}{2^k}-\frac{2(1-\sqrt{z})}{2^k}+

2084: \frac{30k(1-y)^2}{2^{2k}}\right\}\nonumber\\

2085: &\le&y^{y/2^k}\left\{1+\frac{2(1-y)}{2^k-k-1}-\frac{(k+1)(1-y)}{2^k(2^k-k-1)}+

2086: \frac{(1-\sqrt{z})^2\alpha^k}{2^k}-\nonumber\right.\\&\phantom{\le}&\left.\frac{2(1-\sqrt{z})}{2^k}+

2087: \frac{50k(1-y)^2}{2^{2k}}\right\},

2088: \end{eqnarray}

2089: where we have used the upper bound in (\ref{eq:e}),

2090: (\ref{eq:defz}) and the fact that $k\ge 20000$.

2091:

2092: Now, we claim that for every $\alpha\in [0,1]$,

2093: \begin{eqnarray}\label{eq:trick}

2094: (1-\sqrt{z})^2\alpha^k -2(1-\sqrt{z})\le (1-\sqrt{y})^2\alpha^k

2095: -2(1-\sqrt{y})+z-y.

2096: \end{eqnarray}

2097: Indeed, since by (\ref{eq:defz}), $z\le y$, the left-hand side

2098: minus the right-hand side of (\ref{eq:trick}) is an increasing

2099: function, which vanishes at $1$. Moreover, by (\ref{eq:defz}),

2100: $$

2101: z-y\le -\frac{(k+1)(1-y)}{2^k-k-1}+\frac{4k(1-y)^2}{2^k},

2102: $$

2103: so that (\ref{eq:trick}) becomes:

2104: $$

2105: (1-\sqrt{z})^2\alpha^k -2(1-\sqrt{z})\le (1-\sqrt{y})^2\alpha^k

2106: -2(1-\sqrt{y})-\frac{(k+1)(1-y)}{2^k-k-1}+\frac{4k(1-y)^2}{2^k}.

2107: $$

2108: Plugging this into (\ref{eq:almost}) we get:

2109: \begin{eqnarray}\label{eq:almostalmost}

2110: &&\!\!\!\!\!\!\!\!\!\!\!\!\!\!\frac{f(\alpha,\sqrt{\gamma_0},\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}\le\nonumber\\

2111: &\le& y^{y/2^k}

2112: \left\{1+\frac{2(1-y)}{2^k-k-1}-\frac{2(k+1)(1-y)}{2^k(2^k-k-1)}+\frac{(1-\sqrt{y})^2\alpha^k}{2^k}-\right.\nonumber\\

2113: &\phantom{\le}&\left.

2114: \frac{2(1-\sqrt{y})}{2^k}+\frac{60k(1-y)^2}{2^{2k}}\right\}\nonumber\\

2115: &=& y^{y/2^k}

2116: \left\{1+\frac{2(1-y)+(1-\sqrt{y})^2\alpha^k-2(1-\sqrt{y})}{2^k}+\frac{60k(1-y)^2}{2^{2k}}\right\}.

2117: \end{eqnarray}

2118:

2119: This concludes the proof of Lemma \ref{lem:ratio}.

2120:

2121:

2122:

2123:

2124:

2125:

2126:

2127:

2128:

2129:

2130:

2131:

2132:

2133:

2134:

2135:

2136:

2137:

2138:

2139:

2140:

2141:

2142:

2143:

2144: \pagebreak We will bound the various terms in (\ref{eq:estimate})

2145: separately. To bound the first term we use the inequality

2146: $1/(1-{\e_0})\le 1+2{\e_0}$ and the inequality $(1+a)^k\le

2147: 1+ka+k^2a^2$, which is valid for all $a\le 1/k$. Since for $k\ge

2148: 32$, ${\e_0}\le 1/k$, we get that:

2149: \begin{eqnarray}\label{eq:first}

2150: \left[1+\frac{{\e_0}}{2

2151: (1-{\e_0})}\right]^2\cdot\left[1+\frac{2x{\e_0}^2}{(2-{\e_0})^2}\right]^k&\le&

2152: \left[1+\frac{{\e_0}}{2}(1+2{\e_0})\right]^2\cdot(1+{\e_0}^2)^k\nonumber\\

2153: &\le& \left(1+{\e_0}+5{\e_0}^2\right)(1+k{\e_0}^2+k^2{\e_0}^4)\nonumber\\

2154: &=&

2155: 1+{\e_0}+2k{\e_0}^2+k{\e_0}^3+5k{\e_0}^4+k^2{\e_0}^4+k^2{\e_0}^5+5k^2{\e_0}^6\nonumber\\

2156: &\le& 1+\frac{2(1-y)}{2^k}+\frac{50k(1-y)}{2^{2k}},

2157: \end{eqnarray}

2158: where we have used the upper bound in (\ref{eq:e}) and the fact

2159: that $k\ge 20000$.

2160:

2161: The second term in (\ref{eq:estimate}) appears with a minus sign,

2162: so we must bound it from below.

2163: \begin{eqnarray}\label{eq:second}

2164: \frac{2(1-\sqrt{y})[(2-{\e_0})+2x{\e_0}]^k}{4(1-{\e_0})^2(2-{\e_0})^{2k-2}}&=&\frac{1}{2^{2k}}\cdot

2165: \frac{2(1-\sqrt{y})}{(1-{\e_0})^2(1-{\e_0}/2)^{2k-2}}\cdot

2166: 2^k[1-{\e_0}(1-\alpha)]^k\nonumber\\

2167: &\ge&\frac{2(1-\sqrt{y})}{2^k}[1-k{\e_0}(1-\alpha)]\nonumber\\

2168: &\ge& \frac{2(1-\sqrt{y})}{2^k}-\frac{8k(1-y)}{2^{2k}},

2169: \end{eqnarray}

2170: where we have used once more the estimate (\ref{eq:e}).

2171:

2172: Finally, we estimate the last term in (\ref{eq:estimate}):

2173:

2174: \begin{eqnarray}\label{eq:third}

2175: \frac{(1-\sqrt{y})^2(1+2x)^k}{4(1-{\e_0})^2(2-{\e_0})^{2k-2}}&=&

2176: \frac{(1-\sqrt{y})^22^k\alpha^k}{2^{2k}(1-{\e_0})^2(1-{\e_0}/2)^{2k-2}}\nonumber\\

2177: &\le&

2178: \frac{(1-\sqrt{y})^2\alpha^k}{2^{k}}\cdot\frac{1}{(1-2{\e_0})[1-(k-1){\e_0}]}\nonumber\\

2179: &\le&

2180: \frac{(1-\sqrt{y})^2\alpha^k}{2^{k}}\cdot(1+4{\e_0})[1+2(k-1){\e_0}]\nonumber\\

2181: &\le& \frac{(1-\sqrt{y})^2\alpha^k}{2^{k}}+\frac{20

2182: k(1-y)}{2^{2k}}.

2183: \end{eqnarray}

2184: We have used here the estimate (\ref{eq:e}) one last time.

2185:

2186: Plugging (\ref{eq:first}), (\ref{eq:second}), (\ref{eq:third})

2187: into (\ref{eq:estimate}) we get that:

2188: \begin{eqnarray*}

2189: \frac{f(\alpha,\gamma_0,\sqrt{\eta_0})}{f(1/2,\gamma_0,\eta_0)}&=&

2190: y^{y/2^k}\left[1+\frac{2(1-y)-2(1-\sqrt{y})+(1-\sqrt{y})^2\alpha^k}{2^k}+\frac{80k(1-y)}{2^{2k}}\right],

2191: \end{eqnarray*}

2192: as required.

2193:

2194:

2195:

2196:

2197:

2198:

2199:

2200:

2201:

2202:

2203:

2204:

2205:

2206:

2207:

2208:

2209:

2210: \pagebreak

2211:

2212:

2213: We will require the following fact:

2214:

2215: \begin{claim}\label{claim:monotone} The function:

2216: \begin{eqnarray}\label{eq:chi}

2217: \chi(\alpha)=\frac{kA\alpha^{k-1}-B}{A\alpha^k+B(1-\alpha)+C}

2218: \end{eqnarray}

2219: is increasing on $\left[1-\frac{4\log k}{k},1\right]$.

2220: \end{claim}

2221:

2222:

2223: We postpone the proof of Fact \ref{claim:monotone} to the end of

2224: this section, and proceed to conclude the proof of Lemma

2225: \ref{lem:bigalpha}.

2226:

2227: \medskip

2228:

2229: By (\ref{eq:condition}), Claim \ref{claim:monotone} and

2230: (\ref{eq:hleft}), we get that for $\alpha\ge 1-\frac{4\log k}{k}$,

2231: \begin{eqnarray*}

2232: -\log(1-\alpha)&=&-\log \alpha +\chi(\alpha)[\log 2-h(\alpha)]\\

2233: &\le& -\log \alpha +\chi(1)[\log 2-h(\alpha)]\\ &=&

2234:  -\log\alpha+

2235: \frac{kA-B}{A+C}[\log 2-h(\alpha)]\\

2236: &\le& -\log \alpha +\frac{kA}{A+C}[\log 2-h(\alpha)].

2237: \end{eqnarray*}

2238: Since $C\ge y\log y+2(1-y)-2(1-\sqrt{y})$, we see that:

2239: $$

2240: \frac{A}{A+C}\le \frac{(1-\sqrt{y})^2}{1-y+y\log y}\le 1.

2241: $$

2242:

2243:

2244:

2245: \begin{comment}

2246: -\log\left(1-\frac{4\log k}{k}\right)+\frac{kA}{A+C}\left[\log

2247: 2-h\left(1-\frac{4\log k}{k}\right)\right].

2248: \end{eqnarray*}

2249: \end{comment}

2250:

2251:

2252:

2253:

2254:

2255:

2256: \pagebreak

2257:

2258: Our goal is to find the maximal $r$ such that for all $\alpha\in

2259: [0,1]$, $E(\alpha)\ge E(1/2)$. We assume that $z\le 1-\frac{3\log

2260: k}{k}$ and $r\le \frac{2^k\log 2}{1-y+y\log y}$, in which case

2261: Lemma \ref{lem:decreasing} tells us that we must show that for

2262: $\alpha\in [z,1]$,

2263: $$

2264: \left[\frac{f(\alpha,\gamma_1,\eta_1)}{f(1/2,\gamma_0,\eta_0)}\right]^r\le

2265: 2\alpha^\alpha(1-\alpha)^{1-\alpha}.

2266: $$

2267: Denote ${\e_0}_1=1-\gamma_1^2$. Then:

2268: \begin{eqnarray}\label{eq:f1}

2269: \eta_1^u2^{2k}\gamma_1^{2k}&&\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!f\left(\frac12+x,\gamma_1,\eta_1\right)=\nonumber\\

2270: &=&

2271: [2x{\e_0}_1^2+(2-{\e_0}_1)^2]^k-2(1-\eta_1)[(2-{\e_0}_1)+2x{\e_0}_1]^k+(1-\eta_1)^2(1+2x)^k

2272: \begin{comment}

2273: &=& \sum_{j=0}^k

2274: \binom{k}{j}(2x)^j\big[{\e_0}_1^{2j}(2-{\e_0}_1)^{2(k-j)}-2{\e_0}_1^{j}(2-{\e_0})^{k-j}(1-\eta_1)+(1-\eta_1)^2\big]\nonumber\\

2275: &=& \sum_{j=0}^k

2276: \binom{k}{j}(2x)^j\big[{\e_0}_1^j(2-{\e_0}_1)^{k-j}-(1-\eta_1)\big]^2.

2277: \end{comment}

2278: \end{eqnarray}

2279:

2280: Write ${\e_0}_1=\frac{w}{2^k}$, $w\in [0,1]$. For $1\ge

2281: \alpha>1-\frac{3\log k}{k}$ set $\alpha=1/2+x$, in which case:

2282: \begin{eqnarray*}

2283: &&\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\eta_1^u2^{2k}\gamma_1^{2k}f\left(\frac12+x,\gamma_1,\eta_1\right)=\\

2284: &\le&

2285: \left[2^{2k}-2^kkw+\frac{k(k-1)}{2}w^2+kw^2\alpha^2+O\left(\frac{k^3}{2^k}\right)\right]-2(1-\eta_1)\left[2^k-kw(1-\alpha)\right] +(1-\eta_1)^22^k\alpha^k\\

2286: &\le&2^{2k}-2^k[kw+2(1-\eta_1)-(1-\eta_1)^2\alpha^k]+\frac{k^2w^2}{2}+\frac{kw^2}{2}+O(\log

2287: k).

2288: \end{eqnarray*}

2289: Similarly, since ${\e_0}$ is approximately $\frac{1-y}{2^{k-1}}$,

2290: \begin{eqnarray*}

2291: &&\!\!\!\!\!\!\!\!\!\!\!\!\eta_0^u2^{2k}\gamma_0^{2k}f\left(\frac12,\gamma_0,\eta_0\right)=4(1-{\e_0})^2(2-{\e_0})^{2k-2}\\&=&

2292: 2^{2k}-2^{k+1}(k+1)(1-y)+2k^2(1-y)^2+3k(1-y)^2+O\left(1\right).

2293: \end{eqnarray*}

2294: Using the fact that $\frac{1}{1-v}=1+v+v^2+O(v^3)$, it follows

2295: that:

2296: \begin{eqnarray}\label{eq:fpart}

2297: &&\!\!\!\!\!\!\!\!\!\!\!\!\!\frac{\eta_1^u2^{2k}\gamma_1^{2k}

2298: f\left(\alpha,\gamma_1,\eta_1\right)}{\eta_0^u2^{2k}\gamma_0^{2k}f

2299: \left(1/2,\gamma_0,\eta_0\right)}=\nonumber\\

2300: &=&\left[1-\frac{kw+2(1-\eta_1)-

2301: (1-\eta_1)^2\alpha^k}{2^k}+\frac{k^2}{2^{2k+1}}w^2+\frac{k}{2^{2k+1}}w^2+O\left(\frac{\log

2302: k}{2^{2k}}\right)\right]

2303: \cdot\nonumber\\

2304: &\phantom{\le}&\left[1+\frac{2(k+1)(1-y)}{2^k}-\frac{2k^2(1-y)^2}{2^{2k}}-\frac{3k(1-y)^2}{2^{2k}}+

2305: \frac{4k^2(1-y)^2}{2^{2k}}+\frac{8k(1-y)^2}{2^{2k}}+

2306: O\left(\frac{1}{2^{2k}}\right)\right]\nonumber\\

2307: &=&

2308: 1-\frac{k(w-2(1-y))+2(y-\eta_1)-(1-\eta_1)^2\alpha^k}{2^k}+\frac{k^2[w-2(1-y)]^2}{2^{2k+1}}+\nonumber\\

2309: &\phantom{\le}&

2310: \frac{k[2w^2-2w(1-y)+5(1-y)^2-4(1-y)(1-\eta_1)+2(1-y)(1-\eta_1)^2\alpha^k]}{2^{2k}}+

2311:  O\left(\frac{\log k}{2^{2k}}\right).

2312: \end{eqnarray}

2313: Moreover:

2314: \begin{eqnarray}\label{eq:gammapart}

2315: \!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!\!&&\left(\frac{\gamma_0}{\gamma_1}\right)^{2k}=\nonumber\\&=&

2316: \left[\frac{1-\frac{1-y}{2^{k-1}}}{1-\frac{w}{2^k}}\right]^k\nonumber\\

2317: &=& 1+\frac{k[w-2(1-y)]}{2^k}+\frac{k^2[w-2(1-y)]^2}{2^{2k+1}}+\nonumber\\

2318: &\phantom{\le}& \frac{k[2w^2-2w(1-y)-(w-2(1-y))^2]}{2^{2k+1}}+

2319: O\left(\frac{1}{2^{2k}}\right).

2320: \end{eqnarray}

2321:

2322: Multiplying (\ref{eq:fpart}) and (\ref{eq:gammapart}) we get a lot

2323: of cancellation:

2324: \begin{eqnarray}\label{eq:cancellation}

2325: &&\!\!\!\!\!\!\!\!\!\!\!\!\!\frac{\eta_1^u2^{2k}\gamma_1^{2k}

2326: f\left(\alpha,\gamma_1,\eta_1\right)}{\eta_0^u2^{2k}\gamma_0^{2k}f

2327: \left(1/2,\gamma_0,\eta_0\right)}\cdot\left(\frac{\gamma_0}{\gamma_1}\right)^{2k}=\nonumber

2328: \\&=&1-1-\frac{2(y-\eta_1)-(1-\eta_1)^2\alpha^k}{2^k}

2329: \end{eqnarray}

2330:

2331:

2332: Hence, using the fact that $\eta_0\le y$,

2333: \begin{eqnarray*}

2334: \frac{f(\alpha,\gamma_1,\eta_1)}{f(1/2,\gamma_0,\eta_0)}&=&\\

2335: &=&

2336: \left(\frac{\eta_0}{\eta_1}\right)^u\left(\frac{\gamma_0}{\gamma_1}\right)^{2k}

2337: \frac{\eta_1^u2^{2k}\gamma_1^{2k}

2338: f\left(\alpha,\gamma_1,\eta_1\right)}{\eta_0^u2^{2k}\gamma_0^{2k}f

2339: \left(1/2,\gamma_0,\eta_0\right)}\\

2340: &\le& \left(\frac{y}{\eta_1}\right)^{y/2^{k-1}}

2341: \left[1+\frac{k[w-2(1-y)]}{2^k}+\frac{k^2[w-2(1-y)]^2}{2^{2k+1}}+O\left(\frac{k}{2^{2k}}\right)\right]\cdot\\

2342: &\phantom{\le}&

2343: \left[1-\frac{k(w-2(1-y))+2(y-\eta_1)-(1-\eta_1)^2\alpha^k}{2^k}+\frac{k^2[w-2(1-y)]^2}{2^{2k+1}}+

2344: O\left(\frac{k}{2^{2k}}\right)\right]

2345: \end{eqnarray*}

2346: Hence, assuming that $w=2(1-y)$, we get that:

2347: \begin{eqnarray*}

2348: \frac{f(\alpha,\gamma_1,\eta_1)}{f(1/2,\gamma_0,\eta_0)}&\le&\\

2349: &\le&

2350: \left(\frac{y}{\eta_1}\right)^{y/2^{k-1}}\left[1-\frac{2(y-\eta_1)-(1-\eta_1)^2\alpha^k}{2^k}+

2351: O\left(\frac{k^2}{2^{2k}}\right)\right].

2352: \end{eqnarray*}

2353:

2354: Now, we must show that for $\alpha \ge 1-\frac{3\log k}{k}$,

2355: $$

2356: r\log

2357: \left[\frac{f(\alpha,\gamma_1,\eta_1)}{f(1/2,\gamma_0,\eta_0)}\right]\le

2358: \log2+\alpha\log\alpha+(1-\alpha)\log(1-\alpha).

2359: $$

2360: Observe that for $1\ge \alpha \ge 1-\frac{3\log k}{k}$,

2361: $$

2362: \log2+\alpha\log\alpha+(1-\alpha)\log(1-\alpha)\ge \log

2363: 2\left[1-O\left(\frac{(\log k)^2}{k}\right)\right].

2364: $$

2365: By the above estimate, writing $r=2^kd$ we get that:

2366: \begin{eqnarray*}

2367: r\log

2368: \left[\frac{f(\alpha,\gamma_1,\eta_1)}{f(1/2,\gamma_0,\eta_0)}\right]&\le&\\

2369: &\le&

2370: d\left[2y\log\left(\frac{y}{\eta_1}\right)+(1-\eta_1)^2\alpha^k-2(y-\eta_1)+O\left(\frac{k^2}{2^{k}}\right)\right]\\

2371: &\le&d\left[2y\log\left(\frac{y}{\eta_1}\right)+(1-\eta_1)^2-2(y-\eta_1)+O\left(\frac{k^2}{2^{k}}\right)\right].

2372: \end{eqnarray*}

2373: Minimizing the function in the brackets we see that we should

2374: choose $\eta_1=\sqrt{y}$, which gives that we should find the

2375: largest $d$ such that

2376: $$

2377: d\left[1-y+y\log y+O\left(\frac{k^2}{2^{k}}\right)\right]\le \log

2378: 2\left[1-O\left(\frac{(\log k)^2}{k}\right)\right].

2379: $$

2380:

2381: Which gives:

2382: $$

2383: d=\frac{\log 2}{1-y+y\log y}-O\left(\frac{(\log k)^2}{k}\right).

2384: $$

2385:

2386: \medskip

2387:

2388: We now show how to improve the estimates on $d$. Set

2389: $h(\alpha)=-\alpha\log \alpha-(1-\alpha)\log(1-\alpha)$. Fix

2390: $\eta_1$. We know that there exists a constant $C>0$ such that we

2391: can take:

2392: $$

2393: d\le \min\left\{ \frac{\log

2394: 2-h(\alpha)}{2y\log\left(\frac{y}{\eta_1}\right)+(1-\eta_1)^2\alpha^k-2(y-\eta_1)+\frac{Ck^2}{2^{k}}};\

2395: \frac{2}{3}\le \alpha\le 1\right\}.

2396: $$

2397: For the sake of simplicity, write $A=(1-\eta_1)^2$ and

2398: $B=2y\log\left(\frac{y}{\eta_1}\right)-2(y-\eta_1)+\frac{Ck^2}{2^{k}}$.

2399: Then, the function we wish to maximize is:

2400: $$

2401: \psi(\alpha)\equiv \frac{\log 2-h(\alpha)}{A\alpha^k+B}.

2402: $$

2403: Now,

2404: $$

2405: \psi'(\alpha)=\frac{(A\alpha^k+B)[\log

2406: \alpha-\log(1-\alpha)]-kA\alpha^{k-1}[\log

2407: 2-h(\alpha)]}{(A\alpha^k+B)^2}.

2408: $$

2409: Since $\psi'(1)=\infty$, the minimum cannot occur at $\alpha=1$.

2410: We therefore distinguish between two cases:

2411:

2412: \medskip

2413:

2414: \noindent{\bf Case 1:} The minimum of $\psi$ is at $\alpha=2/3$.

2415: In this case take $\eta_1=y$, and the condition on $d$ becomes $

2416: d\cdot O\left[\left(2/3\right)^k\right]\le \log2-h(2/3)$, i.e.

2417: $d=O\left[\left(3/2\right)^k\right]$.

2418:

2419: \medskip

2420:

2421: \noindent{\bf Case 2:} The minimum of $\psi$ is attained at

2422: $\alpha_0\in(2/3,1)$. We restrict ourselves to $\eta_1\ge y$, in

2423: which case it is straightforward to verify that $B>0$. Now,

2424: solving $\psi(\alpha_0)=0$ we get:

2425: $$

2426: -\log2-\log(1-\alpha_0)\le\log\alpha_0-\log(1-\alpha_0)=

2427: \frac{kA\alpha_0^{k-1}}{A\alpha_0^k+B}[\log 2-h(\alpha_0)]\le

2428: k\frac{\log 2-h(\alpha_0)}{\alpha_0}\le k\frac{3\log2}{2}.

2429: $$

2430: Hence $\log(1-\alpha_0)\le -4k$, i.e. $\alpha_0\ge 1-e^{-4k}$.

2431:

2432:

2433:

2434:

2435:

2436:

2437:

2438:

2439:

2440: \begin{comment}

2441: \pagebreak

2442:

2443:

2444:

2445: \bigskip

2446: Now,

2447: $$

2448: r\log \left[\frac{f(\alpha)}{f(1/2)}\right]\le

2449: r\left[\frac{f(\alpha)-f(1/2)}{f(1/2)}\right].

2450: $$

2451: We will need an estimate for $f(1/2)$:

2452: \begin{eqnarray}\label{eq:half}

2453: f\left(\frac12\right)&=&4(1-{\e_0})^2(2-{\e_0})^{2k-2}\ge \nonumber\\

2454: &\ge & 4(1-2{\e_0})2^{2k-2}[1-(k-1){\e_0}]\nonumber\\

2455: &=&

2456: 2^{2k}\left(1-\frac{1-y}{2^{k-1}}\right)\left(1-\frac{(k-1)(1-y)}{2^{k-1}}\right)\nonumber\\

2457: &=& 2^{2k}-2^{k+1}k(1-y)+4(k-1)(1-y)^2.

2458: \end{eqnarray}

2459:

2460: Now we put $\alpha=\frac{1}{2}+x$ and follow the same reasoning as

2461: in the original paper. Using the identity (\ref{eq:binom}), in

2462: which the term corresponding to $j=1$ vanishes and for $j\ge 1$,

2463: ${\e_0}^j(2-{\e_0})^{k-j}\le {\e_0}(2-{\e_0})^{k-1}$, we find

2464: that:

2465: \begin{eqnarray}\label{eq:crude}

2466: f(\alpha)&=&\nonumber\\

2467: &=& f\left(\frac12\right)

2468: +\sum_{j=2}^k\binom{k}{j}(2x)^j[{\e_0}^j(2-{\e_0})^{k-j}-{\e_0}(2-{\e_0})^{k-1}]^2\nonumber\\

2469: &\le& f\left(\frac12\right)

2470: +{\e_0}^2(2-{\e_0})^{2k-2}\sum_{j=0}^k\binom{k}{j}(2x)^j\nonumber\\

2471: &=& f\left(\frac12\right)+2^k{\e_0}^2(2-{\e_0})^{2k-2}\alpha^k.

2472: \end{eqnarray}

2473: So, since ${\e_0}\le 1/2$,

2474: \begin{eqnarray*}

2475: \frac{f(\alpha)}{f(1/2)}=1+2^{k-2}\left(\frac{{\e_0}}{1-{\e_0}}\right)^2\alpha^k\le

2476: 1+2^{k-2}({\e_0}^2+8{\e_0}^3)\alpha^k=1+\frac{(1-y)^2}{2^k}\left(1+\frac{1-y}{2^{k-4}}\right)\alpha^k.

2477: \end{eqnarray*}

2478: Now,

2479: $$

2480: r\log \left[\frac{f(\alpha)}{f(1/2)}\right]\le

2481: \frac{r(1-y)^2}{2^k}\left(1+\frac{1-y}{2^{k-4}}\right)\alpha^k,

2482: $$

2483:

2484:

2485: \end{comment}

2486: