1: %Created: Wed 02 Aug 2006 10:35:58 CEST
2: %Last Modified: Tue 07 Nov 2006 12:02:16 CET
3: %<<< Preamble
4: \documentclass[a4paper,12pt,oneside,reqno]{amsart}
5: \usepackage[headinclude,DIV13]{typearea}
6: \areaset{15.5cm}{25cm}
7: %\usepackage{a4wide}
8: %\textheight 48\baselineskip
9: \usepackage{amsfonts,amssymb,amsmath,amsthm}
10: %\usepackage[color,notcite]{showkeys}
11: %\usepackage[utf8]{inputenc}
12:
13: \overfullrule 1mm
14:
15: \newtheorem{theorem}{Theorem}[section]
16: \newtheorem{lemma}[theorem]{Lemma}
17: \newtheorem{proposition}[theorem]{Proposition}
18: \newtheorem{corollary}[theorem]{Corollary}
19: \newtheorem{exercise}{Exercise}
20:
21: \theoremstyle{definition}
22: \newtheorem{definition}[theorem]{Definition}
23: \newtheorem{assumption}[theorem]{Assumption}
24: \newtheorem{condition}{Condition}
25: \newtheorem{solution}{Solution}
26: \newtheorem{question}{Question}
27:
28: \theoremstyle{remark}
29: \newtheorem*{remark}{Remark}
30: \newtheorem*{hint}{Hints}
31:
32: \numberwithin{equation}{section}
33:
34: %\usepackage{verbatim}
35: %\let\solution=\comment
36: %\let\endsolution=\endcomment
37:
38: \def\todo#1{\marginpar{\raggedright\footnotesize #1}}
39: \def\change#1{{\color{green}\todo{change}#1}}
40:
41: \def\vagconv{\overset v\to }
42: \def\ppconv{\overset {pp} \to }
43: \def\weakconv{\overset w\to }
44:
45: \def\laweq{\overset{\mathrm{law}}=}
46:
47: \def\Max{{\mathrm{max}}}
48: \def\Min{{\mathrm{min}}}
49: \def\supp{\mathop{\mathrm{supp}}\nolimits}
50: \def\Id{\mathrm{Id}}
51: \def\dist{\mathop{\rm dist}\nolimits}
52: \def\supp{\mathop{\rm supp}\nolimits}
53: \def\sign{\mathop{\rm sign}\nolimits}
54: \def\psum{\mathop{\sum\nolimits'}}
55: \def\d{\mathrm{d}}
56: \def\dd{\mathtt{d}}
57: \def\bbone{{\mathchoice {\rm 1\mskip-4mu l} {\rm 1\mskip-4mu l}
58: {\rm 1\mskip-4.5mu l} {\rm 1\mskip-5mu l}}}
59: %\def\bbone{\chi }
60:
61: \def\<{\langle}
62: \def\>{\rangle}
63:
64: %>>>
65:
66:
67: \begin{document}
68: \title{Hitting time of large subsets of the hypercube}
69:
70: \author[J. \v Cern\'y]{Ji\v r\'\i~\v Cern\'y}
71: \address{Ji\v r\'\i~\v Cern\'y\\
72: \'Ecole Polytechnique F\'ed\'erale de Lausanne\\
73: 1015 Lausanne\\
74: Switzerland
75: }
76: \email{jiri.cerny@epfl.ch}
77:
78: \author[V. Gayrard]{V\'eronique Gayrard}
79: \address{V\'eronique Gayrard\\Laboratoire d'Analyse, Topologie, Probabilit\'es\\
80: CMI, 39 rue Joliot-Curie\\
81: 13453 Marseille Cedex}
82: \email{gayrard@latp.univ-mrs.fr, veronique@gayrard.net}
83:
84: \subjclass[2000]{60J10, 60K37, 82D30}
85: \keywords{Hitting time, random walk, hypercube, aging}
86:
87: \date{\today}
88: \begin{abstract}
89: We study the simple random walk on the $n$-dimensional hypercube, in
90: particular its hitting times of large (possibly random) sets. We give
91: simple conditions on these sets ensuring that the properly-rescaled
92: hitting time is asymptotically exponentially distributed, uniformly in
93: the starting position of the walk. These conditions are then verified for
94: percolation clouds with densities that are much smaller than
95: $(n \log n)^{-1}$. A main motivation behind this paper is the study of
96: the so-called aging phenomenon in the Random Energy Model (REM),
97: the simplest model of a mean-field spin glass. Our results allow
98: us to prove aging in the REM for all temperatures,
99: thereby extending earlier results to their optimal temperature domain.
100:
101: \end{abstract}
102:
103: \maketitle
104:
105:
106: %%%% START OF ht1.tex %%%%
107: \section{Introduction}
108: \label{s:introduction}
109: Let $\mathcal V_n$ be the $n$-dimensional hypercube,
110: $\mathcal V_n=\{0,1\}^n$. We equip $\mathcal V_n$ with
111: the metric
112: \begin{equation}
113: d(x,y)=\sum_{i=1}^n \bbone\{x(i)\neq y(i)\},
114: \end{equation}
115: where $x(i)$ are the coordinates of $x\in \mathcal V_n$. Let further $Y_n$ be
116: the simple random walk on $\mathcal V_n$. That is, $Y_n$ is the
117: discrete-time Markov chain with state space $\mathcal V_n$ whose
118: transition probabilities are given by
119: $\mathbb P[Y_n(k+1)=y|Y_n(k)=x]=n^{-1}$ if $d(x,y)=1$, and $0$ otherwise. We
120: write $\mathbb P_x$ for the distribution of $Y_n$ conditioned on $Y_n(0)=x$.
121: For $A\subset \mathcal V_n$ we define the hitting time of $A$ by
122: \begin{equation}
123: H_n(A)=\min\{k\ge 0: Y_n(k)\in A\}.
124: \end{equation}
125:
126: We are interested in the distribution of the hitting time of large random
127: subsets of the hypercube. Specifically, let $\rho \in [0,1] $. We say that
128: the set $A\subset \mathcal V_n$ is a \textit{percolation cloud} on
129: $\mathcal V_n$ with density $\rho $ if each site $x\in \mathcal V_n$ is
130: in $A$ with probability $\rho $ independently of all others.
131:
132:
133: Our main aim is to prove the following theorem.
134: \begin{theorem}
135: \label{t:perc}
136: Let $\bar m(n)$ be such that
137: \begin{equation}
138: \label{e:condm}
139: n \log n \ll \bar m(n) \ll 2^{n}(\log n)^{-1}
140: \end{equation}
141: and let $A_n$ be a sequence of percolation clouds on $\mathcal V_n$ with
142: densities $\bar m(n)^{-1}$ defined on a common probability space
143: $(\Omega,F, P)$.
144: Then, for all $a>0$,
145: \begin{equation}
146: \label{e:thmjedna}
147: \lim_{n\to\infty}
148: \max_{x\in \mathcal V_n}
149: \Big |
150: \mathbb P_x \big[H_n(A_n\setminus x)\ge a \bar m(n)\big]-e^{-a}
151: \Big |
152: =0\,,\quad P\text{-a.s.}
153: \end{equation}
154: In words, the distribution of the normalised hitting time of
155: $A_n\setminus x$ converges to the exponential distribution uniformly in
156: the starting position $x$.
157: \end{theorem}
158:
159:
160: In Theorem~\ref{t:sampling} below we state a similar result for
161: another important class of random sets, namely sets $A_n$ that are
162: sampled from $\mathcal V_n$ without replacement (i.e. each subset
163: of $|A_n|$ elements of $\mathcal V_n$ is equally likely).
164:
165:
166: \begin{theorem}
167: \label{t:sampling}
168: Let $M_n$ be a sequence of integers such that, setting $m(n)={2^n}/{M_n}$,
169: \begin{equation}
170: \label{e:condm'}
171: n \log n \ll m(n) \leq 2^n
172: \end{equation}
173: Let $A_n$ be subsets of $M_n$ elements sampled from $\mathcal V_n$ without replacement,
174: and defined on a common probability space
175: $(\Omega',F', P')$.
176: Then, for all $a>0$,
177: \begin{equation}
178: \lim_{n\to\infty}
179: \max_{x\in \mathcal V_n}
180: \Big |
181: \mathbb P_x \big[H_n(A_n\setminus x)\ge a m(n)\big]-e^{-a}
182: \Big |
183: =0\,,\quad P'\text{-a.s.}
184: \end{equation}
185: \end{theorem}
186:
187:
188: Estimates on the distribution of the hitting time of various subsets of
189: the hypercube have a long history. They can be traced back to the early
190: literature on first passage times for Markov chains (see \cite{Kem61} and
191: the references therein) where these questions are reformulated in terms
192: of the (one-dimensional) Ehrenfest urn scheme. These results provide very
193: sharp estimates on the asymptotic distribution of the hitting time of a
194: single point. More, recently Matthews \cite[p.118]{Mat89} gave finite-$n$
195: estimates for the Laplace transform of the hitting time of sets containing
196: one or two points. These estimates are key ingredients that enter his
197: description of the covering time of the hypercube and related questions.
198:
199: The hitting times of much more general sets (possibly random, and whose
200: size is possibly increasing with $n$) were studied very recently in
201: \cite{BC06} and \cite{BG06}. In \cite{BG06}, Ben Arous and Gayrard give
202: precise conditions for the hitting time of subsets of $\mathcal V_n$ to be
203: asymptotically exponentially distributed for a class of subsets for which
204: the so-called lumping construction can be applied. This construction was
205: fist used in this context by \cite{BBG03}. It can be understood as a $d$-dimensional
206: extension of the Ehrenfest urn scheme where the random walk $Y_n$ on the
207: hypercube is replaced by a walk defined on a $d$-dimensional state space
208: of smaller cardinality, and which evolves in a convex potential that is
209: very steep near its boundary. Such a chain is then studied using the tools
210: developed in \cite{BEGK1},\cite{BEGK2} for the study of metastability in
211: reversible Markov chains on discrete state space. This method allows in
212: particular to show that Theorem~\ref{t:perc} is valid for
213: $m(n)\ge C 2^n/\log n$.
214:
215: In \cite{BC06}, Ben Arous and \v Cern\'y obtain a result similar to
216: Theorem~\ref{t:perc}. Namely, they prove convergence to the
217: exponential distribution of the hitting time of percolation clouds
218: for densities of order $2^{-c n}$ with $c\in (3/4,1)$ (see
219: Lemma~3.7 of \cite{BC06}). The method is based on the formula discovered
220: by Matthews in his study of covering problems (see Theorem 1.3 in
221: \cite{Mat88}) and on (improved) estimates from \cite{Mat89}.
222:
223: The results of the present paper are the first that allow to treat sets
224: of very large size, namely sets as large as $o(2^n (n \log n)^{-1})$.
225:
226: \smallskip
227:
228: We will show in Section~\ref{s:equiv} that
229: both Theorem~\ref{t:perc} and Theorem~\ref{t:sampling} are
230: consequences of the following more general theorem. To state it we define
231: the function $\xi_n(k)$ by
232: \begin{equation}
233: \label{e:defxi}
234: \xi_n(k)=2^{-n } \frac n 2 \binom n k^{-1}
235: \sum_{j=1}^{n-k}\binom n {k+j} \frac 1 j,
236: \qquad k=0,\dots ,n.
237: \end{equation}
238: The role of this relatively complicated function will become evident in
239: Lemma~\ref{l:twopoint}.
240:
241: \begin{theorem}
242: \label{t:gen}
243: Let $n \log n \ll m(n) \leq 2^n$ and
244: $B_n\subset \mathcal V_n$ be such that
245: \begin{equation}
246: \label{e:Bsize}
247: 0<|B_n|= 2^n m(n)^{-1}(1+o(1)).
248: \end{equation}
249: Define
250: \begin{equation}
251: \begin{split}
252: \label{e:volumes}
253: v_n(k)&=\max_{x\in \mathcal V_n} \big|\{y\in B_n:d(x,y)=k\}\big|,\\
254: V_n(k)&=\max_{x\in \mathcal V_n} \big|\{y\in B_n:d(x,y)\le k\}\big|.
255: \end{split}
256: \end{equation}
257: If there exists a function $g(n)$, such that $g(n)\le n/2$,
258: \begin{equation}
259: \label{e:gg}
260: \xi_n(g(n))\ll 2^{-n}m(n),
261: \end{equation}
262: and
263: \begin{equation}
264: \label{e:vcond}
265: \lim_{n\to \infty}\sum_{k=1}^{g(n)-1}v_n(k)\xi_n(k)=0,
266: \qquad
267: \text{and}
268: \qquad
269: V_n(g(n)-1)\ll |B_n|,
270: \end{equation}
271: then
272: \begin{equation}
273: \label{e:lim}
274: \lim_{n\to\infty}
275: \max_{x\in \mathcal V_n}
276: \Big |
277: \mathbb P_x \big[H_n(B_n\setminus x)\ge a m(n)\big]-e^{-a}
278: \Big |
279: =0.
280: \end{equation}
281: \end{theorem}
282:
283: \begin{remark}
284: 1. While the upper bound on $m(n)$ in the range of validity of the
285: preceding theorem is trivially optimal, we cannot claim the same about
286: the lower bound. Morally, the proof exploits the fact that the simple
287: random walk on the hypercube equilibrates after $O(n \log n)$ steps,
288: even if an application of this fact is not easy to see in our actual
289: proof. It is intuitively obvious that after equilibration the hitting
290: time should be exponentially distributed. The claim of the theorem might
291: stay valid even for $m(n)=O(n\log n)$, however for different reasons.
292: Several serious technical complications appears in our proof for such
293: $m(n)$. We were not motivated to proceed further since it is not needed
294: for the applications we have in mind.
295:
296: 2. The results of \cite{BG06} (see Theorem 1.7 of \cite{BG06} and the
297: simpler Corollary 1.8) imply that Theorem~\ref{t:perc} holds true for
298: sets $B_n$ whose size is asymptotically constant that is, when
299: \eqref{e:Bsize} holds for $m(n)={2^n}/{M}$ where $M$ is a fixed integer.
300:
301: 3. We will show soon that $\xi_n(1)=O(n^{-1})$. Therefore, the
302: conditions appearing in \eqref{e:vcond} are void when \eqref{e:Bsize}
303: holds with $m(n)$ satisfying $2^{-n}m(n)\gg n^{-1}$ (and, in
304: particular, when $|B_n|$ is constant). As an immediate consequence of
305: Theorem~\ref{t:gen} we then obtain:
306: \end{remark}
307:
308: \begin{corollary}
309: \label{c:perc}
310: Theorem~\ref{t:perc} remains valid on $2^{n}(\log n)^{-1}\le \bar m(n)\leq
311: 2^n$ if the term $a\bar m(n)$ in \eqref{e:thmjedna} is replaced by $2^n |A_n|^{-1}$.
312: \end{corollary}
313:
314: \begin{proof}
315: It is sufficient to take $m(n)$ in Theorem~\ref{t:gen}
316: to be $m(n)=2^n/|A_n|$, where $A_n$ is the percolation cloud with
317: intensity $\bar m(n)$.
318: \end{proof}
319:
320: Our method of proof relies on a sharp estimate on the distribution
321: of hitting time of a single point when time is measured on the
322: scale $m(n)$, namely on the quantity
323: \begin{equation}
324: \mathbb P_{z_k}[H_n(\boldsymbol 0)< a m(n)]
325: \end{equation}
326: where $\boldsymbol 0$ is the vertex of $\mathcal V_n$ whose coordinates
327: are all $0$ and $z_k$ is any vertex at distance $k$ from it; this estimate
328: is itself derived from an estimate on the Laplace transform of the hitting
329: time $H_n(\boldsymbol 0)$ (see respectively Lemma~\ref{l:twopoint} and
330: Lemma~\ref{l:laplace}). The domain of validity of the latter determines
331: our bounds on $m(n)$. On this domain the Laplace transform is well
332: approximated by the sum of two terms: the expected contribution of an
333: exponential distribution, and the mysterious looking function $\xi_n(k)$
334: which is, essentially, the probability that $Y_n$ started from $z_k$ hits
335: $\boldsymbol 0$ in the first $n\log n$ steps.
336:
337: The rest of the paper is organised as follows. In the next section we
338: describe our main motivation and give some important consequences of our
339: results for aging in the Random Energy model.
340: In Section~\ref{s:prop} we give the proof of Theorem~\ref{t:gen}.
341: Finally, Theorems~\ref{t:perc} and~\ref{t:sampling} are proved in
342: Section~\ref{s:equiv}.
343: %%%% END OF ht1.tex %%%%
344:
345: %%%% START OF ht4.tex %%%%
346:
347: \section{Aging in the Random Energy Model}
348: \label{s:aging}
349:
350: The main motivation behind this paper originates in the study of the
351: Random Hopping Time (RHT) dynamics of the Random Energy Model (REM), which
352: is often called the simplest model of a spin glass. Let us describe this problem
353: briefly (for a recent review see \cite{BC06c}). In the REM an energy $E_x$
354: is attached to every site $x\in \mathcal V_n$. The $E_x$'s are chosen to
355: be i.i.d.~with standard normal distribution. Given a collection
356: $\boldsymbol E=\{E_x:x\in \mathcal V_n\}$ and a parameter $\beta>0 $
357: (representing the inverse of the temperature), the RHT dynamics in the REM
358: is defined as a continuous-time Markov process $X_n(\,\cdot\,)$ whose
359: transition rates are given by
360: $w^{\boldsymbol E}_n(x,y)=\exp(-\beta \sqrt n E_x)$ if $d(x,y)=1$ and zero
361: otherwise.
362:
363: The main goal of the study of the processes $X_n$ was to prove aging. In
364: this context it usually means showing that
365: the two-point function
366: \begin{equation}
367: R_n(t_w,t_w+t;\boldsymbol E):=P[X_n(t_w+t)=X_n(t_w)|\boldsymbol E],
368: \end{equation}
369: has a non-trivial limit as $t_w$, $ t=\theta t_w$ and $n$ tend simultaneously
370: to infinity.
371:
372: The first proof of aging in the REM for $\beta > \beta_c=\sqrt {2 \log 2}$
373: and times $t_w\sim \exp (\beta \beta_c n)$ was given in
374: \cite{BBG03,BBG03b}, based on the analysis of the metastable behaviour of
375: $X_n$ and renewal theory. In \cite{BC06} another proof
376: of aging, based on the arc-sine law for stable subordinators, was given
377: for temperatures satisfying
378: \begin{equation}
379: \label{e:bc}
380: \sqrt {3/4} < \alpha \beta / \beta_c < 1.
381: \end{equation}
382: and (shorter) time-scales
383: $t_w(n)=( \alpha \beta \sqrt{2 \pi n})^{-1/\alpha }\exp(\alpha \beta^2 n)$,
384: where $\alpha \in (0,1)$ is a free parameter. We will now explain how our
385: methods allows to improve the lower bound $\sqrt{3/4}$ in \eqref{e:bc} to its
386: optimal value, that is to $0$. To this end we need to describe briefly the
387: background of the techniques of \cite{BC06}.
388:
389: The transition rates of the process $X$ do not depend on the energy of the
390: target vertex $y$. Therefore, the process $X_n$ is a time change of the simple
391: random walk $Y_n$, and can be written as
392: $X_n(t)=Y_n(S^{-1}_n(t))$, where
393: \begin{equation}
394: S_n(k)=\sum_{i=0}^{k-1} e_i \exp(\beta \sqrt n E_{Y_n(i)}),
395: \end{equation}
396: $S^{-1}_n$ is the generalised right-continuous inverse of $S_n(k)$, and
397: $e_i$ is a sequence of mean-one i.i.d.~exponential random variables.
398: $S_n(k)$ is the time of the $k^{\text {th}}$ jump of $X_n$.
399:
400: It was proved in \cite{BC06} that $S(n)$ behaves (for $n$ large) as an
401: $\alpha $-stable subordinator in certain time and temperature regimes. To
402: be more precise, set $r(n)=\exp(\alpha^2 \beta^2 n /2)$. Then, if
403: \eqref{e:bc} holds then $t(n)^{-1}S_n(\lfloor r(n) \,\cdot\,\rfloor)$
404: converges in distribution to an $\alpha $-stable subordinator.
405:
406: It is a known fact that the value of a stable subordinator at time $t$ can
407: be approximated by the finite sum of its largest jumps up to this time.
408: The same is true for $S_n$: the main contribution to $S_n(r(n))$ comes
409: from a finite number of visits to sites with
410: $\exp(\beta \sqrt n E_x)\asymp t(n)$. Such sites form (due to the
411: i.i.d.~property of the energies) a percolation cloud. To understand
412: properties of $S_n$ it is therefore necessary to understand how the simple
413: random walk visits such clouds.
414:
415: As we have already remarked, the methods of \cite{BC06} are sufficient to
416: show Theorem~\ref{t:perc} for densities much smaller than $2^{-3n/4}$.
417: The constant $3/4$ in the exponent entails the constant $3/4$ in \eqref{e:bc}. The
418: result of our Theorem~\ref{t:perc} thus allows to extend the domain of
419: validity of Theorem 3.1 in \cite{BC06}. As we find this extension important
420: we state it here:
421:
422: \begin{theorem}
423: For $\alpha \in (0,1)$ let $t_w(n)$ and $r(n)$ be as above. If
424: \begin{equation}
425: \label{e:new}
426: 0 < \alpha \beta < \beta_c,
427: \end{equation}
428: then $t_w(n)^{-1}S_n(r(n)\,\cdot\,)$ converges in distribution to an
429: $\alpha $-stable subordinator and the two-point function $R_n$ exhibits
430: aging. Namely, for a.e.~realisation of
431: $\boldsymbol E$ and for every $\theta\in (0,\infty)$
432: \def\Asl{\mathop{\mathsf{Asl}}\nolimits}
433: \begin{equation}
434: \lim_{n\to\infty} R_n(t_w(n),(1+\theta ) t_w(n);\boldsymbol E)=\Asl_\alpha (1/1+\theta ),
435: \end{equation}
436: where $\Asl_\alpha (u)$ stands for the distribution function of the
437: generalised arcsine law with parameter $\alpha $,
438: $ \Asl_\alpha (z):= \pi^{-1}\sin \alpha \pi \int_0^{z}u^{\alpha
439: -1}(1-u)^{-\alpha }\,\d u$.
440: \end{theorem}
441:
442: It is worth noting that for any $\beta<\infty $ there exists $\alpha \in (0,1)$
443: such that \eqref{e:new} is satisfied. This implies that aging occurs in the
444: RHT dynamics of the REM at all temperatures.
445:
446: \smallskip
447:
448: Theorem~\ref{t:perc} allows further to study the RHT dynamics in
449: another interesting time-temperature regime where denser percolation
450: clouds should be considered: for
451: $t_w(n)=(\beta \sqrt{2 \pi n})^{-1}\exp(\beta^2 n)$, that is for
452: $\alpha =1$. It is argued in the physics literature \cite{BB02b}
453: that the two-point function $R_n$ exhibits some interesting ultrametric
454: behaviour in this case. The rigorous treatment of this problem was the
455: original motivation behind this paper and will be subject of
456: a forthcoming paper.
457:
458:
459:
460: %%%% END OF ht4.tex %%%%
461:
462: %%%% START OF ht2.tex %%%%
463:
464: \section{Proof of Theorem~\ref{t:gen}}
465: \label{s:prop}
466:
467: As mentioned earlier Theorem~\ref{t:gen} is already known for small sets, namely when
468: $m(n)={2^n}/{M}$ for $M$ a fixed integer (see \cite{BG06}, Corollary 1.8). Although
469: our method of proof clearly allows us to cover this case as well,
470: its treatment is, in places, quite different from the case $n \log
471: n \ll m(n) \ll 2^n$. Thus, in order to keep this paper as concise as
472: possible we will prove Theorem~\ref{t:gen} in that
473: latter case only and assume from now on that $n \log n \ll
474: m(n) \ll 2^n$.
475:
476: \begin{proof}[Proof of Theorem~\ref{t:gen} (for $n \log n \ll m(n) \ll 2^n$)]
477: We have
478: \begin{equation}
479: \mathbb P_x[H_n(B_n\setminus x)\ge a m(n)]
480: =
481: 1- \mathbb P_x\Big[\bigcup_{y\in B_n\setminus x}
482: H_n(y)< a m(n)\Big].
483: \end{equation}
484: Therefore,
485: it follows from the inclusion-exclusion principle that
486: for all even $\ell\in \mathbb N$
487: \begin{equation}
488: \mathbb P_x[H_n(B_n\setminus x)\ge a m(n)]
489: \le
490: 1+\sum_{i=1}^\ell \frac {(-1)^i} {i!}
491: \psum_{y_1,\dots,y_i\in B_n\setminus x}
492: \mathbb P_x\Big[\bigcap_{j=1}^i H_n(y_j)< a m(n)\Big],
493: \end{equation}
494: where $\psum$ denotes the sum over all mutually different $y$'s.
495: Analogous expressions for~$\ell$ odd give lower bounds.
496:
497: The following proposition is the
498: key step of the proof.
499: \begin{proposition}
500: \label{p:sum}
501: Under the assumptions of Theorem~\ref{t:gen}
502: and assuming that $n \log n \ll m(n) \ll 2^n$ we have,
503: for all $i\in \mathbb N$,
504: \begin{equation}
505: \lim_{n\to\infty}
506: \max_{x\in \mathcal V_n}
507: \bigg|
508: \psum_{y_1,\dots,y_i\in B_n\setminus x}
509: \mathbb P_x\Big[\bigcap_{j=1}^i H_n(y_j)< a m(n)\Big]
510: -a^i\bigg| =0.
511: \end{equation}
512: \end{proposition}
513: Using Proposition~\ref{p:sum} the completion of the proof of Theorem~\ref{t:gen}
514: under the assumption that $n \log n \ll m(n) \ll 2^n$ is immediate.
515: \end{proof}
516:
517:
518: The proof of Proposition~\ref{p:sum} relies on several technical
519: lemmas which we collect in the subsection below. The proof of
520: Proposition~\ref{p:sum} is then concluded in subsection
521: \ref{s:propproved}.
522:
523: \subsection{Preparatory Lemmas}
524: \label{s:preplem}
525:
526: Our first lemma collects the properties of the function $\xi_n(k)$
527: which will be needed later.
528:
529: \begin{lemma}
530: \label{l:xi}
531: (i) For all $k\in \{1,\dots,n\}$,
532: $ \xi_n(k) \le K \binom n k^{-1} n^{1/2}\log n$, where $K$ is a constant
533: independent of $n$ and $k$.
534:
535: (ii) For all $k\le n/2$, $ \xi_n(k)\ge \frac 1 2 \binom n k^{-1}$.
536:
537: (iii) For any fixed $n$ the function $\xi_n(k)$ is decreasing in $k$.
538:
539: (iv) If $k=o(n)$, then $ \xi_n(k)=\binom n k^{-1}(1+o(1))$.
540: \end{lemma}
541: \begin{proof}
542: (i) Recall that
543: $\xi_n(k)= 2^{-n}\frac n 2 \binom n k^{-1}\sum_{j=1}^{n-k}\binom n {j+k} \frac 1 j$.
544: From a standard moderate deviations argument it follows that
545: \begin{equation}
546: 2^{-n} \frac n 2 \sum_{\substack{j=0\\|j-n/2|\ge n^{7/12}}}^n \binom n j
547: \le c_1 \frac n 2 e^{-c_2 n^{1/6}}.
548: \end{equation}
549: Therefore, the contribution of $j$'s with $|k+j-n/2|\ge n^{7/12}$ is
550: $o(\binom n k^{-1})$. For the remaining $j$'s we use the approximation
551: \begin{equation}
552: \label{e:binomass}
553: \binom n {n/2+i} = \sqrt{\frac 2 \pi } n^{-1/2}2^n
554: e^{-2 i^2/n}(1+o(1)),
555: \end{equation}
556: which is valid uniformly for $i=o(n^{2/3})$. Setting
557: $a=n^{-1/2}(k+j-n/2)$ and $b=n^{-1/2}(k-n/2)$, and thus $j=n^{1/2}(a-b)$,
558: the contribution of the $j$'s with $|k+j-n/2|\le n^{7/12}$ to
559: $\xi_n(k)$ equals
560: \begin{equation}
561: \begin{split}
562: \label{e:gausapprox}
563: 2^{-n}\frac n2&
564: \binom n k^{-1}
565: \sum_{\substack{a\in [-n^{1/12},n^{1/12}]\cap (\mathbb Z/\sqrt n)\\
566: a\ge b+n^{-1/2}}}
567: \binom n {a \sqrt n + n/2} \frac 1 {(a-b)\sqrt n}
568: \\&\le K'
569: n^{1/2}
570: \binom n k^{-1}
571: \int_{( b+ n^{-1/2})\vee-n^{1/12}}^{n^{1/12}}
572: e^{-2x^2}\frac {\d x} {(x-b)}
573: \le
574: K
575: \binom n k^{-1}
576: n^{1/2}\log n.
577: \end{split}
578: \end{equation}
579:
580: (ii) For $k<n/2$,
581: $\xi_n(k)\ge 2^{-n}\frac n 2 \binom n k^{-1} \sum_{j=1}^{n/2}\binom n {k+j}\frac 2 n
582: \ge \frac 1 2 \binom n k^{-1}$.
583:
584: (iii) The function $\xi_n$ can be rewritten as
585: \begin{equation}
586: \label{e:secondform}
587: \xi_n(k)=
588: 2^{-n}\frac n2
589: \sum_{j=1}^{n-k}
590: \binom{n-k} j
591: \binom{k+j} j^{-1}
592: \frac 1 j .
593: \end{equation}
594: Here, the fact that $\xi_n(k)$ is decreasing is apparent.
595:
596:
597: (iv) Using again the moderate deviations argument, for $k=o(n)$,
598: \begin{equation}
599: \xi_n(k)=2^{-n}\frac n 2\binom n k^{-1}
600: \sum_{j=n/2-n^{7/12}-k}^{n/2+n^{7/12}-k}\binom n {j+k} \frac
601: 1 j(1+o(1))=\binom n k^{-1}(1+o(1)).
602: \end{equation}
603: This completes the proof of Lemma~\ref{l:xi}.
604: \end{proof}
605:
606: We now prove that for $m(n)$ in the range considered in Theorem~\ref{t:gen}
607: there always exists a function $g(n)$ with $g(n)\le n/2$ satisfying \eqref{e:gg}.
608: We in fact prove a little more:
609: \begin{lemma}
610: Let $m(n)$ be such that $n \log n \ll m(n)\ll 2^n$. Then there exist
611: function $g$ such that
612: \begin{equation}
613: \label{e:g}
614: n/2-g(n)\gg n^{1/2}
615: \qquad\text{and}\qquad
616: \xi_n(g(n))\ll 2^{-n}m(n).
617: \end{equation}
618: \end{lemma}
619: \begin{proof}
620: Take $m'(n)$ such that $n\log n \ll m'(n)\ll m(n)$ and define $g(n)$ by
621: \begin{equation}
622: g(n)=\min\{ k: \xi_n(k)\le 2^{-n}m'(n)\}.
623: \end{equation}
624: Such $g(n)$ satisfies the second half of \eqref{e:g} by definition.
625: By Lemma~\ref{l:xi}(i)
626: \begin{equation}
627: g(n)\le
628: \min\Big\{k: K \binom n k^{-1} n^{1/2} \log n\le 2^{-n}m'(n)\Big\}
629: \end{equation}
630: Write $k=n/2-i$. Using formula \eqref{e:binomass} for the binomial
631: coefficient we find that
632: \begin{equation}
633: g(n)\le \frac n2
634: +\min\Big\{i: e^{2i^2/n}\le \frac {c m'(n)}{n\log n}\Big\}.
635: \end{equation}
636: Since $m'(n)\gg n\log n$ there exists $i_0$, $n^{2/3}\gg i_0\gg n^{1/2}$, for
637: which the inequality in braces holds. Since $n/2-g(n)\ge i_0$, the first
638: half of \eqref{e:g} is proved.
639: \end{proof}
640:
641:
642: Let us assume from now on that $g(n)$ satisfies \eqref{e:g}. As
643: announced earlier, the key ingredient of the proof of
644: Proposition~\ref{p:sum} is a precise estimate on the Laplace
645: transform of the hitting time of a single point. We now state and
646: prove this result. As we will see, this is where the function
647: $\xi_n$ comes in.
648:
649:
650: \begin{lemma}
651: \label{l:laplace}
652: Let $\boldsymbol 0$ be the vertex
653: of the hypercube with all coordinates equal to $0$ and
654: let $z_k\in \mathcal V_n$ be an arbitrary
655: vertex of the hypercube such that $d(z_k,\boldsymbol 0)=k$. If
656: $n\log n \ll m(n) \ll 2^n$, then for all $s>0$
657: \begin{equation}
658: \mathbb E_{z_k}\exp\Big(-\frac s {m(n)} H_n(\boldsymbol 0)\Big)=
659: \Big[2^{-n}\frac {m(n)} s + \xi_n(k)\Big](1+o(1)).
660: \end{equation}
661: \end{lemma}
662: \begin{proof}
663: By Fourier methods for random walks on finite groups \cite{Dia88}, we
664: have as in \cite{Mat89,BC06}
665: \begin{equation}
666: \label{e:furt}
667: \mathbb E_{z_k} e^{-s/m H_n(\boldsymbol 0)}=
668: \frac
669: {\sum_{y\in \mathcal V_n}(-1)^{z_k\cdot y}\big[1-e^{-s/m }
670: (1-\frac {2 d(y,\boldsymbol 0)}{n})\big]^{-1}}
671: {\sum_{y\in \mathcal V_n}\big[1-e^{-s/m}
672: (1-\frac {2 d(y,\boldsymbol 0)}{n})\big]^{-1}},
673: \end{equation}
674: where $x\cdot y = \sum_{i=1}^n x(i) y(i)$ is the standard scalar
675: product in $\mathbb R^n$.
676:
677: Let us first consider the numerator of \eqref{e:furt}. Observe that there are
678: $\tbinom k i \tbinom {n-k} j$ sites $y\in \mathcal V_n$ such that
679: $d(0,y)=i+j$ and $z_k\cdot y = i$. Hence the numerator of \eqref{e:furt} is equal to
680: \begin{equation}
681: \sum_{i=0}^k \sum_{j=0}^{n-k}
682: (-1)^i \binom k i \binom {n-k} j
683: \frac 1 {1-e^{-s/m}(1-2n^{-1}(i+j))}.
684: \end{equation}
685: This expression can be simplified using the following lemma.
686: \begin{lemma}
687: For all $k,j\in \{0,1,\dots\}$ and all $s>0$
688: \begin{equation}
689: \label{e:betaint}
690: \sum_{i=0}^k
691: \frac{(-1)^i \binom k i}
692: {1-e^{-s/m}(1-2n^{-1}(i+j))}
693: =
694: \frac {ne^{s/m}} 2 \cdot
695: \frac{\Gamma (1+k)\Gamma(j+\frac n2(e^{s/m}-1))}
696: {\Gamma(1+k+j+\frac n2 (e^{s/m}-1))}
697: .
698: \end{equation}
699: \end{lemma}
700: \begin{proof}
701: Note that the second fraction on the right-hand side of
702: \eqref{e:betaint} can be expressed using the Beta-integral,
703: \begin{equation}
704: \frac {ne^{s/m}} 2 \cdot
705: \frac{\Gamma (1+k)\Gamma(j+\frac n2(e^{s/m}-1))}
706: {\Gamma(1+k+j+\frac n2 (e^{s/m}-1))}
707: =
708: \frac {ne^{s/m}} 2
709: \int_0^1
710: (1-t)^k t^{i+\frac n2 (e^{s/m}-1)-1}\,\d t.
711: \end{equation}
712: Expanding $(1-t)^k$ according to the binomial theorem, and performing an easy
713: integration then gives the left-hand side of \eqref{e:betaint}.
714: \end{proof}
715:
716: Using the last lemma the numerator of \eqref{e:furt} can be rewritten as
717: \begin{equation}
718: \label{e:num2}
719: \sum_{j=0}^{n-k}
720: \binom {n-k} j
721: \frac {ne^{s/m}} 2 \cdot
722: \frac{\Gamma (1+k)\Gamma(j+\frac n2(e^{s/m}-1))}
723: {\Gamma(1+k+j+\frac n2 (e^{s/m}-1))}.
724: \end{equation}
725:
726: So far we obtained an exact expression which we now want to evaluate. To do so
727: we will use the following known properties of $\Gamma $-functions
728: (we refer to \cite{AS72} for the definition and the properties of the functions
729: appearing below).
730: \begin{equation}
731: \Gamma '(x)=\Gamma (x) \psi _0(x),\qquad
732: \Gamma''(x)=\Gamma (x)(\psi _0(x)^2+\psi _1(x)),\qquad
733: \lim_{x\to 0}x\Gamma (x)=1,
734: \end{equation}
735: where $\psi _0$ is the digamma function and $\psi _1=\psi_0'$.
736: The values of these functions for integer arguments can be written explicitly:
737: \begin{equation}
738: \psi _0(k)=\sum_{i=1}^{k-1} \frac 1 i - \gamma_E = \log k(1+o(1)), \qquad
739: \psi _1(k)=\frac {\pi^2} 6- \sum_{i=1}^{k-1} \frac 1 {i^2}=O(1/k).
740: \end{equation}
741: where $\gamma_E$ is Euler's constant.
742:
743: We can now evaluate \eqref{e:num2}. Set $\varepsilon = \frac n2(e^{s/m}-1)$
744: and observe that the bound $m\gg n\log n$ entails $\varepsilon \ll (\log n)^{-1}$.
745: To treat the term $j=0$ in \eqref{e:num2} simply note that,
746: since $\varepsilon \psi_0(1+k)=o(1)$ for all $k\le n$,
747: \begin{equation}
748: \label{e:blams}
749: \frac {ne^{s/m}} 2 \cdot
750: \frac{\Gamma (1+k)\Gamma(\varepsilon )}
751: {\Gamma(1+k+\varepsilon )}
752: =
753: \frac {ne^{s/m}} 2 \cdot
754: \frac {\varepsilon^{-1}(1+o(1))}
755: {1+\varepsilon \psi _0(1+k)}=\frac m s
756: (1+o(1)).
757: \end{equation}
758: A similar calculation for the remaining terms combined with
759: \eqref{e:secondform} readily yields
760: \begin{equation}
761: \begin{split}
762: \label{e:blaxi}
763: \sum_{j=1}^{n-k}&
764: \binom {n-k} j
765: \frac {ne^{s/m}} 2 \cdot
766: \frac{\Gamma (1+k)\Gamma(j+\varepsilon )}
767: {\Gamma(1+k+j+\varepsilon )}
768: \\&=
769: \sum_{j=1}^{n-k}
770: \frac n {2j} \binom {n-k} j {\binom{k+j} j}^{-1}
771: \frac {(1+\varepsilon \psi _0(j))(1+o(1))}
772: {1+\varepsilon \psi _0(1+k+j)}
773: =2^n \xi_n(k)(1+o(1)).
774: \end{split}
775: \end{equation}
776:
777: For $k=0$ the numerator of \eqref{e:furt} coincides with the denominator.
778: Equations \eqref{e:blams}, \eqref{e:blaxi} and Lemma~\ref{l:xi}(iv)
779: then imply that the denominator behaves like
780: \begin{equation}
781: \label{e:bladem}
782: \{ms^{-1} + 2^n \xi_n(0)\}(1+o(1))=2^n(1+o(1)).
783: \end{equation}
784: Finally, putting together \eqref{e:blams}, \eqref{e:blaxi} and \eqref{e:bladem}
785: yields the claim of Lemma~\ref{l:laplace}.
786: \end{proof}
787:
788:
789: Lemma~\ref{l:laplace} now allows us to get information on the form
790: of the probability distribution function of $H_n(\boldsymbol 0)$.
791: Let us denote by $p_n(a,k)$ the probability
792: \begin{equation}
793: p_n(a,k)=\mathbb P_{z_k}[H_n(\boldsymbol 0)< a m(n)].
794: \end{equation}
795:
796: \begin{lemma}
797: \label{l:twopoint}
798: (i) There exists $C<\infty$ independent of $n$ and $k$ such that
799: \begin{equation}
800: \label{e:pnupperbound}
801: p_n(a,k)\le Ce^a\big(2^{-n}m(n) + \xi_n(k)\big).
802: \end{equation}
803:
804: (ii) For
805: any $a\in [0,\infty)$, uniformly on compact subsets of this interval,
806: \begin{equation}
807: \lim_{n\to\infty}
808: \max_{k\ge g(n)}
809: \big |2^n m(n)^{-1}p_n(a,k)-a\big|=0.
810: \end{equation}
811: \end{lemma}
812:
813: \begin{proof}
814: Assertion (i) follows from Chebyshev
815: inequality and Lemma~\ref{l:laplace}. To prove (ii) observe
816: that
817: for $k\ge g(n)$, by \eqref{e:g} and by Lemma~\ref{l:xi}(iii),
818: $\xi_n(k)\le \xi_n(g(n))\ll 2^{-n}m(n)$. Therefore,
819: $\mathbb E_{z_k}\exp(-s H_n(\boldsymbol 0)/m(n))=2^{-n}m(n)/s(1+o(1))$.
820: Consider the sequence of measures $\mu_n$ given by
821: \begin{equation}
822: \mu_n([0,t])= 2^n m(n)^{-1} \mathbb P_{z_k}\big[H_n(\boldsymbol 0)/m(n)\in
823: [0,t]\big].
824: \end{equation}
825: The Laplace transform of $\mu_n$ then satisfies
826: $ \int_0^\infty e^{-s t}\mu_n(\d t)\xrightarrow{n\to\infty} 1/s$.
827: Therefore, $\mu_n$ converges weakly to the Lebesgue measure (see
828: \cite{Fel71}, Section XIII.1, Theorem 2a, p. 433) and thus
829: \begin{equation}
830: 2^n m(n)^{-1}p_n(a,k)=\mu_n([0,a))\xrightarrow{n\to\infty} a.
831: \end{equation}
832: The uniformity on compact sets follows easily from the fact that
833: the probabilities $p_n(a,k)$ are increasing in $a$.
834: \end{proof}
835:
836: We finally use Lemma~\ref{l:twopoint} to get information on the
837: form of the probability distribution function of the hitting time
838: of finite subsets of points of $\mathcal V_n$. For
839: $y_1,\dots,y_\ell\in \mathcal V_n$ we define $\bar
840: H_n(y_1,\dots,y_\ell)$ by
841: \begin{equation}
842: \bar H_n(y_1,\dots,y_\ell)=
843: \begin{cases}
844: H_n(y_\ell)&\text{if $H_n(y_i)<H_n(y_{i+1})$ for all
845: $i\in \{1,\dots,\ell -1\}$},\\
846: \infty&\text{otherwise}.
847: \end{cases}
848: \end{equation}
849: That is $\bar H_n(y_1,\dots,y_\ell)$ is finite only if the $y$'s were
850: visited in the prescribed order. In this case it is equal to the time to
851: visit all $y$'s. Observe that it is always infinite if $y_i=y_j$ for some
852: $i\neq j$.
853:
854: \begin{lemma}
855: \label{l:coverbad}
856: Let $x=y_0$, let $y_1,\dots,y_\ell$ be mutually distinct points in
857: $\mathcal V_n$, and let $d(i)=d(y_{i-1},y_i)$. Then
858: \begin{equation}
859: \label{e:coverbad}
860: \mathbb P_x\big[\bar H_n(y_1,\dots,y_\ell)<am(n)\big]
861: \le
862: C^\ell e^{\ell a} \prod_{i=1}^\ell \big(2^{-n}m(n)+ \xi_n(d(i))\big).
863: \end{equation}
864: \end{lemma}
865:
866: \begin{proof}
867: Obviously, by the strong Markov property,
868: \begin{equation}
869: \mathbb P_x\big[\bar H_n(y_1,\dots,y_\ell)<am(n)\big] \le
870: \prod_{i=1}^\ell
871: \mathbb P_{y_{i-1}}[H_n(y_i)< am(n)],
872: \end{equation}
873: which, by Lemma~\ref{l:twopoint}(i), is bounded by the right-hand side of
874: \eqref{e:coverbad}.
875: \end{proof}
876:
877: \begin{lemma}
878: \label{l:covergood}
879: Let $x^n=y_0^n$, let $y_1^n,\dots,y_\ell^n\in \mathcal V_n$, and let
880: $d_n(i)=d(y^n_i,y^n_{i-1})$. Suppose that $d_n(i)\ge g(n)$ for
881: all $i\in\{1,\dots,\ell\}$ and all $n$.
882: Then, uniformly over all $x^n$ and $y^n_i$,
883: \begin{equation}
884: \lim_{n\to\infty}
885: 2^{\ell n} m(n)^{-\ell}
886: \mathbb P_{x^n}\big[\bar H_n(y_1^n,\dots,y_\ell^n)
887: < a m(n)\big] =
888: \frac {a^\ell}{\ell !}.
889: \end{equation}
890: \end{lemma}
891:
892: \begin{proof}
893: The probability in question can be bounded from above using the strong Markov
894: property,
895: \begin{equation}
896: \label{e:iiint}
897: \mathbb P_{x^n}\big[\bar H_n(y_1^n,\dots,y_\ell^n)
898: < a m(n)\big] \le
899: \sum_{\substack{a_1,\dots, a_\ell\in \mathbb N/m(n)
900: \\ a_1+\dots+a_\ell < a}}
901: \prod_{i=1}^\ell
902: \mathbb P_{y_{i-1}}[H_n(y_i)/m(n)= a_i].
903: \end{equation}
904: Since $d(y_{i-1},y_i)>g(n)$, it is easy to
905: see from Lemma~\ref{l:twopoint}(ii) that the sum behaves like
906: \begin{equation}
907: 2^{-n\ell} m(n)^{\ell}
908: \idotsint\limits_{x_1+\dots+x_\ell < a}
909: \d x_1\dots\d x_\ell
910: (1+o(1))
911: =
912: 2^{-n\ell} m(n)^{\ell}
913: a^\ell / \ell !
914: (1+o(1)).
915: \end{equation}
916:
917: The above expression only provides an upper bound since it does not
918: exclude the possibility that the random walk visits another $y_j$, $j>i$, on its way from
919: $y_{i-1}$ to $y_i$.
920: To construct a lower bound we should exclude such
921: visits. Therefore, denoting by $\mathrm{UB}$ the upper
922: bound~\eqref{e:iiint},
923: \begin{equation}
924: \begin{split}
925: \mathbb P_{x^n}&\big[\bar H_n(y_1^n,\dots,y_\ell^n)
926: < a m(n)\big]
927: \\
928: &\ge \mathrm{UB}-
929: \sum_{i=1}^\ell
930: \sum_{j=i+1}^\ell
931: \sum_{\substack{a_1,\dots, a_\ell\in \mathbb N/m(n)
932: \\ a_1+\dots+a_\ell < a}}
933: \prod_{\substack{k=1\\k\neq i}}^\ell
934: \mathbb P_{y_{k-1}}[H_n(y_k)/m(n)= a_k]
935: \\&\quad\times
936: \sum_{\substack{b\in \mathbb N/m(n)\\ b\in(0,a_i-a_{i-1})}}
937: \mathbb P_{y_{i-1}}[H_n(y_j)/m(n)= b]
938: \mathbb P_{y_{j}}[H_n(y_i)/m(n)= a_i-b].
939: \end{split}
940: \end{equation}
941: The negative term on the right-hand side is smaller than
942: \begin{equation}
943: C2^{-n(\ell-1)} m(n)^{\ell-1}
944: e^{(\ell-1)a}
945: \sum_{i=1}^\ell
946: \sum_{j=i+1}^\ell
947: \mathbb P_{y_{i-1}}[H_n(y_j)< am(n)]
948: \mathbb P_{y_{j}}[H_n(y_i)<am(n) ].
949: \end{equation}
950: Hence, if we can show that for all $j\neq i$
951: \begin{equation}
952: \label{e:smalll}
953: \mathbb P_{y_{i-1}}[H_n(y_j)< am(n)]
954: \mathbb P_{y_{j}}[H_n(y_i)<am(n) ] = o(2^{-n}m(n)),
955: \end{equation}
956: then the proof of Lemma~\ref{l:covergood} is finished.
957:
958: Let $k=d(y_{i-1},y_j)$ and $l=d(y_j,y_i)$.
959: Since $d(y_{i-1},y_i)\ge g(n)$ we have also $k+l\ge g(n)$. Obviously
960: $k\ge 1$, $l\ge 1$. By Lemma~\ref{l:twopoint}(i),
961: \begin{equation}
962: \begin{split}
963: \mathbb P_{y_{i-1}}&[H_n(y_j)< am(n)]
964: \mathbb P_{y_{j}}[H_n(y_i)<am(n) ]
965: \\&\le
966: C^2e^{2a}\big(2^{-n}m(n)+ \xi_n(k)\big).
967: \big(2^{-n}m(n)+ \xi_n(l)\big)
968: \\&\le
969: C^2e^{2a}\big(2^{-2n}m(n)^{2}+ 2^{-n}m(n) 2\xi_n(1)+\xi_n(k)\xi_n(l)\big).
970: \end{split}
971: \end{equation}
972: The first two summands are $o(2^{-n}m(n))$ (see Lemma~\ref{l:xi}(iii,iv)).
973: If $k$ or $l$ is larger than $g(n)$, then the same is valid for the
974: third one. As the last step of the proof we show that if
975: $\max \{k,l\}< g (n)$ and $k+l\ge g(n)$, then for any $\varepsilon >0$ and
976: $n$ large enough
977: \begin{equation}
978: \label{e:submult}
979: \xi_n(k)\xi_n(l)\le \varepsilon 2^{-n} m(n).
980: \end{equation}
981: Let $z_{k+l}$ be as in Lemma~\ref{l:laplace}
982: and let $z_k$ be any point such that $d(z_k,\boldsymbol 0)=k$ and
983: $d(z_k,z_{k+l})=l$. Since on the way from $z_{k+l}$ to $\boldsymbol 0$,
984: the random walk may pass through $z_k$ we have
985: \begin{equation}
986: \mathbb E_{z_{k+l}}[e^{-sH_n(\boldsymbol 0)}]
987: \ge
988: \mathbb E_{z_{k+l}}[e^{-sH_n(z_k)}]
989: \mathbb E_{z_{k}}[e^{-sH_n(\boldsymbol 0)}].
990: \end{equation}
991: Lemma~\ref{l:laplace} then yields
992: \begin{equation}
993: \big\{2^{-n}\frac {m(n)} s + \xi_n(k+l)\big\}(1+o(1))
994: \ge
995: \big\{2^{-n}\frac {m(n)} s + \xi_n(k)\big\}
996: \big\{2^{-n}\frac {m(n)} s + \xi_n(l)\big\}
997: \ge \xi_n(k)\xi_n(l).
998: \end{equation}
999: Since $k+l\ge g(n)$ we can, in view of \eqref{e:g}, ignore the term
1000: $\xi_n(k+l)$ on the left-hand side. Taking $s$ sufficiently large then
1001: proves \eqref{e:submult}. This concludes the proof of the lemma.
1002: \end{proof}
1003:
1004: We are now ready to complete the proof of Proposition~\ref{p:sum}.
1005:
1006: \subsection{Proof of Proposition~\ref{p:sum}}
1007: \label{s:propproved}
1008:
1009: We shall establish that
1010: \begin{equation}
1011: \lim_{n\to\infty}
1012: \max_{x\in \mathcal V_n}
1013: \bigg|
1014: \psum_{y_1,\dots,y_i\in B_n\setminus x}
1015: \mathbb P_x\Big[\bigcap_{j=1}^i H_n(y_j)< a m(n)\Big]
1016: -a^i\bigg| =0.
1017: \end{equation}
1018: Observe that, with $y_0=x$,
1019: \begin{equation}
1020: \begin{split}
1021: \label{e:longsum}
1022: \psum_{y_1,\dots,y_i\in B_n\setminus x}
1023: \mathbb P_x\Big[&\bigcap_{j=1}^i H_n(y_j)< a m(n)\Big]
1024: =
1025: i!
1026: \psum_{y_1,\dots,y_i\in B_n\setminus x}
1027: \mathbb P_x\big[\bar H_n(y_1,\dots,y_i)< a m(n)\big]
1028: \\&=
1029: i! \sum_{d_1,\dots,d_i=1}^n
1030: \psum_{\substack{y_1,\dots,y_i\in B_n\\d(y_i,y_{i-1})=d_i}}
1031: \mathbb P_x\big[\bar H_n(y_1,\dots,y_i)< a m(n)\big].
1032: \end{split}
1033: \end{equation}
1034: Consider first the summation over distances larger than $g(n)$. Using
1035: Lemma~\ref{l:covergood} we get that (uniformly in the starting position $x$)
1036: \begin{equation}
1037: \begin{split}
1038: i! \sum_{d_1,\dots,d_i=g(n)}^n&
1039: \psum_{\substack{y_1,\dots,y_i\in B_n\\d(y_i,y_{i-1})=d_i}}
1040: \mathbb P_x\big[\bar H_n(y_1,\dots,y_i)< a m(n)\big]
1041: \\&=
1042: \sum_{d_1,\dots,d_i=g(n)}^n
1043: \psum_{\substack{y_1,\dots,y_i\in B_n\\d(y_i,y_{i-1})=d_i}}
1044: 2^{-in} m(n)^i a^i(1+o(1))
1045: =a^i(1+o(1)).
1046: \end{split}
1047: \end{equation}
1048: For the second equality we used the fact that by \eqref{e:Bsize},
1049: \eqref{e:vcond} and the finiteness of $i$ there are
1050: $2^n m(n)^{-1}(1+o(1))$ choices for every $y_i$.
1051:
1052: To estimate the remaining contribution to \eqref{e:longsum},
1053: we first bound the sum
1054: \begin{equation}
1055: \label{e:smalls}
1056: \sum_{d=1}^{g(n)-1}
1057: \sum_{y\in B_n:d(y,x)=d}
1058: C e^a \big(2^{-n}m(n) + \xi_n(d)\big)
1059: \le
1060: \sum_{d=1}^{g(n)-1} v_n(d)\xi_n(d)
1061: +V_n(g(n)-1) 2^{-n}m(n).
1062: \end{equation}
1063: Both summands of in the last formula converge to $0$ which can be seen
1064: easily from \eqref{e:vcond} and \eqref{e:Bsize}. For $d>g(n)$,
1065: by Lemma~\ref{l:xi}(iii), $\xi_d(n)\ll 2^{-n}m(n)$.
1066: Therefore, using \eqref{e:Bsize}, for all $n$ large enough
1067: \begin{equation}
1068: \label{e:bigs}
1069: \sum_{d=1}^{n}
1070: \sum_{y\in B_n:d(y,x)=d} C e^a\big(2^{-n}m(n) + \xi_n(d)\big)
1071: \le 2Ce^a|B_n|2^{-n}m(n)\le 4 C e^a.
1072: \end{equation}
1073: According to Lemma~\ref{l:coverbad}, the remaining
1074: part of \eqref{e:longsum} then satisfies
1075: \begin{equation}
1076: \begin{split}
1077: i!& \sum_{\substack{d_1,\dots,d_i=1\\\exists d_i\le g(n)}}^n
1078: \psum_{\substack{y_1,\dots,y_i\in B_n\\d(y_i,y_{i-1})=d_i}}
1079: \mathbb P_x\big[\bar H_n(y_1,\dots,y_i)< a m(n)\big]
1080: \\&\le
1081: i!i
1082: \sum_{d_1=1}^{g(n)}
1083: \sum_{\substack{y\in B_n\\d(y,x)=d_1}}
1084: C a \big(2^{-n}m(n)+ \xi_n(d_1)\big)
1085: \bigg( \sum_{d=1}^{n}
1086: \sum_{\substack{y\in B_n\\d(y,x)=d}}
1087: C a\big(2^{-n}m(n)+ \xi_n(d)\big)
1088: \bigg)^{i-1},
1089: \end{split}
1090: \end{equation}
1091: which converges to $0$ by \eqref{e:smalls} and \eqref{e:bigs}. This
1092: finishes the proof of Proposition~\ref{p:sum}.
1093: %%%% END OF ht2.tex %%%%
1094:
1095: %%%% START OF ht3.tex %%%%
1096: \section{Proof of Theorem~\ref{t:perc} and of Theorem~\ref{t:sampling}}
1097: \label{s:equiv}
1098:
1099:
1100: In this section we apply Theorem~\ref{t:gen} to derive the
1101: asymptotic hitting distribution of randomly chosen sets in two
1102: different settings: for random clouds (namely we prove
1103: Theorem~\ref{t:perc}) and in the setting of drawing without
1104: replacement (which is Theorem~\ref{t:sampling}).
1105:
1106:
1107: \begin{proof}[Proof of Theorem~\ref{t:perc}]
1108: To prove Theorem~\ref{t:perc} we will naturally show that the
1109: assumptions of
1110: Theorem~\ref{t:gen} are satisfied for percolation clouds $A_n$ of
1111: density $\bar m(n)^{-1}$, where $n\log n\ll \bar m(n)\ll 2^n(\log n)^{-1}$.
1112:
1113: We first verify condition \eqref{e:Bsize}, i.e.~that
1114: $P$-a.s.~$|A_n|=2^n \bar m(n)^{-1}(1+o(1))$.
1115: By Chebyshev exponential inequality, for any $\delta >0$
1116: with $\lambda >0$,
1117: \begin{equation}
1118: \begin{split}
1119: \label{e:cheb}
1120: P[|A_n|\gtrless(1\pm \delta )2^nn^{-1}]
1121: &\le
1122: \exp\{\mp\lambda (1\pm \delta )2^n\bar m(n)^{-1}\}
1123: (1+(e^{\pm \lambda }-1)\bar m(n)^{-1})^{2^n}
1124: \\&\le
1125: \exp\{\mp\lambda (1\pm \delta )2^n\bar m(n)^{-1} +
1126: (e^{\pm \lambda }-1)2^n\bar m(n)^{-1}\}.
1127: \end{split}
1128: \end{equation}
1129: Taking $\lambda $ sufficiently small and using the fact that
1130: $2^n\bar m(n)^{-1}\gg \log n$, we see that the right-hand side of the last
1131: equation is summable. Borel-Cantelli lemma then yields the result.
1132:
1133: Let $f_n(k)=n(\log \bar m(n))^{-1}\bbone\{k=1\} + n\bbone\{k>1\}$. To prove
1134: that the first part of \eqref{e:vcond} is satisfied for the percolation
1135: cloud we show:
1136: \begin{lemma}
1137: \label{l:cloudv}
1138: There exists $C$ large enough, such that for a.e.~realisation of $A_n$
1139: and for $n$ large enough
1140: \begin{equation}
1141: \label{e:rhs}
1142: v_n(k)\le C\Big[\binom n k \bar m(n)^{-1} + f_n(k)\Big]\qquad\forall
1143: k\in\{1,\dots,g(n)\}.
1144: \end{equation}
1145: \end{lemma}
1146: \begin{proof}
1147: Let $F_n(k)$ denote the right-hand side of
1148: \eqref{e:rhs}. By definition of $v_n(k)$,
1149: \begin{equation}
1150: \begin{split}
1151: P[v_n(k)\ge F_n(k)]
1152: &\le
1153: \sum_{x\in \mathcal V_n}
1154: P[|\{y\in A_n:d(x,y)=k\}|\ge F_n(k)]
1155: \\&=
1156: 2^n
1157: P[|\{y\in A_n:d(\boldsymbol 0,y)=k\}|\ge F_n(k)].
1158: \end{split}
1159: \end{equation}
1160: Using the same calculation as in \eqref{e:cheb} this is bounded from
1161: above by
1162: \begin{equation}
1163: 2^n \exp\Big\{-C\lambda \Big[\binom n k \bar m(n)^{-1} + f_n(k)\Big]+
1164: (e^\lambda -1)\binom n k \bar m(n)^{-1}\Big\}.
1165: \end{equation}
1166: If we choose $\lambda = \log \bar m(n)$ and $C$ large enough for $k=1$, or
1167: $\lambda = \mathit{const}$ and $C$ large enough for $k>1$, then the
1168: right-hand side of the last equation decays at least as fast as $e^{-cn}$ for all
1169: $k$. Summing over $k$ and using Borel-Cantelli Lemma yields the desired
1170: result.
1171: \end{proof}
1172:
1173: Lemma~\ref{l:cloudv} implies that
1174: \begin{equation}
1175: \sum_{k=1}^{g(n)-1} v_n(k) \xi_n(k)
1176: \le
1177: \sum_{k=1}^{g(n)-1} C\Big[\binom n k \bar m(n)^{-1}+ f_n(k)\Big] \xi_n(k).
1178: \end{equation}
1179: Using Lemma~\ref{l:xi}(i),(iv) this can be bounded by
1180: \begin{equation}
1181: \label{e:condcond}
1182: C\bigg\{
1183: \sum_{k=1}^{g(n)-1}
1184: \xi_n(k)\binom n k \bar m(n)^{-1}+
1185: \sum_{k=1}^3 f_n(k)\binom n k^{-1}
1186: +\sum_{k=4}^{g(n)-1} n f_n(k) \binom n k^{-1}
1187: \bigg\}
1188: \end{equation}
1189: The last two terms in the last formula are bounded by
1190: \begin{equation}
1191: C \sum_{k=1}^3 f_n(k) n^{-k}
1192: + C n^3 n^{-4}\xrightarrow{n\to\infty}0
1193: \end{equation}
1194: as can be seen easily from the definition of $f_n(k)$.
1195: The first term in \eqref{e:condcond} equals
1196: \begin{equation}
1197: \frac {n2^{-n}}{2\bar m(n)}
1198: \sum_{k=1}^{g(n)-1}
1199: \sum_{j=1}^{n-k}
1200: \binom n { j+k} \frac 1j
1201: =
1202: \frac {n}{2\bar m(n)}
1203: \sum_{j=1}^{n-1}
1204: \frac {2^{-n}}j
1205: \sum_{k=j+1}^{n\wedge (g(n)+j-1)}\binom n k\le
1206: \frac {C n \log n }{\bar m(n)}.
1207: \end{equation}
1208: This tends to $0$ by the assumptions on $\bar m(n)$.
1209: Therefore $v_n(k)$ verifies the first part of \eqref{e:vcond} $P$-a.s.~.
1210:
1211: To verify the second part observe first that if
1212: $\bar m(n)\ge \delta 2^n n^{-1}$ for some $\delta >0$, then \eqref{e:gg}
1213: holds for $g(n)=2$. Therefore,
1214: by Lemma~\ref{l:cloudv}, $V_n(g(n)-1)\le v_n(1) + 1 \le C \ll |A_n|$.
1215: We can hence
1216: further suppose that $\bar m(n)\ll 2^n n^{-1}$.
1217: By moderate deviations argument, and since $n/2-g(n)\gg n^{1/2}$,
1218: \begin{equation}
1219: |\{y:d(\boldsymbol 0,y)\le g(n)-1\}|\le 2^n e^{-c g(n)^2/n}.
1220: \end{equation}
1221: Since $2^n\bar m(n)^{-1}\gg n$ there is a function $f(n)$ such that
1222: \begin{equation}
1223: 2^n \bar m(n)^{-1}f(n)\gg n
1224: \qquad \text{and}\qquad
1225: 1\gg f(n)\gg e^{-cg(n)^2/n}.
1226: \end{equation}
1227: As in \eqref{e:cheb}
1228: \begin{equation}
1229: P\Big[V_n(g(n)-1)\ge
1230: \frac{2^{n}f(n)}{\bar m(n)}\Big]\le
1231: 2^n\exp\Big\{\frac{2^n}{\bar m(n)}\big[
1232: -\lambda f(n)+
1233: (e^\lambda -1)e^{-cg(n)^2/n}
1234: \big]
1235: \Big\}.
1236: \end{equation}
1237: For our choice of $f$ this is summable. Therefore a.s.~for $n$ large enough
1238: $V_n(g(n)-1)\le 2^n \bar m(n)^{-1} f(n)\ll |A_n|$. This verifies the second part of
1239: \eqref{e:vcond}.
1240:
1241: We have verified that with $P$ probability one the sequence of percolation
1242: clouds $A_n$ satisfies all the assumptions of Theorem~\ref{t:gen}.
1243: This proves Theorem~\ref{t:perc}.
1244: \end{proof}
1245:
1246: \begin{proof}[Proof of Theorem~\ref{t:sampling}]
1247: For any $m(n)$ satisfying the conditions of Theorem~\ref{t:sampling} it
1248: is possible to choose $\bar m(n)$ satisfying the conditions of
1249: Theorem~\ref{t:perc} such that
1250: \begin{equation}
1251: \bar m(n)^{-1} \ge (1+\varepsilon )m(n).
1252: \end{equation}
1253: We now consider a sequence of percolation clouds $A_n$ with density
1254: $\bar m(n)^{-1}$ defined on the same probability space
1255: $(\Omega',F',P')$ as $A'_n$. Since $P'$-a.s.~for all $n$ large enough
1256: $|A_n|>2^n m(n)^{-1}=|A'_n|$, we can couple $A'_n$ and $A_n$ in the way
1257: that for all $n$ large $A'_n\subset A_n$. Moreover, $A_n$ satisfies the
1258: conditions \eqref{e:volumes}--\eqref{e:vcond} of Theorem~\ref{t:gen}. To
1259: finish the proof observe that if $A_n$ satisfies these conditions, then
1260: any subset of $A_n$ satisfies them too.
1261: \end{proof}
1262: %%%% END OF ht3.tex %%%%
1263:
1264: %\input htapp
1265:
1266: %%%% START OF htbiblio.tex %%%%
1267: \subsection*{Acknowledgements} Both authors thank
1268: the Chair of Stochastic Modelling
1269: of the \'Ecole Polytechnique F\'ed\'erale of Lausanne
1270: for financial support. Ji\v r\'\i~\v Cern\'y
1271: thank the Centre de Physique Th\'eorique of Marseille for hospitality.
1272:
1273:
1274: \bibliographystyle{amsalpha}
1275: \bibliography{ht0}
1276: %%%% END OF htbiblio.tex %%%%
1277:
1278:
1279:
1280: \end{document}
1281: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1282: % vim:foldmethod=marker:foldnestmax=1:foldlevel=0:foldmarker=%<<<,%>>>
1283: