0410:math0410012/ge.tex

1: \documentclass[12pt]{article}

2: %\documentclass{article}

3:

4: % % Most of the commented-out lines were removed to make arxiv's life easier!

5: %

6: % \newif\ifpdf\ifx\pdfoutput\undefined\pdffalse\else\pdfoutput=1\pdftrue\fi

7:

8: % =========================================

9: % Revision and document information DO NOT ALTER THIS!

10: %  \usepackage{rcs}

11: %  \RCS $RCSfile: ge.tex $

12: %  \RCS $Revision: 1.33 $

13: %  \RCS $Author: Wilfrid $

14: %  \RCS $State: W2k $

15: %  \RCS $Date: 2004/07/06 22:22:35 $

16:  \newcommand{\Title}{Geometric Ergodicity and Perfect Simulation}

17:  \def\Subject{Geometric ergodicity implies CFTP}

18:  \def\Keywords{CFTP, domCFTP, geometric Foster-Lyapunov condition,

19: geometric ergodicity, Markov chain Monte Carlo, perfect simulation,

20: uniform ergodicity}

21:

22: % =========================================

23: % Packages

24: %  \usepackage{ifthen,amsmath,amsfonts,multicol,calc,times}

25: %  \usepackage{chicago,afterpage,url,xspace}

26: %  \usepackage[english]{babel}

27: %  \usepackage{color,graphicx}

28: %  \DeclareGraphicsExtensions{.png}%

29:  \usepackage{ifthen,amsmath,amsfonts,multicol,calc,times}

30:  \usepackage{chicago,url,xspace}

31:

32: % % =========================================

33: % % Get color names:

34: %  \input{dvipsnam.def}

35:

36: % =========================================

37: % Load hyperref

38: % \DefineNamedColor{named}{MyMulberry}      {cmyk}{0.34,0.90,0,0.75}

39: % \DefineNamedColor{named}{MyPineGreen}     {cmyk}{0.92,0,0.59,0.75}

40: % \DefineNamedColor{named}{MyMidnightBlue}  {cmyk}{0.98,0.13,0,0.75}

41: % \definecolor{linkcolor}{named}{MyMulberry}

42: % \definecolor{citecolor}{named}{MyPineGreen}

43: % \definecolor{pagecolor}{named}{MyMulberry}

44: % \definecolor{urlcolor}{named}{MyMidnightBlue}

45: % \usepackage[

46: % \ifpdf

47: %  colorlinks=true,

48: %  linkcolor=linkcolor,

49: %  citecolor=citecolor,

50: %  pagecolor=pagecolor,

51: %  urlcolor=urlcolor,

52: %  bookmarksopen=true,

53: %  pdfstartview=FitH,

54: % \fi

55: %  ]{hyperref}

56: \usepackage{hyperref}

57:

58:

59: % =========================================

60: % Information to be attached to pdf file (use CTRL+D to display!)

61: % \ifpdf

62: %  \def\mymoddate$#1: #2/#3/#4 #5:#6:#7 ${(D:#2#3#4#5#6#7)}

63: %  \pdfinfo{

64: %    /Title (\Title)

65: %    /Creator (TeX)

66: %    /Author (W. S. Kendall)

67: %    /Subject(\Subject)

68: %    /Keywords (\Keywords)

69: %    /CreationDate (D:20040528100000)

70: %    /ModDate \mymoddate$Date: 2004/07/06 22:22:35 $

71: %  }

72: % \fi

73:

74: % =========================================

75: % Mathematics and style commands

76:  \newtheorem{result}{Result}

77:  \newtheorem{algorithm}[result]{Algorithm}

78:  \newtheorem{definition}[result]{Definition}

79:  \newtheorem{theorem}[result]{Theorem}

80:  \newtheorem{lemma}[result]{Lemma}

81:  \newenvironment{proof}[1][]{\noindent%

82: \ifthenelse{\equal{#1}{}}{\textbf{Proof:} }%

83: {\textbf{Proof (#1):\newline}}}%

84: {\smallskip\hfill\(\Box\)\bigskip}

85:

86: %\usepackage{wskmath}

87:  \renewcommand{\d}{\operatorname{\text{d}}}

88:  \newcommand{\dist}{\operatorname{\text{dist}}}

89:  \newcommand{\CFTP}{\emph{CFTP}\xspace}

90:  \newcommand{\domCFTP}{\emph{dom}\CFTP}

91:  \newcommand{\Expect}[1]{\operatorname{\mathbb{E}}\left[#1\right]}

92:  \newcommand{\Indicator}[1]{\operatorname{\mathbb{I}}\left[#1\right]}

93:  \newcommand{\Law}[1]{\mathcal{L}\left({#1}\right)}

94:  \newcommand{\Leb}{\operatorname{\text{Leb}}}

95:  \newcommand{\Prob}[1]{\operatorname{\mathbb{P}}\left[#1\right]}

96:

97:

98: \begin{document}

99:

100: % % Preprint header

101: %  \input 427.tex

102:

103: \thispagestyle{empty}

104:

105: \title{\Title}

106: % \author{\href{mailto:w.s.kendall@warwick.ac.uk?subject=Re:GEandCFTP(\RCSState-\RCSRevision)}{Wilfrid

107: %     S.~Kendall}}

108: \author{Wilfrid S.~Kendall}

109:

110: %\date{\today: DRAFT \RCSRevision}

111: \date{\text{ }}

112:

113: \maketitle

114:

115: \begin{quote}

116:   {\textbf{Keywords:} \small\textsc\Keywords}

117: \end{quote}

118: \begin{quote}

119:   {AMS 2000 Mathematics Subject

120:     Classification}: 60J10, 65C05, 68U20

121: \end{quote}

122:

123: \begin{abstract}

124:   This note extends the work of \citeN{FossTweedie-1997}, who showed

125:   that availability of the classic \citeN{ProppWilson-1996}

126: Coupling from The Past algorithm

127:   is essentially equivalent to uniform

128:   ergodicity for a Markov chain (see also \citeNP{HobertRobert-2004}).

129:   In this note we show that all geometrically ergodic chains possess

130:   dominated Coupling from The Past algorithms (not necessarily

131:   practical!) which are rather closely connected to Foster-Lyapunov

132:   criteria.

133: \end{abstract}

134:

135:

136: \section{Introduction}

137: \label{sec:intro}

138:

139: Throughout this paper \(X\) will denote an aperiodic Harris-recurrent

140: Markov chain on a measurable state space \(\mathcal{X}\) which is a

141: Polish space (the Polish condition is required in order to

142: ensure existence of regular conditional probabilities). Recall that \(X\) is

143: said to be \emph{geometrically ergodic} if it converges in total

144: variation and at geometric rate to statistical equilibrium \(\pi\), with

145: multiplicative constant depending on the starting point:

146: \begin{equation}

147:   \label{eq:geometric-equilibrium}

148: \dist_{\text{TV}}(\Law{X_n},\pi) \quad\leq\quad V(X_0) \gamma^n

149: \end{equation}

150: for some function \(V:\mathcal{X}\to[1,\infty)\) and some rate \(\gamma\in(0,1)\).

151: The chain \(X\) is said to be \emph{uniformly ergodic} if the function

152: \(V\) can be chosen to be constant.

153: % , and merely \emph{ergodic} if it

154: % can only be said that \(\dist_{\text{TV}}(\Law{X_n},\pi)\to0\) as \(n\to\infty\)

155: % for (\(\pi\)-almost all) \(x\in\mathcal{X}\).

156:

157: We also recall the notion of a small set:

158: \begin{definition}\label{lem:minorization}

159:   A subset \(C\subseteq\mathcal{X}\) is a \emph{small set (of order

160:   \(k\))} for the Markov chain \(X\) if there is a \emph{minorization

161:   condition}: for \(\beta\in(0,1)\), and probability measure \(\nu\),

162: \begin{equation}

163:   \label{eq:minorization}

164:   \Prob{X_{k}\in E\;|\;X_0=x} \quad\geq\quad \beta \Indicator{x\in C}\times\nu(E)

165: \quad\text{for all measurable }E\subseteq\mathcal{X}\,.

166: \end{equation}

167: \end{definition}

168: Results are often stated in terms of the more general notion of

169: \emph{petite sets}; however for \(\psi\)-irreducible aperiodic chains the

170: two notions are equivalent \\

171: \cite[Theorem 5.5.7]{MeynTweedie-1993}.

172:

173: \citeN{FossTweedie-1997} use small set theory to show that the

174: condition of uniform ergodicity for such \(X\) is \emph{equivalent} to

175: the existence of a Coupling from the Past algorithm in the sense of

176: \citeN{ProppWilson-1996}. This {\emph{classic} \CFTP{}} algorithm

177: delivers a perfect sample from the equilibrium distribution of \(X\).

178: The key to the \citeANP{FossTweedie-1997} argument is to remark that

179: in case of uniform ergodicity the entire state space is small.

180: Sub-sampling the process \(X\) if necessary (to reduce the

181: \hyperref[lem:minorization]{order of the small set} to \(1\)), one can

182: then devise a classic \CFTP algorithm which is actually of the form

183: introduced by \citeN{MurdochGreen-1997} as the \emph{multigamma

184:   coupler}.  \citeN{HobertRobert-2004} develop the

185: \citeANP{FossTweedie-1997} argument to produce approximations to deal

186: with \emph{burn-in} (time till approximate equilibrium) in the

187: geometrically ergodic case.

188:

189: The \citeANP{FossTweedie-1997} result might be thought to delimit and

190: constrain the possible range of applicability of \CFTP. However it is

191: also possible to sample perfectly from the equilibrium of some

192: strictly geometrically ergodic chains using a generalization: namely

193: \emph{dominated \CFTP} (\domCFTP) as introduced in \citeN{Kendall-1998a},

194: \citeN{KendallMoller-2000}, \citeN{CaiKendall-1999a}. In this note we

195: show that this is generic: geometric ergodicity implies the existence

196: of a special form of \domCFTP algorithm adapted to the geometric

197: ergodicity in question.  Recent expositions of quantitative

198: convergence rate estimation depend heavily on small sets and their

199: relatives (see for example \citeNP{Rosenthal-2002}), so this

200: piece of \CFTP theory connects to quantitative convergence theory in a

201: rather satisfying way.

202:

203: To describe this special form of \domCFTP, we must first introduce the

204: notion of a Foster-Lyapunov condition.  Geometric ergodicity for our

205: \(X\) is equivalent to a \emph{geometric Foster-Lyapunov condition}

206: involving recurrence on small sets (this can be extracted from

207: \citeNP[Theorem 16.0.1]{MeynTweedie-1993}):

208: \begin{equation}\label{eq:foster-lyapunov}

209: \Expect{\Lambda(X_{n+1}) \;|\; X_{n}=x} \quad\leq\quad

210: \alpha \Lambda(x) + b \Indicator{X_n\in C}\,,

211: \end{equation}

212: for some \(\alpha\in(0,1)\) and \(b>0\), some

213: \hyperref[lem:minorization]{small set} \(C\), and a function

214: \(\Lambda:\mathcal{X}\to[1,\infty)\) which is bounded on \(C\).

215: Note that \(\alpha+b\geq1\) is required, as is \(\Lambda|_{C^c}\geq\alpha^{-1}\),

216: since we impose \(\Lambda\geq1\).

217: \label{page:marker2}

218: %  Similarly, mere

219: %  ergodicity of \(X\) is equivalent to a weaker Foster-Lyapunov

220: %  condition \cite[Theorem 13.0.1]{MeynTweedie-1993}

221: %  \begin{equation}\label{eq:weak-foster-lyapunov}

222: %  \Expect{\Lambda(X_{n+1}) \;|\; X_{n}=x} \quad\leq\quad

223: %  \Lambda(x) -1 + b \Indicator{X_n\in C}\,,

224: %  \end{equation}

225: %  for \(b\), \(C\), \(\Lambda\) as above.

226:

227: Now \hyperref[eq:foster-lyapunov]{Condition

228:   (\ref*{eq:foster-lyapunov})} implies that every sub-level set

229: \(\{x\in\mathcal{X}:\Lambda(x)\leq c\}\) is small (as indeed do weaker

230:   conditions; \citeNP[Theorem 14.2.3]{MeynTweedie-1993}).

231:  \newline

232: %  \noindent\fbox{\begin{minipage}[c]{\linewidth}

233: %      \textcolor{blue}{\textbf{The following material is not present in

234: %          submitted version:}}

235: %      \textcolor{blue}{

236: This is a key fact for our argument so we

237:         sketch a coupling proof.

238: %}

239:

240:

241: %     \textcolor{blue}{

242: First note that without loss of generality we

243:        can employ sub-sampling to ensure that the small set \(C\) in

244:        \hyperref[eq:foster-lyapunov]{Condition

245:          (\ref*{eq:foster-lyapunov})} is of

246:        \hyperref[lem:minorization]{order \(1\)}. Super-martingale

247:        arguments show that we can choose \(n\) such that

248:        \(\Prob{X\text{ hits }C\text{ before }n\;|\;X_0=x}\) can be

249:        bounded away from zero uniformly in \(x\) for \(\Lambda(x)\leq c\). Let

250:        the hitting probability lower bound be \(\rho_0\). We can use the

251:        \hyperref[eq:minorization]{Minorization Condition

252:          (\ref*{eq:minorization})} to realize \(X\) as a split-chain in

253:        the sense of \citeN{Nummelin-1978}, regenerating with

254:        probability \(\beta\) whenever \(X\in C\).  Couple chains from

255:        different starting points according to the time when \(X\) first

256:        regenerates in \(C\), yielding a family of realizations \(X^x\)

257:        of the Markov chain, with \(X^x_0=x\), such that with positive

258:        probability \(\beta\rho_0\) all realizations \(\{X^x : \Lambda(x)\leq c\}\)

259:        coalesce into a set of at most \(n\) trajectories by time \(n\)

260:        (divided according to the time of first regeneration).  Now

261:        apply a renewal-theoretic argument to the subsequent

262:        regenerations of this finite set of trajectories, which are

263:        allowed to evolve independently, except that whenever two

264:        trajectories regenerate at the same time they are forced to

265:        coalesce.  Straightforward analysis shows that we can choose

266:        \(m\) such that with positive probability \(\rho_1<\beta\rho_0\) all

267:        trajectories starting from \(\{x\in\mathcal{X}:\Lambda(x)\leq c\}\) have

268:        coalesced to just one trajectory by time \(n+m\).  Hence

269:        \(\{x\in\mathcal{X}:\Lambda(x)\leq c\}\) is a small set of order \(n+m\),

270:        with minorization probability \(\rho_1\).

271: % }

272: %  \end{minipage}}

273: %

274: It is convenient to isolate the notion of a \emph{scale function}

275: such as \(\Lambda\) in \hyperref[eq:foster-lyapunov]{Equation

276:   (\ref*{eq:foster-lyapunov})}.

277: \begin{definition}\label{def:scale-function}

278:   A \emph{(Foster-Lyapunov) scale function} for a Markov chain state

279:   space \(\mathcal{X}\) is a measurable function

280: \[

281: \Lambda:\mathcal{X}\to[1,\infty)

282: \]

283: such that sub-level sets \(\{x\in\mathcal{X}:\Lambda(x)\leq \lambda\}\) are small for all

284: \(\lambda\geq1\).

285: \end{definition}

286:

287: Now we can define the special form of \domCFTP which we require, which

288: is adapted to a specified Foster-Lyapunov scale function.

289: \begin{definition}\label{defn:dominating-scale}

290:   Suppose that \(\Lambda\) is a scale function for an Harris-recurrent Markov chain

291:   \(X\).  We say the stationary ergodic random process \(Y\) on

292:   \([1,\infty)\) is a \emph{dominating process for \(X\) based on the scale

293:     function \(\Lambda\)} (with \emph{threshold} \(h\) and \emph{coalescence

294:     probability} \(\varepsilon\)) if it is coupled co-adaptively to realizations

295: of \(X^{x,-t}\) (the Markov chain \(X\) begun at \(x\) at time \(-t\))

296: as follows:

297: \begin{itemize}

298: \item[(a)] for all \(x\in\mathcal{X}\), \(n>0\), and \(-t\leq0\),

299: almost surely

300: \begin{equation}\label{eq:domination-requirement}

301: \Lambda(X^{x,-t}_{-t+n})\quad\leq\quad Y_{-t+n} \qquad\Rightarrow\qquad

302: \Lambda(X^{x,-t}_{-t+n+1})\quad\leq\quad

303: Y_{-t+n+1}\,;

304: \end{equation}

305: \item[(b)] moreover if \(Y_n\leq h\) then the probability  of

306:   \emph{coalescence} is at least \(\varepsilon\), where coalescence means that the set

307: \[

308: \left\{ X^{x,-t}_{n+1}\;:\;

309: \text{ such that }-t\leq n \text{ and } \Lambda(X^{x,-t}_n)\leq Y_n \right\}

310: \]

311: is a singleton set;

312: \item[(c)] and finally, \(\Prob{Y_n\leq h}\) must be positive.

313: \end{itemize}

314: \end{definition}

315:

316: Suppose \(Y\) is a dominating process for \(X\) based on the scale

317: \(\Lambda\).  The following \domCFTP algorithm then yields a draw from

318: the equilibrium distribution of \(X\).

319: \begin{algorithm}\label{ag:dom-cftp}

320: \begin{itemize}

321: \item[]

322: \item[] Simulate \(Y\) backwards in equilibrium till the most recent

323:     \(T<0\) for which \(Y_T\leq h\);

324: \item[] while coalescence does not occur

325:     at time \(T\):

326:   \begin{itemize}

327:   \item[] extend \(Y\) backwards till the most recent

328:       \(S<T\) for which \(Y_S\leq h\);

329:   \item[] set \(T\gets S\);

330:   \end{itemize}

331: \item[] simulate the coupled \(X\) forwards from time \(T+1\),

332:     starting with the unique state produced by the

333:     coalescence event at time \(T\);

334: \item[] return \(X_0\) as a perfect draw from equilibrium.

335: \end{itemize}

336: \end{algorithm}

337: Practical implementation considerations are: (1) can one draw from the

338: equilibrium of \(Y\)? (2) can one simulate \(Y\) backwards in

339: equilibrium? (3) can one couple the dominated target processes

340: \(X^{x,-t}\) with \(Y\) so as to ensure the possibility of

341: regeneration? (4) can one determine when this regeneration has

342: occurred? and, of course, (5) will the algorithm not run too slowly?

343:

344: The simplest kind of ordinary small-set \CFTP, as in

345: \citeN{MurdochGreen-1997}, is recovered from this Algorithm by taking

346: \(Y\equiv h\), and requiring the whole state-space to be small.  In actual

347: constructions, care must be taken to ensure that \(Y\) dominates a

348: coupled collection of \(X\) for which coalescence is possible as

349: specified in \hyperref[defn:dominating-scale]{Definition

350:   \ref*{defn:dominating-scale}(b)} (see the treatment of \CFTP for Harris

351: chains in \citeNP{CorcoranTweedie-2000}).

352:

353: The proof that this algorithm returns a perfect draw from the

354: equilibrium distribution of \(X\) is an easy variation on the usual

355: \domCFTP argument, found at varying levels of generality in

356: \citeNP{Kendall-1998a,KendallMoller-2000,CaiKendall-1999a}. The key is

357: to observe that \hyperref[ag:dom-cftp]{Algorithm \ref*{ag:dom-cftp}}

358: reconstructs a coalesced trajectory which may be viewed as produced by

359: the Markov chain begun at time \(-\infty\) at some specified state \(x\)

360: such that \(\Lambda(x)\leq h\): the proof is then an exercise in making this

361: heuristic precise.

362:

363: The \citeN{FossTweedie-1997} argument, and the fact that the

364: geometric Foster-Lyapunov

365: \hyperref[eq:foster-lyapunov]{condition

366:   (\ref*{eq:foster-lyapunov})} would certainly produce a dominating

367: process if the expectation inequality was replaced by a stochastic

368: domination, suggests our main result, which will be proved in

369: \hyperref[sec:implication]{Section \ref*{sec:implication}}:

370: \begin{theorem}\label{thm:geometric-domCFTP}

371:   If \(X\) is a geometrically ergodic Markov chain, and \(\Lambda\) is a

372:   scale function for \(X\) which is derived from some

373:   geometric Foster-Lyapunov condition, then there exists a \domCFTP

374:   algorithm for \(X\) (possible subject to sub-sampling) using a

375:   dominating process based on the scale \(\Lambda\), as in

376:   \hyperref[ag:dom-cftp]{Algorithm \ref*{ag:dom-cftp}}.

377: \end{theorem}

378:

379: As in the case of the \citeN{FossTweedie-1997} result, this algorithm

380: need not be at all practical!

381:

382: \section{Geometric ergodicity implies \domCFTP}

383: \label{sec:implication}

384: We begin with a lemma concerning the effect of sub-sampling on the

385: geometric Foster-Lyapunov

386: \hyperref[eq:foster-lyapunov]{

387:   condition}.

388: \begin{lemma}\label{lem:sub-sampling}

389:   Suppose \(X\) satisfies a \hyperref[eq:foster-lyapunov]{geometric

390:     Foster-Lyapunov condition}: for some \(\alpha<1\), some scale function

391:     \(\Lambda\), and small set \(C=\{x\in\mathcal{X}:\Lambda(x)\leq c\}\).

392: \begin{equation}\label{eq:fl-working}

393: \Expect{\Lambda(X_{n+1}) \;|\; X_{n}=x} \quad\leq\quad \alpha \Lambda(x) + b

394: \Indicator{\Lambda(X_n)\leq c}\,.

395: \end{equation}

396: Under \(k\)-sub-sampling we obtain a similar condition but with

397: different constants:

398: \begin{equation}

399: \Expect{\Lambda(X_{n+k}) \;|\; X_{n}=x} \quad\leq\quad \alpha^{k-1} \Lambda(x) + b^{\prime}

400: \Indicator{\Lambda(X_n)\leq c^{\prime}}\,,

401: \label{eq:sub-sampling-1}

402: \end{equation}

403: and also, if \(k\geq2\),

404: \begin{equation}

405: \Expect{\Lambda(X_{n+k}) \;|\; X_{n}=x} \quad\leq\quad \alpha \Lambda(x) + b^{\prime\prime}

406: \Indicator{\Lambda(X_n)\leq c^{\prime\prime}}\,.

407: \label{eq:sub-sampling-2}

408: \end{equation}

409: Moreover \(b^\prime=b/(1-\alpha)\), \(c\prime=b/(\alpha^{k-1} (1-\alpha)^2)\) may be chosen not

410: to depend on \(c\), and \(b^{\prime\prime}=b/(1-\alpha)\), \(c^{\prime\prime}=b/(\alpha(1-\alpha)^2)\)

411: may be chosen to depend neither on \(c\) nor on \(k\geq2\).

412: \end{lemma}

413: We are able to choose \(b^\prime\), \(c\prime\), \(b^{\prime\prime}\), \(c^{\prime\prime}\) not to

414: depend on \(c\) because we have allowed generous sub-sampling

415: (\emph{i.e.}: \(k\)-sub-sampling to change \(\alpha\) to \(\alpha^{k-1}\)).

416:

417: \begin{proof}

418:   Iterating \hyperref[eq:fl-working]{Equation (\ref*{eq:fl-working})},

419:   \begin{align*}

420: \Expect{\Lambda(X_{n+k}) \;|\; X_{n}=x} &\quad\leq\quad

421: \alpha^k \Lambda(x) + \sum_{j=1}^k \alpha^{j-1} b\Expect{\Indicator{\Lambda(X_{n+k-j})\leq c}\;|\;X_{n}=x}\\

422: &\quad\leq\quad

423: \alpha^k \Lambda(x) + \frac{b}{1-\alpha} \\

424: &\quad=\quad

425: \alpha^{k-1} \Lambda(x) - \alpha^{k-1} (1-\alpha) \Lambda(x)+ \frac{b}{1-\alpha} \\

426: &\quad\leq\quad

427: \begin{cases}

428:   \alpha^{k-1} \Lambda(x) & \text{ if } \Lambda(x)> \frac{b}{\alpha^{k-1} (1-\alpha)^2} \,,\\

429:   \alpha^{k-1} \Lambda(x) + {b}/{(1-\alpha)} & \text{ otherwise.}

430: \end{cases}

431:   \end{align*}

432:   Hence we may choose \(b^{\prime}=b/(1-\alpha)\), \(c^{\prime}=b/(\alpha^{k-1}

433:   (1-\alpha)^2)\). Alternatively

434: \begin{align*}

435:   \Expect{\Lambda(X_{n+k}) \;|\; X_{n}=x} &\quad\leq\quad

436: \alpha \Lambda(x) - \alpha (1-\alpha^{k-1}) \Lambda(x)+ \frac{b}{1-\alpha} \\

437: &\quad\leq\quad

438: \begin{cases}

439:   \alpha \Lambda(x) & \text{ if } \Lambda(x)> \frac{b}{\alpha (1-\alpha)(1-\alpha^{k-1})} \,,\\

440:   \alpha \Lambda(x) + {b}/{(1-\alpha)} & \text{ otherwise.}

441: \end{cases}

442: \end{align*}

443: Hence we may choose \(b^{\prime\prime}=b/(1-\alpha)\), \(c^{\prime\prime}=b/(\alpha (1-\alpha)^2)\) if

444: \(k\geq2\).

445: \end{proof}

446:

447:

448: \begin{proof}[of Theorem \ref*{thm:geometric-domCFTP}]

449: We first construct the dominating process.

450:

451: Consider Markov's inequality applied to the

452: geometric Foster-Lyapunov

453: \hyperref[eq:foster-lyapunov]{inequality

454:   (\ref*{eq:foster-lyapunov})}.

455: Any dominating process \(Y\) must satisfy the

456: \hyperref[eq:domination-requirement]{stochastic domination

457:   (\ref*{eq:domination-requirement})} described in

458: \hyperref[defn:dominating-scale]{Definition

459:   \ref*{defn:dominating-scale}}. Consequently, in default of further

460: distributional information about \(\Prob{\Lambda(X_{n+1}) | X_{n}=x}\),

461: if \(Y\) is to

462: be a dominating process based on the scale \(\Lambda\) then we need \(Y\) to

463: be stationary ergodic but also to satisfy

464: \begin{equation}\label{eq:crude-domination}

465:    \Prob{Y_{n+1}\geq \alpha z y\;|\; Y_n=z} \quad\geq\quad

466:  \sup_{x: \Lambda(x)\leq z}\frac{\Expect{\Lambda(X_{n+1})\;|\;X_n=x}}{\alpha z y}\,.

467: \end{equation}

468:

469: Now if \(C\subseteq\{x\in\mathcal{X}:\Lambda(x)\leq c\}\) then

470:  \begin{align*}

471:  \sup_{x: \Lambda(x)\leq z}\frac{\Expect{\Lambda(X_{n+1})\;|\;X_n=x}}{\alpha z y}

472:  &\quad\leq\quad

473:  \sup_{x: \Lambda(x)\leq z}

474: \frac{\alpha\Lambda(x) + b \Indicator{x:\Lambda(x)\leq c}}{\alpha z y}\\

475:  \quad\leq\quad \sup_{x: \Lambda(x)\leq z}\frac{\alpha \Lambda(x)}{\alpha z y} & \quad=\quad \frac{1}{y}

476:  \qquad\text{ so long as }

477:  z\geq c + \frac{b}{\alpha}\,.

478:  \end{align*}

479:

480: Consequently \(Y\) is a possible candidate for a dominating process based

481: on the scale \(\Lambda\) if

482: \begin{equation}

483:   \label{eq:dominating-in-scale}

484:   \Prob{Y_{n+1}\geq \alpha z y \:|\; Y_n=z} \quad=\quad

485:   \begin{cases}

486:     1/y & \text{ if } z\geq c + \frac{b}{\alpha} \,,\\

487:     1   & \text{ otherwise.}

488:   \end{cases}

489: \end{equation}

490: If we define \(U\) by \(Y=(c+b/ \alpha)\exp(U)\) (so \(U\) is a

491: \emph{log-dominating process}) then \(U\) is the system workload of a

492: \(D/M/1\) queue, sampled at arrivals, with arrivals every \(\log(1/

493: \alpha)\) units of time, and service times being independent and of unit

494: Exponential distribution. The process \(U\) is a random walk with

495: reflection (of Skorokhod type) at \(0\): as its jump distribution is

496: \(\text{Exponential}(1)-\log(1/ \alpha)\) we may deduce it is

497: positive-recurrent if and only if \(\alpha<e^{-1}\).

498:

499: \label{page:marker1}

500: In case \(e^{-1}<\alpha<1\), \(U\) and \(Y=(c+b/ \alpha)\exp(U)\) fail to be

501: positive-recurrent.  However the same construction will work if we use

502: \hyperref[eq:sub-sampling-1]{Equation (\ref*{eq:sub-sampling-1})} of

503: \hyperref[lem:sub-sampling]{Lemma \ref*{lem:sub-sampling}} to justify

504: sub-sampling \(X\) with a sampling period \(k\) large enough to ensure

505: a \hyperref[eq:foster-lyapunov]{geometric Foster-Lyapunov condition

506:   (\ref*{eq:foster-lyapunov})} using \(\Lambda\) as scale but with \(\alpha\)

507: replaced by \(\alpha^{k-1}<e^{-1}\), and amending \(b\) to \(b^\prime\), \(c\)

508: to \(c^\prime\) as in \hyperref[eq:sub-sampling-1]{Inequality

509:   (\ref*{eq:sub-sampling-1})}.

510:

511: Thus without loss of generality

512: we may assume \(\alpha<e^{-1}\),

513: and so this \(Y\) can be run in statistical equilibrium, and thus

514: qualifies as least partly as a dominating process for the purposes of

515: \hyperref[thm:geometric-domCFTP]{Theorem

516:   \ref*{thm:geometric-domCFTP}}. In the sequel we assume moreover that

517: further sub-sampling has been carried out based on

518: \hyperref[eq:sub-sampling-2]{Equation (\ref*{eq:sub-sampling-2})}, to

519: ensure that the following small set is of order \(1\):

520: \begin{equation}

521:   \label{eq:small-target}

522:   \left\{x\in\mathcal{X}\;:\; \Lambda(x) \leq h \right\}

523: \qquad\text{ for }\qquad

524: h=\max\left\{c+\frac{b}{\alpha},

525: \frac{b}{\alpha(1-\alpha)}\left(1+\frac{1}{1-\alpha}\right)

526: \right\}\,.

527: \end{equation}

528: Here the level \(h\geq c+b/ \alpha\) is fixed so as to ensure \(h=c^{\prime\prime}+b^{\prime\prime}/(1-\alpha)\) with

529: \(b^{\prime\prime}\), \(c^{\prime\prime}\) given as in

530: \hyperref[eq:sub-sampling-2]{Equation (\ref*{eq:sub-sampling-2})};

531: thus \(h\) supplies a stable threshold for geometric Foster-Lyapunov

532: conditions, even allowing for further sub-sampling if required. Note

533: in particular that \(Y=(c+b/

534: \alpha)\exp(U)\) is able to sink below \(h\), since \(h\geq c+b/\alpha\) and the

535: system workload \(U\) can reach zero.

536: \label{page:marker3}

537:

538: To fulfil the requirements on a dominating process given in

539: \hyperref[defn:dominating-scale]{Definition

540:   \ref*{defn:dominating-scale}}, we need to construct a coupling

541: between \(Y\) and the target process \(X\) expressed in

542: terms of a random flow of independent maps

543: \(F_{-t+n+1}:\mathcal{X}\to\mathcal{X}\):

544: \[

545: X^{x,-t}_{-t+n+1}\quad=\quad F_{-t+n+1}(X^{x,-t}_{-t+n})

546: \]

547: satisfying the distributional requirement that \(X^{x,-t}\) should

548: evolve as the Markov chain \(X\),

549: the \hyperref[eq:domination-requirement]{domination requirement

550: expressed by the implication (\ref*{eq:domination-requirement})},

551: and also the regeneration requirement

552: that with probability \(\varepsilon\) the set

553: \[

554: \left\{ F_n(u) \;:\; \text{ such that } \Lambda(u)\leq h \right\}

555: \]

556: should be a singleton set. The well-known link between stochastic

557: domination and coupling can be applied together with the arguments

558: preceding \hyperref[eq:dominating-in-scale]{Equation

559:   (\ref*{eq:dominating-in-scale})} to show that we can couple the

560: various \(X^{x,-t}\) with \(Y\) co-adaptively in this manner so that

561: the implication (\ref*{eq:domination-requirement}) holds: note that

562: here and here alone we use the Polish space nature of \(\mathcal{X}\),

563: which allows us to complete the couplings by constructing regular

564: conditional probability distributions for the various \(X^{x,-t}\)

565: conditioned on the \(\Lambda(X^{x,-t})\). Thus all that is required is to

566: show that this stochastic domination coupling can be modified to allow

567: for regeneration.

568:

569: The small set condition for \(\{x\in\mathcal{X}:\Lambda(x)\leq h\}\) means there is

570: a probability measure \(\nu\) and a scalar \(\beta\in(0,1)\) such that for all

571: Borel sets \(B\subseteq[1,\infty)\), whenever \(\Lambda(x)\leq h\),

572: \begin{equation}

573:   \label{eq:crucial-small-set}

574:   \Prob{\Lambda(X_{n+1})\in B \;|\; X_n=x} \quad\geq\quad \beta\nu(B) \,.

575: \end{equation}

576: Moreover the stochastic domination which has been arranged in the

577:  course of defining \(Y\) means that for all real \(u\),

578: whenever \(\Lambda(x) \leq y\),

579:  \begin{equation}

580:    \label{eq:crucial-stochastic-domination}

581:   \Prob{\Lambda(X_{n+1})>u \;|\; X_n=x} \quad\leq\quad

582: \Prob{Y>u\;|\; Y=y}\,.

583:  \end{equation}

584: We can couple in order to arrange for regeneration if we can identify

585: a probability measure \(\widetilde\nu\), defined solely in terms of

586: \(\nu\) and the dominating jump distribution \(\Prob{Y\geq u \;|\;

587:   Y=y}\), such that for all real \(u\)

588: \begin{align*}

589: \Prob{\Lambda(X_{n+1})>u \;|\; X_n=x} - \beta \nu((u,\infty))

590: \quad  &\leq\quad\Prob{Y>u \;|\;  Y=y} - \beta\widetilde\nu((u,\infty))\\

591: \nu((u,\infty)) \quad &\leq\quad \widetilde\nu((u,\infty))

592: \end{align*}

593: and moreover

594: \[

595:   \Prob{Y_{n+1}\in B \;|\; Y_n=y} \quad\geq\quad \beta\widetilde\nu(B) \,.

596: \]

597: For then at each step we may determine whether or not regeneration has

598: occurred (with probability \(\beta\)); under regeneration we use

599: stochastic domination to couple \(\nu\)

600: to \(\widetilde\nu\); otherwise we use

601: stochastic domination to couple the residuals.

602:

603: We state and prove this as an interior lemma, as it may be of wider interest.

604: \begin{lemma}\label{lem:mixture-domination-coupling}

605: Suppose \(U\), \(V\) are two random variables defined on \([1,\infty)\) such

606: that

607: \begin{itemize}

608: \item[(a)] The distribution \(\Law{U}\) is stochastically dominated by the distribution

609:   \(\Law{V}\):

610: \begin{equation}

611: \Prob{U> u} \quad\leq\quad \Prob{V>u}\qquad\text{ for all real }U\,;

612: \label{eq:lemma-domination}

613: \end{equation}

614: \item[(b)] \(U\) satisfies a minorization condition: for some

615:   \(\beta\in(0,1)\) and probability measure \(\nu\):

616:   \(B\subseteq[1,\infty)\),

617: \begin{equation}

618: \Prob{U\in B} \quad\geq\quad \beta\nu(B) \qquad \text{ for all Borel sets }B\subseteq[1,\infty)\,.

619: \label{eq:lemma-minorization}

620: \end{equation}

621: \end{itemize}

622: Then there is a probability measure \(\mu\) stochastically

623: dominating \(\nu\) and such that \(\beta\mu\) is minorized by

624: \(\Law{V}\). Moreover \(\mu\) depends only on \(\beta\nu\)

625: and \(\Law{V}\).

626: \end{lemma}

627: \begin{proof}[of Lemma \ref{lem:mixture-domination-coupling}]

628:   Subtract the measure \(\beta\nu((u,\infty))\) from both sides of

629:   \hyperref[eq:lemma-domination]{Inequality (\ref*{eq:lemma-domination})}

630:   representing the stochastic domination \(\Law{U}\preceq\Law{V}\). By the

631: \hyperref[eq:lemma-minorization]{minorization condition

632:   (\ref*{eq:lemma-minorization})} the resulting left-hand-side is nonnegtive. Thus for all real

633:   \(u\)

634: \[

635: 0 \quad\leq\quad \Prob{U>u} - \beta\nu((u,\infty)) \quad\leq\quad \Prob{V>u} - \beta\nu((u,\infty))

636: \]

637:  Now

638:   \(\Law{U}-\beta\nu\) is a nonnegative measure (because of the

639:   \hyperref[eq:lemma-minorization]{minorization condition

640:   (\ref*{eq:lemma-minorization})}). Consequently \(\Prob{U>u} - \beta\nu((u,\infty))\)  must be

641:   non-increasing in \(u\) and so we may reduce the

642:   right-hand side by minimizing over \(w\leq u\):

643: % Hence for all real \(u\)

644: \begin{align*}

645:   \Prob{U>u} - \beta\nu((u,\infty)) &\quad\leq\quad \inf_{w\leq u}\left\{ \Prob{V>w} -

646:     \beta\nu((w,\infty)) \right\}\\

647: &\quad=\quad \Prob{V>u} -\beta\mu((u,\infty))

648: \end{align*}

649: where \(\mu\) is the potentially \emph{signed} measure defined by

650: \[

651: \beta \mu([1,u]) \quad=\quad

652: \Prob{V\leq u} - \sup_{w\leq u}\left\{ \Prob{V\leq w} - \beta\nu([1,w)) \right\}\,.

653: \]

654: In fact \(\mu\) is a probability measure on \([1,\infty)\). Both

655: \(\mu(\{1\})=\nu(\{1\})\) and

656: \(\mu([1,\infty))=1\) follow from considering \(u=1\), \(u\to\infty\). Now we show

657: \(\mu\) is nonnegative:

658: \begin{align*}

659: &  \beta\mu((u,u+u^\prime]) - \Prob{u<V\leq u+u^\prime}

660:  \\

661: & \quad=\quad

662:  - \sup_{w\leq u+u^\prime}\left\{ \Prob{V\leq w} - \beta\nu([1,w)) \right\}

663:  + \sup_{w\leq u}\left\{ \Prob{V\leq w} - \beta\nu([1,w)) \right\}\,.

664: \end{align*}

665: If the first supremum were to be attained at \(w\leq u\) then the two suprema

666: would cancel. If the first supremum were to be attained at \(w^\prime\in[u,u+u^\prime]\)

667: then

668: \begin{align*}

669: &  \beta\mu((u,u+u^\prime]) - \Prob{u<V\leq u+u^\prime}

670:  \\

671: & \quad=\quad

672:  - \Prob{V\leq w^\prime} + \beta\nu([1,w^\prime))

673:  + \sup_{w\leq u}\left\{ \Prob{V\leq w} - \beta\nu([1,w)) \right\}\\

674: & \quad\geq\quad

675:  - \Prob{V\leq w^\prime} + \beta\nu([1,w^\prime))

676:  + \Prob{V\leq u} - \beta\nu([1,u)

677: \end{align*}

678: and hence

679: \[

680: \beta\mu((u,u+u^\prime])

681: \quad\geq\quad \Prob{w^\prime<V\leq u+u^\prime}

682:  + \beta\nu([u,w^\prime))

683: \quad\geq\quad0\,.

684: \]

685: So we can deduce \(\beta\mu\) is in fact a nonnegative measure.

686: \label{page:marker4}

687:

688: On the other hand

689: \begin{align*}

690: &  \beta\mu((u,u+u^\prime]) - \Prob{u<V\leq u+u^\prime}

691:  \\

692: & \quad=\quad

693:  - \sup_{w\leq u+u^\prime}\left\{ \Prob{V\leq w} - \beta\nu([1,w)) \right\}

694:  + \sup_{w\leq u}\left\{ \Prob{V\leq w} - \beta\nu([1,w)) \right\} \\

695: &\quad\leq\quad0\,,

696: \end{align*}

697: hence

698: \begin{equation}

699:   \label{eq:sandwich}

700:    0\quad\leq\quad \beta\mu((u,u+u^\prime])\quad\leq\quad \Prob{u<V\leq u+u^\prime}\,,

701: \end{equation}

702: so \(\beta \mu\) is absolutely continuous with respect to

703: \(\Law{V}\) and indeed we can deduce

704: \begin{equation}

705:   \label{eq:representation}

706:   \beta\d\mu(u) \quad=\quad \Indicator{\Prob{V>\cdot} -

707:     \beta\nu((\cdot,\infty))\text{ hits current minimum at }u } \d\Prob{V\leq u}\,.

708: \end{equation}

709: The minorization of \(\beta\mu\) by \(\Law{V}\) follows from this argument:

710: dependence

711: only on \(\beta\nu\)

712: and \(\Law{V}\) follows by construction; finally, stochastic

713: domination of \(\beta \nu\) follows from

714: \begin{align*}

715:   \beta\mu((u,\infty))& \quad=\quad \Prob{V>u} - \inf_{w \leq u}\left\{ \Prob{V>w} -

716:       \beta\nu((w,\infty))\right\} \\

717: & \quad=\quad\sup_{w\leq u}\left\{\beta\nu((w,\infty)) - \Prob{w<V\leq u} \right\} \\

718: &\quad\geq\quad \beta\nu((u,\infty))\,.

719: \end{align*}\end{proof}

720:

721: This concludes the proof of Theorem \ref*{thm:geometric-domCFTP}:

722: use \hyperref[lem:mixture-domination-coupling]{Lemma

723:   \ref*{lem:mixture-domination-coupling}} to couple

724: \(\Law{X_{n+1}\;|\;X_n=x}\) to \(\Law{Y_{n+1}\;|\;Y_n=y}\) whenever

725: \(\Lambda(x)\leq y\) in a way which implements stochastic domination and

726: ensures all the \(X_{n+1}\) regenerate simultaneously whenever \(Y\leq h\).

727: \end{proof}

728:

729: Note that the algorithm requires us to be able to draw from the

730: equilibrium distribution of \(Y\) and to simulate its time-reversed

731: equilibrium dual. Up to an additive constant \(\log(Y)\) is the workload

732: of a \(D/M/1\) queue. This queue is amenable to exact calculations, so

733: these simulation tasks are easy to implement

734: (specializing the theory of the \(G/M/1\) queue as discussed

735: % ,

736: % for

737: % example,

738: in \citeNP[ch.~11]{GrimmettStirzaker-1992}). However in general we do \emph{not} expect this

739: ``universal dominating process'' to lead to practical \domCFTP

740: algorithms! The difficulty in application will arise in determining whether or

741: not regeneration has occurred as in \hyperref[ag:dom-cftp]{Algorithm

742:   \ref*{ag:dom-cftp}}. This will be difficult especially if

743: sub-sampling has been applied, since then one will need detailed

744: knowledge of convolutions of the probability kernel for \(X\)

745: (potentially a harder problem than sampling from equilibrium!).

746:

747: Of course, in practice one uses different dominating processes

748: better adapted to the problem at hand. For example an \(M/D/1\) queue

749: serves as a good log-dominating process for perpetuity-type problems

750: and gives very rapid \domCFTP algorithms indeed, especially when

751: combined with other perfect simulation ideas such as multishift \CFTP

752: \cite{Wilson-1999}, read-once \CFTP \cite{Wilson-2000a}, or one-shot

753: coupling \cite{RobertsRosenthal-2002}.

754:

755: Finally note that, in cases when \(\alpha\in[e^{-1},1)\) or when the small

756: set \(\{x\in\mathcal{X}:\Lambda(x)\leq h\}\) is of order greater than \(1\), we are

757: forced to work with coupling constructions that are effectively

758: \emph{non-co-adapted} (sub-sampling means that target transitions

759: \(X_{mk}\) to \(X_{mk+1}\) depend on sequences \(Y_{mk}\), \(Y_{mk+1}\), \ldots,

760: \(Y_{mk+k}\)). The potential improvements gained by working with

761: non-adapted couplings are already known not only to theory (the

762: non-co-adapted filling couplings of

763: \citeNP{Griffeath-1974,Goldstein-1978}; and the efficiency

764: considerations of \citeNP{BurdzyKendall-1997a}) but also to

765: practitioners (\citeNP{Huber-2004}: non-Markovian techniques in \CFTP;\\

766: \citeNP{HayesVigoda-2003}: non-Markovian conventional MCMC for random

767: sampling of colorings).

768:

769: \section{Counter-example}

770: \label{sec:counter-example}

771: We complete this note by describing a counter-example:

772: a Markov chain \(X\) which satisfies a Foster-Lyapunov

773: condition involving a scale function \(\Lambda\), but such that there can be no

774: recurrent dominating process \(Y\) based on \(\Lambda\).  We begin by

775: choosing a sequence of disjoint measurable sets \(S_1\), \(S_2\), \ldots,

776: subsets of \([1,\infty)\) such that each set places positive measure in

777: every non-empty open set:

778: \begin{lemma}\label{lem:partition}

779:   One can construct a measurable partition \(S_1\), \(S_2\), \ldots of \([1,\infty)\),

780: \[

781: S_1 \sqcup S_2 \sqcup S_3 \sqcup \ldots \quad=\quad [1,\infty)\,,

782: \]

783: with the property

784: \(\Leb(S_i \cap (u,v)) > 0\)

785: for all \(0<u<v<\infty\), all \(i\in\{1, 2, \ldots\}\).

786: \end{lemma}

787: \begin{proof}

788:   Enumerate the rational numbers in \([0,1)\)

789: by \(0=\tilde q_0\), \(\tilde q_1\), \(\tilde q_2\), \ldots . Choose \(\alpha<1/2\), and define

790: \[

791: A_0 \quad=\quad \bigcup_{k=1}^\infty \bigcup_{n=0}^\infty \left[\tilde q_n+k,\tilde q_n+k+\alpha

792:   2^{-n}\right]\,.

793: \]

794: Then for each \(k\geq1\)

795: \[

796: \alpha \quad\leq\quad \Leb\left(A_0\cap[k,k+1)\right) \quad\leq\quad 2\alpha\,.

797: \]

798: Continue by defining a sequence of nested subsets \(A_{r}\subset A_{r-1}\) by

799: \begin{equation}

800:   \label{eq:subsets}

801: A_r \quad=\quad

802: \bigcup_{k=1}^\infty \bigcup_{n=0}^\infty \left[\frac{\tilde q_n+k}{2^r},\frac{\tilde q_n+k}{2^r}+\frac{\alpha}{4^r} 2^{-n}\right]\,,

803: \end{equation}

804: satisfying

805: \begin{equation}

806:   \label{eq:subsets-bounds}

807: \frac{\alpha}{4^r} \quad\leq\quad \Leb\left(A_r\cap\Big[\frac{k}{2^r},\frac{k+1}{2^r}\Big)\right)

808: \quad\leq\quad \frac{2\alpha}{4^r}\,.

809: \end{equation}

810:

811: Thus the measurable shell \(B_r=A_r\setminus A_{r+1}\) places mass of at least

812: \(\frac{\alpha}{2\times4^{r}}\) in each interval

813: \([\frac{k}{2^r},\frac{k+1}{2^r})\)\,.

814:

815: It follows that if \(S\) is defined by

816: \[

817: S \quad=\quad \bigcup_{s=1}^\infty \left(A_{r_s}\setminus A_{r_{s}+1}\right)

818: \]

819: then \(\Leb(S\cap U)>0\) for every open set \(U\subset[1,\infty)\). The desired

820: disjoint sequence \(S_1\), \(S_2\), \ldots is obtained by considering a

821: countably infinite family of disjoint increasing subsequences of the

822: natural numbers.

823: \end{proof}

824:

825: \begin{lemma}

826:   There is a Markov chain \(X\) satisfying a Foster-Lyapunov condition

827:   with scale function \(\Lambda\), such that any

828:   dominating process \(Y\) based on \(\Lambda\) will fail to be positive-recurrent.

829: \end{lemma}

830:

831: \begin{proof}

832:   The Markov chain \(X\) will have state space \([1,\infty)\), with scale

833:   function \(\Lambda(x)\equiv x\).  We begin by fixing \(\alpha\in(e^{-1},1)\), and set

834:   \(C=[1,\alpha^{-1}]\). The set \(C\) will be the small set for the

835:   Foster-Lyapunov condition.  Choose a measurable partition \(S_1 \sqcup

836:   S_2 \sqcup S_3 \sqcup \ldots = [1,\infty)\) as in \hyperref[lem:partition]{Lemma

837:     \ref*{lem:partition}}. Enumerate the rational numbers in \([1,\infty)\)

838:   by \(q_1\), \(q_2\), \ldots.

839:

840: We define the transition kernel \(p(x,\cdot)\) of \(X\) on

841:   \([1,\infty)\) as follows:

842: \begin{itemize}

843: \item[] For \(x\in[1,\alpha^{-1}]\), set

844: \[

845: p(x,\d y) \quad=\quad \exp(-(y-1)) \d y \quad\text{for }y\geq1\,,

846: \]

847: so that if \(X_n\in C\) then \(X_{n+1}-1\) has a unit rate

848: Exponential distribution. Then:

849: \begin{itemize}

850: \item[] \(C\) is a small set for \(X\) of order \(1\) (in fact it

851:   will be a regenerative atom!);

852: \item[] if \(X_n\in C\) then \(\Expect{X_{n+1}}=2\);

853: \item[] if \(X\) has positive chance of visiting state \(1\) then the

854:   whole state space \([1,\infty)\) will be maximally \(\Leb\)-irreducible.

855: \end{itemize}

856: \item[] For \(x>\alpha^{-1}\) and \(x\in S_i\), set

857: \[

858: p(x, \d y) \quad=\quad

859: \left(1-\frac{\alpha}{q_i}\right)\delta_{0}(\d y) + \frac{\alpha}{q_i}\delta_{q_i x}(\d y)\,.

860: \]

861: Note that, because we are using the identity scale \(\Lambda(x)\equiv x\),

862: \begin{itemize}

863: \item[] if \(x\not\in C\) then

864:   \(\Expect{\Lambda(X_{n+1})\;|\;X_n=x}=\Expect{X_{n+1}\;|\;X_n=x}=\alpha x\);

865: \item[] if \(x\not\in C\) then \(\Prob{X_{n+1}=1\;|\;X_n=x}>0\).

866: \end{itemize}

867: \end{itemize}

868: Thus \(X\) satisfies a geometric Foster-Lyapunov condition based on

869: scale \(\Lambda\) and small set \(C\), and so is geometrically ergodic.

870:

871: Suppose \(Y\) is a dominating process for \(X\) based on the identity

872: scale \(\Lambda\). This

873: means it must be possible to couple \(Y\) and \(X\) such that, if

874: \(\Lambda(X_n)=X_n\leq Y_n\) then \(\Lambda(X_{n+1})=X_{n+1}\leq Y_{n+1}\). This can be achieved if

875: and only if

876: \[

877: \Prob{X_{n+1}\geq z \;|\; X_n=u} \quad\leq\quad \Prob{Y_{n+1}\geq z\;|\; Y_n=x}\,\

878: \]

879: for all \(z\geq1\), and Lebesgue-almost all \(u<x\).

880: Therefore we require of such

881: % a Markov chain

882: \(Y\) that

883: \begin{align*}

884: &  \Prob{Y_{n+1}\geq \alpha x y\;|\; Y_n=x} \quad\geq\quad

885: \text{ess}\sup_{u<x} \left\{ \Prob{X_{n+1}\geq \alpha x y \;|\; X_n=u} \right\}\\

886: &\quad=\quad

887: \sup_i \text{ess}\sup\left\{ \frac{\alpha}{q_i}\;:\; \alpha^{-1}<u<x, u\in S_i, q_i

888:   u > \alpha x y\right\} \\

889: &\quad=\quad

890: \sup_i \left\{\frac{\alpha}{q_i}\;:\; q_i > \alpha y \right\}

891: \quad=\quad \frac{1}{y}\,,

892: \end{align*}

893: using Markov's inequality, then the construction of the kernel of

894: \(X\), then the measure-density of the \(S_i\).

895:

896: So such a Markov chain \(Y\) must also (at least when above level

897: \(\alpha^{-1}\)) dominate \(\exp(Z)\), where \(Z\) is a random walk with

898: jump distribution \(\text{Exponential}(1)+\log(\alpha)\). Hence it will

899: fail to be positive-recurrent on the small set \(C\) when \(\alpha\geq e^{-1}\).

900: \end{proof}

901:

902: There may exist some subtle re-ordering to provide \domCFTP for such a

903: chain on a different scale; however the above lemma shows that

904: \domCFTP must fail for dominating processes for \(X\) based on the

905: scale \(\Lambda\).

906:

907:

908: \section{Conclusion}

909: \label{sec:conclusion}

910:

911: We have shown that geometric ergodicity (more strictly, a geometric

912: Foster-\-Lyapunov condition) implies the existence of a special kind of

913: \domCFTP algorithm. The algorithm is not expected to be practical:

914: however it connects perfect simulation firmly with more theoretical

915: convergence results in the spirit of the \citeN{FossTweedie-1997}

916: equivalence between classic \CFTP and uniform ergodicity. Note also

917: that the ``universal dominating process'', the sub-critical

918: \(\exp(D/M/1)\) so derived, is itself geometrically ergodic.

919:

920: It is natural to ask whether other kinds of ergodicity (for example,

921: polynomial ergodicity) can also be related to perfect simulation in

922: this way; this is now being pursued by Stephen Connor as part of his

923: PhD research at Warwick.

924:

925: % ==============================

926: % \ifpdf

927: % {\small

928: % \begin{multicols}{2}

929: % \fi

930: % \bibliographystyle{wchicago}

931:  \bibliographystyle{chicago}

932:

933:   \thispagestyle{plain}

934:   \markboth{REFERENCES}{REFERENCES}

935:

936:  \bibliography{habbrev,ge,wsk}

937: % \ifpdf

938: % \end{multicols}

939: % }

940: % \fi

941:

942: %  \onecolumn\tiny % FANCY version

943: %  \input 427.lat

944:

945: \end{document}

946:

947: \newpage

948: {\scriptsize

949: \textcolor{red}{Changes:}

950: \begin{itemize}

951: \item Removed references to mere ergodicity (inappropriate if all our

952:   chains are Harris recurrent!)

953: \item Set \(b^{\prime\prime}=b/(1-\alpha)\), \(c^{\prime\prime}=b/(\alpha^{k-1} (1-\alpha)^2)\)

954:   explicitly in \hyperref[lem:sub-sampling]{Lemma

955:     \ref*{lem:sub-sampling}}.

956: \item Corrected definition of \(h\) after \hyperref[eq:small-target]{Equation (\ref*{eq:small-target})}!

957: \item Tried to clarify argument leading to

958:   \hyperref[eq:crude-domination]{Inequality

959:     \ref*{eq:crude-domination}}.

960: \item Reordered paragraph ``In case \(e^{-1}<\alpha<1\)\ldots'' on

961:   \pageref{page:marker1}.

962: \item Added note at pages \pageref{page:marker2},

963:   \pageref{page:marker3} to clarify why \(h\geq1\).

964: \item Fixed argument for nonnegativity of \(\beta\mu\) at page

965:   \pageref{page:marker4}, separating it from the argument for \(\beta\mu\)

966:   being absolutely continuous with respect to \(\Law{V}\).

967: \item All reported typos are fixed.

968: \end{itemize}

969: }

970:

971: \end{document}

972:

973: