0608:cs0608006/bc

1: \documentclass[10pt]{article}

2: \usepackage[dvips]{graphicx}

3: \usepackage{amsfonts,verbatim}

4: \usepackage{amsmath,amssymb}

5: \usepackage{amsmath,amsbsy}

6: \usepackage{epsf,epsfig,verbatim}

7: \baselineskip=1.0\normalbaselineskip

8: \renewcommand{\baselinestretch}{1.0}

9: \pagestyle{plain}

10:

11: \usepackage{multirow}

12: \usepackage{slashbox}

13: \usepackage{hhline}

14:

15: \newcommand{\prodin}{\prod_{i=1}^n}

16: \newcommand{\sumin}{\sum_{i=1}^n}

17: \newcommand{\ra}{\rightarrow}

18: \newcommand{\beq}{\begin{equation}}

19: \newcommand{\eeq}{\end{equation}}

20: \newcommand{\C}{{\bf \Large \mathcal{C}}}

21: \newcommand{\mc}{\mathcal}

22: \newcommand{\mb}{\mathbb}

23: \newcommand{\mf}{\mathbf}

24: \newcommand{\tl}{\tilde}

25: \newcommand{\h}{\hat}

26: \newcommand{\bm}{\boldmath}

27: \newcommand{\bs}{\boldsymbol}

28: \newcommand{\e}{\epsilon}

29: \newcommand{\dl}{\delta}

30: \newcommand{\g}{\gamma}

31: \newcommand{\Ae}{A_\epsilon^{(n)}}

32: \newcommand{\Ase}{A_\epsilon^{*(n)}}

33:

34:

35: \newtheorem{thm}{Theorem}

36: \newtheorem{dfn}{Definition}

37: \newtheorem{rem}{Remark}

38: \newtheorem{lem}{Lemma}

39: \newtheorem{cor}{Corollary}

40: \newtheorem{fact}{Fact}

41: \newtheorem{proof}{Proof}

42:

43:

44: \addtolength{\hoffset}{-3cm} \addtolength{\textwidth}{5.5cm}

45: \addtolength{\voffset}{-1.0cm} \addtolength{\textheight}{4cm}

46:

47: \allowdisplaybreaks

48:

49: \begin{document}

50:

51: \title{A Graph-based Framework for Transmission of Correlated Sources

52:   over Broadcast Channels

53: \footnote{This work was supported by NSF CAREER Grant

54:   CCF-0448115. This work was presented in part at the 39th Conference

55:   on Information Sciences and Systems (CISS), Baltimore, MD, March

56:   2005, Information Theory and Applications workshop

57:   (ITA), San Diego, CA, February, 2006, and

58: IEEE International Symposium on Information

59:   Theory (ISIT), Seattle, WA, July 2006.}}

60: \author{Suhan Choi and  S.\ Sandeep Pradhan\\

61:       \{suhanc, pradhanv\}@eecs.umich.edu,\\

62: EECS Dept., Univ. of  Michigan, Ann Arbor, MI.}

63:

64: \maketitle

65:

66:

67: \begin{abstract}

68: In this paper we consider the communication problem that involves

69: transmission of correlated sources over broadcast channels. We

70: consider a graph-based framework for this information transmission

71: problem. The system involves a source coding module and a channel

72: coding module. In the source coding module, the sources are

73: efficiently mapped into a nearly semi-regular bipartite graph, and

74: in the channel coding module, the edges of this graph are reliably

75: transmitted over a broadcast channel. We consider nearly

76: semi-regular bipartite graphs as discrete interface between source

77: coding and channel coding in this multiterminal setting. We provide

78: an information-theoretic characterization of (1) the rate of

79: exponential growth (as a function of the number of channel uses) of

80: the size of the bipartite graphs whose edges can be reliably

81: transmitted over a broadcast channel and (2) the rate of exponential

82: growth (as a function of the number of source samples) of the size

83: of the bipartite graphs which can reliably represent a pair of

84: correlated sources to be transmitted over a broadcast channel.

85: \end{abstract}

86:

87: \setlength{\baselineskip}{18.6pt}

88:

89: \section{Introduction}

90:

91: With the emergence of new set of applications such as wireless sensor

92: networks, the problem of transmission of correlated information

93: sources over multiterminal channels has received a renewed attention.

94: In this problem, many correlated information sources are accessed by a

95: set of transmitter terminals, and they wish to simultaneously transmit

96: some subset of them to another set of receiver terminals over a channel.

97: In this paper we address the one-to-many communication system, where

98: one transmitter terminal  has access to all the information sources,

99: and wish to transmit them to many receiver terminals. One such model

100: involving two receiver terminals was considered by Han and Costa  in

101: \cite{han-costa87}, and  is described in the

102: following. Consider a pair of correlated discrete

103: memoryless sources $(S,T)$ with some generic joint distribution

104: $p(s,t)$ and a pair of finite alphabets $\mc{S}$ and $\mc{T}$,

105: respectively.  The encoder observes long sequences of realizations of these

106: sources (of length say $n$),  and wishes to transmit them over a

107: broadcast channel which has one input $X$ and two outputs $Y_1$ and

108: $Y_2$, and  Receiver i has access to $Y_i$.  The channel behavior is

109: governed by a generic conditional distribution $p(y_1,y_2|x)$. The

110: channel is assumed to be discrete memoryless and is used without

111: feedback.   The encoder maps $n$-length source sequence pairs into

112: $n$-length channel input sequences. Each receiver  maps its

113: corresponding $n$-length channel output sequences into its

114: corresponding $n$-length source reconstruction sequences. The receivers

115: would like to produce a reconstruction sequence pair such that the

116: probability that this deviates from the original pair goes to zero as

117: blocklength $n$ becomes large. If it is possible to build such

118: sequences of mappings, then we say that the source pair is

119: transmissible over the broadcast channel. The goal is to find the set

120: of all sources that are transmissible over a given broadcast channel

121: or the set of all channels over which a given source pair is

122: transmissible.

123:

124: Two approaches have been proposed for this problem in the

125: literature. On is called the joint-source-channel coding approach

126: and the other is the separation-approach. The former approach

127: \cite{han-costa87}  addresses the problem directly by finding the

128: mappings for the given source pair and broadcast channel. A sufficient

129: condition for transmissibility of a source over a channel

130: has been given in \cite{han-costa87} for this

131: problem. In the separation-approach, we divide the encoding task into two

132: sub-tasks, and similarly the decoding task is accomplished in two

133: steps. In this approach, the $n$-length source pair is mapped into

134: three indexes (referred to as $W_0$, $W_1$ and $W_2$)

135: coming from three finite sets of size say $\Delta_0$,

136: $\Delta_1$ and $\Delta_2$, respectively. The first index (common

137: message) is meant for both receivers, and the second and the third indexes

138: (private messages)  are meant for Receiver 1 and Receiver 2,

139: respectively. This is called source encoding. The goal is to remove

140: all the redundancy from the pair to produce three independent bit

141: streams. Then these three indexes are mapped (referred to as channel

142: coding) to $n$-length channel input sequences. On the other side of

143: the channel,

144:  Receiver i first maps its $n$-length channel output

145: into a pair of indexes ($W_0$ and $W_i$) for $i=1,2$. Then they

146: independently produce source reconstruction sequences from the common

147: and the private messages. The first goal is to find the set (called as

148: the rate region) of all

149: the rate tuples $(R_0,R_1,R_2)$, where $R_j=\frac{1}{n}\log \Delta_j$

150: for $j=0,1,2$,

151: at which a  reliable representation of the given source pair can be

152: accomplished. The second goal is to find the set (called as the capacity

153: region) of all rate tuples

154: $(R_0,R_1,R_2)$, at which a reliable communication of indexes over

155: the given channel can be accomplished.

156: The source coding part works under the assumption that the channel is

157: noiseless, and the channel coding part works under the assumption that

158: the messages are independent.

159:

160: The channel coding part by itself was first introduced  by Cover

161: in \cite{cover72,cover75}.

162:  The capacity region has been found for many interesting classes of

163:  broadcast channels

164: \cite{cover72,cover75,bergmans73,bergmans74,gallager74,

165: ahlswede-korner75,korner-marton77,elgamal79,

166: marton77,pinsker78,marton79,

167: gelfand80,elgamal}.  The capacity region of certain class of

168: broadcast channels used in wireless communication systems

169: have been obtained in

170:  \cite{tse97,li01,caire03,viswanathtsc03,vishwanathjgda03,yu04,weingarten04}.

171: Marton \cite{marton79} (also see \cite{meulen75,hajek79,elgamal81})

172: established an inner bound to the capacity

173: region for the discrete memoryless broadcast channel, which

174: contains all the known achievable rate regions.

175: Outer bounds to the capacity region have been obtained in

176: \cite{sato78,marton79,nair06}. See \cite{cover98} for a latest

177: survey of the results on broadcast channels.

178: The source coding part was addressed by Gray and Wyner in

179: \cite{gray74}, where a complete characterization of the rate region

180: was given. We refer this source coding problem as Gray-Wyner problem.

181: However, it is well-known that the separation-approach is not

182: optimal for the one-to-many communication problem, unlike the case of

183: point-to-point information transmission

184: problem.  In particular, a simple  example was given in

185: \cite{han-costa87} that

186: showed that a triangular source can be sent reliably over a Blackwell

187: channel by  using a simple joint-source-channel coding scheme, but

188: there is no way of

189: transmitting that source over that channel using the separation-approach.

190:

191:

192: Loosely speaking, in the separation-approach, there is an interface

193: between the source coding module and the channel coding module.

194: Due to the structure of the system, i.e.,  a common message and a pair

195: of private messages, the interface can be thought of as

196: a finite collection of products of finite sets.

197: For example, in a system with $\Delta_i$ denoting the size of the

198: $i$th message for $i=0,1,2$, the interface can be thought of as

199: $\cup_{i=1}^{\Delta_0} (A_i \times B_i) $, where $A_i=\{(i-1)\Delta_1+1,

200: (i-1)\Delta_1+2, \ldots, i\Delta_1\}$ and

201: $B_i=\{(i-1)\Delta_2+1,

202: (i-1)\Delta_2+2, \ldots, i\Delta_2\}$.

203: However, as seen above, this interface, a finite collection of

204: products of finite sets,

205: is not an efficient representation of a pair of correlated sources for

206: transmission over broadcast channels\footnote{Another reason why this

207:   approach may be suboptimal is given in the following. In the

208:   characterization of the rate region in \cite{gray74}, it turns out

209:   that for certain choices of the triple $(R_0,R_1,R_2)$ that belongs

210:   to the boundary of the rate region, i.e., optimal triple,

211: the private indexes produced by the joint

212:   encoder will \emph{not} be independent asymptotically. Hence, the

213:   channel coding module that follows, which works under the assumptions of

214:   independence, can not exploit this correlation, and is wasteful of

215:   resources.   For example, in (15a) in \cite{gray74}, if one chooses

216:   $W$ such that $(X,W,Y)$ do \emph{not} form a Markov chain, then the triple

217: $(R_0,R_1,R_2)=(I(X,Y;W),H(X|W),H(Y|W))$ has this property. }.

218: Note that the corresponding interface for a similar approach in the

219: point-to-point case is just a finite set. It is also well-known that

220: a similar separation-approach is not optimal for other multiuser

221: information transmission problems such as many-to-one communication

222: \cite{cover-elgamal-salehi80}.

223:

224: In our recent work \cite{pradhan-choi-isit,pradhan-choi06}, we have

225: reported a  bipartite graph-based framework for the problem of

226: transmission  of information in the many-to-one case. A similar

227: approach was also studied in \cite{ahlswede-han83}. In the present

228: work, we consider a similar approach to the one-to-many communication

229: scenario. The fundamental motivation for this comes from the

230: concept of typicality \cite{cover-thomas}.  Given a correlated

231: source pair $(S,T)$, a sequence in $\mc{S}^n$

232: is said to be typical (or individually  typical) with respect to

233: $p(s)$, if its empirical

234: histogram is close to $p(s)$. Similarly one can define typical

235: sequences in $\mc{T}^n$. A sequence pair in $\mc{S}^n \times \mc{T}^n$

236: is said to be jointly typical if its empirical joint histogram is

237: close to $p(s,t)$. Using the law of large numbers, it follows that

238: (a) there are roughly $2^{nH(S)}$ and $2^{nH(T)}$ individually typical

239: sequences in $\mc{S}^n$ and $\mc{T}^n$, respectively, where $H(\cdot)$

240: denotes entropy \cite{cover-thomas}, (b) there are roughly

241: $2^{nH(S,T)}$ jointly typical sequence pairs in $\mc{S}^n \times

242: \mc{T}^n$, (c) for every typical

243: sequence in $\mc{S}^n$, there are roughly $2^{nH(T|S)}$ typical

244: sequences in $\mc{T}^n$ that are jointly typical and vice versa,

245: (d) probability, under $p(s,t)$, of the set of jointly typical sequences

246: (called jointly typical set)

247: is close to $1$, and (e) the probability, under $p(s,t)$, of every

248: jointly typical sequence pair is roughly equal to

249: $2^{-nH(S,T)}$. These five properties lead one to associate a

250: bipartite graph on the jointly typical set, with vertexes

251: formed by individually typical sequences, and two vertexes are connected by

252: an edge if they are jointly typical. Such bipartite graphs, where the

253: degrees of the vertexes of one set is close to one constant, and the

254: degrees of that of the  other set is close to another constant, are referred to

255: as nearly semi-regular \cite{vanlint}.  Hence graphs can naturally

256: capture the behavior of the source pair. The details regarding the

257: source distribution can be dispensed with, and one can just work with

258: this bipartite graph. This may also lead to the

259: possibility of using them as discrete interface for one-to-many

260: communication. The source encoder would now act on the source pair and

261: produce correlated messages, or edges in a bipartite graph, and the

262: channel encoder would now work with correlated messages and reliably

263: transmit the edges in the graph over the broadcast channel. We would still

264: have a source coding module and a channel coding module. However, now

265: they would be interfaced using nearly semi-regular bipartite graphs

266: rather than just a finite collection of products of finite sets. Of

267: course, a finite collection of products of finite sets is a special

268: case of nearly semi-regular bipartite graphs.

269:

270:

271: We now present a brief summary of the results presented in this paper,

272: for which we need some definitions. A nearly semi-regular bipartite

273: graph is said to

274: have parameters $(\theta_1,\theta_2,\theta'_1,\theta'_2)$ if the $i$th

275: vertex set has size nearly equal to $\theta_i$ for $i=1,2$, and the

276: degrees of

277: vertexes of the first set is nearly equal to $\theta'_2$ and vice

278: versa. With a slight abuse of notation, a

279: nearly semi-regular bipartite  graph is said to

280: have parameters $(\theta_0,\theta_1,\theta_2,\theta'_1,\theta'_2)$

281: if it is the union of $\theta_0$ disjoint subgraphs each having

282: parameters $(\theta_1,\theta_2,\theta'_1,\theta'_2)$.

283: A tuple of rates  $(R_0,R_1,R_2,R'_1,R'_2)$ is said to be achievable

284: for the given broadcast channel if \emph{there exists} a bipartite graph

285:  with parameters $(2^{nR_0},2^{nR_1},2^{nR_2},2^{nR'_1},2^{nR'_2})$

286:  whose edges can be reliably transmitted by using the channel $n$

287:  times for large $n$. Similarly, a tuple of rates

288:  $(R_0,R_1,R_2,R'_1,R'_2)$ is said to be

289: achievable for a given pair of correlated sources if \emph{there

290:   exists} a bipartite graph with parameters

291: $(2^{nR_0},2^{nR_1},2^{nR_2},2^{nR'_1},2^{nR'_2})$  which can reliably

292: represent $n$ realizations of the pair for large $n$.

293: We provide information-theoretic partial

294:  characterizations of the sets of achievable tuples for a

295:  broadcast channel  and a correlated source pair.

296: These are presented in Theorem 1-4.

297: Having the significance of the proposed framework mentioned first, let

298: us look at the limitations of this optimistic framework as well.

299: For that we need to look at the big picture.

300:

301: For the point-to-point case, to

302: check the transmissibility of a source over a channel, we just need to

303: check the non-emptiness of the intersection of two intervals

304: $[H,\infty)$ and $[0,C]$, where $H$ denotes the entropy of the source,

305:   and $C$ denotes the capacity of the channel. Note that only one parameter

306:   specifies the interval. For the one-to-many communication, the

307: conventional separation-approach gives a sufficient condition for checking the

308:   transmissibility: non-emptiness of the rate region of the

309:   source pair and that of the broadcast channel. We need three

310:   parameters $(R_0,R_1,R_2)$ to specify the rate region in both source

311:   coding as well as channel coding.

312: Clearly, a  characterization that involves the fewest number of

313: parameters is what we want.

314:

315:

316: In the graph-based framework, we consider rate regions for the source

317: and the channel, which are specified using five parameters:

318: $(R_0,R_1,R_2,R'_1,R'_2)$.  However, the non-empty intersection of

319: the rate region of the source pair with that of the channel still does not

320: guarantee successful transmission. This is because, graphs having the

321: same set of parameters may have different structures. It turns out

322: that these

323: graphs (that have the same set of parameters) can be partitioned into

324: equivalence classes, where all graphs

325: in an equivalence class have the same structure. This structure of the

326: graphs has also been studied under the name of graph isomorphism

327: \cite{vanlint} in the literature.  We will address this

328: issue more formally in later sections.  Hence a graph with

329: parameters (say) $(2^{nR_0},2^{nR_1},2^{nR_2},2^{nR'_1},2^{nR'_2})$

330: which can reliably represent a source pair may not belong to the

331: equivalence class of a graph with the same set of parameters whose

332: edges can be transmitted reliably over the channel.

333: In other words, to guarantee successful transmission of the source

334: over the channel, we need to

335: construct at least one pair of transmission systems (one for the source

336: component and one for the channel component) for every equivalence class.

337: Rather, in this work we have shown (a) the existence of an  equivalence

338: class for which a transmission system could be built in the source

339: coding component, and (b)  the existence of an  equivalence

340: class for which a transmission system could be built in the channel

341: coding component.

342: We plan to address this issue further in our future work.

343: But we believe that the results given in this paper may be a first

344: step toward an optimal discrete interface for multiuser

345: communication.

346:

347: The outline of the remaining part of this paper is as follows. In

348: Section \ref{sec:bc-prelim}, we first provide a summary

349: of the results in a formal setting that are available in the

350: literature that are closely related to our work. Then, in

351: Section \ref{sec:bc-problem}, we formulate the problem and consider

352: certain properties of bipartite graphs that are relevant to our

353: later discussion. Then channel coding part will be discussed in

354: Section \ref{sec:general-bc}, resulting in an achievable rate region

355: for the broadcast channel with correlated messages. We also consider

356: the special case of broadcast channels with one deterministic

357: component. Thereafter, the complementary source coding part, the representation

358: of correlated sources into graphs, will be described in

359: Section \ref{sec:bc-sc}. After that, an example and its

360: interpretation are provided in Section \ref{sec:bc-example}.

361: Finally, Section \ref{sec:bc-conclusion} provides some concluding

362: remarks.

363:

364: \section{Preliminaries}\label{sec:bc-prelim}

365:

366: In this section, we provide an overview of the most important prior

367: results in the literature on broadcast channels, and the related

368: source coding problem,  which are closely related to our work.

369:

370: \subsection{Broadcast Channel with Independent private Messages}

371: A broadcast channel is composed of one sender and many receivers.

372: The objective is to broadcast information from a sender to the many

373: receivers. We consider broadcast channels with only two receivers

374: since multiple receivers cases can be similarly treated. Figure

375: \ref{fig:broadcast} shows a broadcast channel with one sender and

376: two receivers. The discrete memoryless stationary broadcast channel consists of

377: an input alphabet $\mathcal{X}$ and two output alphabets

378: $\mathcal{Y}_1$ and $\mathcal{Y}_2$ and a conditional distribution

379: $p(y_1,y_2|x)$. The induced $n$-length conditional distribution is

380: given by $p(y_1^n,y_2^n|x^n)=\prod_{i=1}^n p(y_{1i},y_{2i}|x_i)$ when used

381: without feedback. In other words, a broadcast channel is an ordered

382: tuple $(\mc{X},\mc{Y}_1,\mc{Y}_2, p(y_1,y_2|x))$.

383: \begin{figure}[!h]

384: \centering \epsfig{file=broadcast.eps, clip=, width=.7\linewidth}

385: \centering \caption{\small Broadcast channel}\label{fig:broadcast}

386: \end{figure}

387: We consider the following definitions for transmission of

388: independent messages over a broadcast channel.

389:

390: \begin{dfn}

391: A transmission system with parameters $(n,\Delta_0,\Delta_1,\Delta_2,\tau)$

392: for the given broadcast channel $(\mc{X},\mc{Y}_1,$ $\mc{Y}_2,

393: p(y_1,y_2|x))$ is an ordered tuple $(f,g_1,g_2)$, consisting of one

394: encoding mapping $f$ and two decoding mappings $g_1$ and $g_2$ where

395: \begin{itemize}

396: \item $f: \{1,2,\ldots,\Delta_0\} \times

397: \{1,2,\ldots,\Delta_1\} \times \{1,2,\ldots,\Delta_2\}

398:   \rightarrow \mc{X}^n$,

399: \item $g_i: \mc{Y}_i^n \rightarrow \{1,2,\ldots,\Delta_0\} \times

400: \{1,2,\ldots,\Delta_i\}$, for $i=1,2$,

401: \item such that a performance measure given by the average probability

402:   of error satisfies:

403:     \[

404:       \tau= \sum_{k=1}^{\Delta_0}  \sum_{i=1}^{\Delta_1} \sum_{j=1}^{\Delta_2}

405: \frac{1}{\Delta_0 \Delta_1 \Delta_2} Pr \left[ (g_1(Y_1^n), g_2(Y_2^n)) \neq

406: ((k,i),(k,j)) | X^n=f(k,i,j) \right].

407:     \]

408: \end{itemize}

409: \end{dfn}

410:

411: \begin{dfn}

412:   A rate tuple $(R_0,R_1, R_2)$ is said to be \emph{achievable} for the

413:   given broadcast channel if for all $\e > 0$, and for sufficiently

414:   large $n$, there exists a transmission system as defined above

415:   satisfying $\frac{1}{n}\log \Delta_i > R_i -\e$ for $i=0,1,2$ and with

416:   the average probability of error $\tau < \e$.

417: \end{dfn}

418:

419: \begin{dfn}

420:   The \emph{capacity region} $\mathcal{C}_{BCI}$ of the broadcast

421:   channel is the set of all achievable rate tuples

422:   $(R_0,R_1, R_2)$.

423: \end{dfn}

424:

425: The capacity region of general broadcast channels is still not known. But

426: Marton \cite{marton79} and Gelfand and Pinsker \cite{gelfand80} have

427: obtained an  achievable rate region for the general discrete

428: memoryless broadcast channel,

429:  which is the largest known inner bound to the capacity region.

430: An achievable region of the discrete memoryless broadcast channel

431: \cite{csiszar-korner} is given by all rate tuples $(R_0,R_1,R_2)$

432: satisfying

433:   \begin{align}

434:     R_0 &< \min\{I(Z;Y_1),I(Z;Y_2)\},\\

435:     R_0 + R_1 &< I(Z,U;Y_1), \\

436:     R_0 + R_2 &< I(Z,V;Y_2), \\

437:     R_0 + R_1 + R_2 &< \min\{I(Z;Y_1),I(Z;Y_2)\}+I(U;Y_1|Z)+I(V;Y_2|Z)-I(U;V|Z)

438:   \end{align}

439: for some $p(z,u,v,x)$ on ${\mathcal Z} \times $ ${\mathcal U} \times

440: {\mathcal V} \times {\mathcal X}$, where $Z$, $U$  and $V$ are

441: auxiliary random variables with finite alphabets ${\mathcal Z}$,

442: ${\mathcal U}$,  and ${\mathcal V}$, respectively  such

443: that $(Z,U,V) \rightarrow X \rightarrow (Y_1,Y_2)$ form a Markov

444: chain.

445:

446: \subsection{Gray-Wyner problem}

447:

448: Consider a pair of correlated sources with a joint distribution

449: $p(s,t)$ and with finite alphabets ${\mathcal S}$ and ${\mathcal

450:   T}$. The sources are assumed to be stationary and memoryless. These

451: sources are denoted as an ordered tuple $({\mathcal S},{\mathcal

452:   T},p(s,t))$.

453:

454: \begin{dfn}

455: A transmission system with parameters

456: $(n,\Delta_0,\Delta_1,\Delta_2,\tau)$  for representing a pair of

457: correlated sources $({\mathcal S},{\mathcal T},p(s,t))$ is an ordered

458: tuple $(f_0,f_1,f_2,g_1,g_2)$ consisting of three encoding mappings

459: $f_0$, $f_1$ and $f_2$,  and two decoding mappings $g_1$ and $g_2$ where

460: \begin{itemize}

461: \item $f_i: \mc{S}^n \times \mc{T}^n \rightarrow

462:  \{1,2,\ldots,\Delta_i\},  \ \ \mbox{for } \ \ i=0,1,2$,

463: \item $g_1:  \{1,2,\ldots,\Delta_0\} \times \{1,2,\ldots,\Delta_1\}

464: \rightarrow  \mc{S}^n$, \ \ \

465:  $g_2:  \{1,2,\ldots,\Delta_0\} \times \{1,2,\ldots,\Delta_2\}

466: \rightarrow  \mc{T}^n,$

467: \item such that a performance measure given by the probability

468:   of error satisfies:

469:     \[

470:       \tau= Pr \left[(g_1(f_0(S^n,T^n),f_1(S^n,T^n)),

471: g_2(f_0(S^n,T^n),f_2(S^n,T^n)) \neq (S^n,T^n)\right].

472:     \]

473: \end{itemize}

474: \end{dfn}

475:

476: \begin{dfn}

477:   A rate tuple $(R_0,R_1, R_2)$ is said to be \emph{achievable} for the

478:   given correlated sources if for all $\e > 0$, and for sufficiently

479:   large $n$, there exists a transmission system as defined above

480:   satisfying $\frac{1}{n}\log \Delta_i < R_i +\e$ for $i=0,1,2$, and with

481:   the average probability of error $\tau < \e$.

482: \end{dfn}

483:

484: \begin{dfn}

485:   The \emph{achievable region} $\mathcal{R}_{GWI}$ for the correlated

486:   sources is the set of all achievable rate tuples.

487: \end{dfn}

488:

489: An information-theoretic characterization of this rate region

490: \cite{gray74,wyner75} is given in the

491: following. For a given pair of correlated sources a  tuple

492: $(R_0,R_1,R_2)$ is achievable  if and only if

493:  \begin{align}

494:     R_0 &> I(S,T;Z), \\

495:     R_1 &> H(S|Z), \\

496:     R_2 &> H(T|Z),

497:     \end{align}

498: for some distribution

499: $p(z,s,t)=p(s,t)p(z|s,t)$, where $Z$ is an auxiliary random variable with a

500: finite alphabet ${\mathcal Z}$. Using convexity arguments, it can be

501: shown that there is no loss of optimality if $|{\mathcal Z}| \leq

502: |{\mathcal S}||{\mathcal T}|$.

503:

504: This problem was  also considered in \cite{wyner75} in a slightly different

505: form. The minimum $R_0$ that belongs to the rate region such that the

506: corresponding $Z$ satisfies $T \rightarrow Z \rightarrow S$ a Markov

507: chain,  is called as

508: Wyner's common information $C(S,T)$.

509:

510: \subsection{Joint source-channel coding}

511: Consider the joint source-channel coding scheme studied in

512: \cite{han-costa87}. Suppose we are given a pair of correlated sources

513: without common part \cite{gacs,wit,wyner75}

514: \footnote{\cite{han-costa87} considered the general

515:   setting where sources may have non-zero common part.}   and a

516: broadcast channel.

517:

518: \begin{dfn}A transmission system with parameters $(n,\tau)$ for

519:   transmission of a pair of correlated sources $({\mathcal

520:   S},{\mathcal T}, p(s,t))$ and a broadcast channel

521: $({\mathcal X},{\mathcal Y}_1,{\mathcal Y}_2,p(y_1,y_2|x))$ is an

522: ordered tuple $(f,g_1,g_2)$ where

523: \begin{itemize}

524: \item $f:{\mathcal S}^n \times {\mathcal T}^n \rightarrow {\mathcal

525:   X}^n$

526: \item $g_1: {\mathcal Y}_1^n \rightarrow {\mathcal S}^n$, and

527: $g_2: {\mathcal Y}_2^n \rightarrow {\mathcal T}^n$.

528: \item such that a performance measure given by the probability of

529:   decoding error satisfies

530: \[

531: \tau= \sum_{(s^n,t^n) \in {\mathcal S}^n \times {\mathcal T}^n}

532: p^n(s^n,t^n) Pr[(g_1(Y_1^n),g_2(Y_2^n))\neq (s^n,t^n)|X^n=f(s^n,t^n)].

533: \]

534: \end{itemize}

535:

536: \end{dfn}

537:

538: \begin{dfn}

539: A pair of correlated sources is said to be transmissible over a

540: broadcast channel if $\forall \e>0$, and for all sufficiently large

541: $n$, there exists a transmission system as defined above with

542: parameters $(n,\tau)$ such that $\tau < \e$.

543: \end{dfn}

544:

545: The result of \cite{han-costa87} says that a pair of correlated

546: sources is transmissible over a broadcast channel if

547: \begin{align}

548: H(S) &< I(S,W,U;Y_1)-I(T;W,U|S) \\

549: H(T) &< I(T,W,V;Y_2)-I(S;W,V|T) \\

550: H(S,T) &< \min \{I(W;Y_1),I(W;Y_2)\} +I(S,U;Y_1|W)+

551: I(T,V;Y_2|W)-I(S,U;T,V|W)

552: \end{align}

553: for some $p(w,u,v,x|s,t)$ on

554: ${\mathcal W} \times {\mathcal U} \times {\mathcal V} \times {\mathcal

555:   X} \times  {\mathcal S} \times {\mathcal T}$, where $W,U,V$ are

556: auxiliary random variables with finite alphabets

557: ${\mathcal W}$, ${\mathcal U}$ and ${\mathcal V}$ such that

558: $(S,T) \rightarrow (WUV)\rightarrow X \rightarrow (Y_1,Y_2)$ form

559: a Markov chain.

560:

561: This problem has also been considered in different settings recently in

562: \cite{tuncel05,coleman06}.

563:

564:

565: \subsection{An Example of transmission of Correlated Sources over the Broadcast

566:   Channel}\label{sec:bc-ex}

567:

568: Let us consider an interesting example given in \cite{han-costa87}

569: showing the advantage of encoders that exploit the correlation

570: between sources. Consider the transmission of a set of correlated

571: sources $(S,T)$ with the joint distribution $p(s,t)$ given by

572: \begin{center}

573:

574: \renewcommand{\arraystretch}{1.5}

575:

576: \setlength{\fboxsep}{2pt}\setlength{\tabcolsep}{3pt}

577:

578: \begin{tabular}{|c|c|c|}

579: \hline

580:  \setlength{\backslashbox{$T$}{$S$}} &0 & 1 \\ \hline

581:  0 & $~~~\frac{1}{3}~~~$ & 0 \\ \hline

582:  1 & $\frac{1}{3}$ & $~~~\frac{1}{3}~~~$ \\\hline

583: \end{tabular}

584: \end{center}

585: with finite alphabet $\mc{S}=\mc{T}=\{0,1\}$ over a Blackwell

586: channel with $\mc{X}=\{1,2,3\}$, $\mc{Y}_1=\mc{Y}_2=\{0,1\}$ where

587: the channel transition probabilities are specified by

588: $p(0,0|1)=p(0,1|2)=p(1,1|3)=1$. If we assign $X$=1, 2, and 3 to

589: $(S,T)$ = (0,0), (0,1), and (1,1), respectively, then $Y_1$ and

590: $Y_2$ determine $S$ and $T$ without error, respectively.

591:

592: In the conventional separation-approach,

593: first, factor $(S^n,T^n)$ into three independent messages

594:   $W_1,W_2$, and $W_0$.

595: Next, transmit $(W_0,W_1)$ and $(W_0,W_2)$ to receivers 1 and 2,

596:   respectively, over the broadcast channel using some, hitherto unknown,

597:   optimal coding scheme, so that $(W_0,W_1)$ and $(W_0,W_2)$ reliably

598:   determine $S^n$ and $T^n$, respectively.

599: Let the rate of $W_i$ be $R_i=\frac{1}{n}H(W_i)$ for

600: $i$ = 0, 1 and 2. Then, as the channel in consideration is

601: deterministic, the sum of these rates must be bounded as  $R_0

602: + R_1 + R_2 \leq \frac{1}{n} H(Y_1^n, Y_2^n) \leq \frac{1}{n} H(X^n)

603: \leq \log_2 3=H(S,T)$.

604:

605: According to \cite{wyner75}, the most efficient

606: decomposition of this kind with the constraint $R_0 + R_1 + R_2

607: \leq H(S,T)$ is attained when $R_0=C(S,T)$ In this case,

608: \begin{align}

609:   R_0 + R_1 + R_2 &= H(S,T) = \log_2 3 \textrm{(bits)}\\

610:   2R_0 + R_1 + R_2 &= \log_2 3 + I(S,T;Z) \geq  \log_2 3+C(S;T).

611: \end{align}

612: For this triangular source,

613: $C(S;T)=\log_2 3-h_2(\frac{2}{3}) = \frac{2}{3}$ where

614: $h_2(p)=-p\log_2 p-(1-p)\log_2(1-p)$. Therefore, $2R_0 + R_1 + R_2

615: \geq \log_2 3 +\frac{2}{3}\approx 2.252$ (bits). Consequently,

616: $R_0+R_1 > 1$ or $R_0+R_2 >1$. Since $H(Y_1) \leq 1$ and $H(Y_2)

617: \leq 1$ for any distribution, receiver 1 cannot reliably reproduce

618: $(W_0,W_1)$ or receiver 2 cannot reliably reproduce $(W_0,W_2)$.

619: Thus there is no way of reliably transmitting this triangular

620: source via the Blackwell channel by factoring the sources into $W_1,W_2$, and

621: $W_0$ as shown above.

622:

623: \section{Problem Formulation}\label{sec:bc-problem}

624:

625: \begin{figure}[h]

626: \centering \epsfig{file=bc-block.eps, clip, width=0.65\linewidth}

627: \centering \caption{\small Transmission of correlated sources over a

628: broadcast channel} \label{fig:bc-block}

629: \end{figure}

630:

631: The problem we are addressing is the simultaneous transmission of

632: two correlated sources $(\mc{S},\mc{T},p(s,t))$ over a

633: broadcast channel

634: $({\mathcal X},{\mathcal Y}_1,{\mathcal Y}_2,p(y_1,y_2|x))$,

635: with one sender and

636: two receivers as shown in Figure \ref{fig:bc-block}. Here, the

637: encoder can access both sources $(S,T)$ and the receivers can not

638: communicate with each other.

639: The encoder is given by a mapping $f: \mc{S}^n \times \mc{T}^n \ra

640: \mc{X}^n$. The decoders are given by mappings $g_1:\mc{Y}_1^n \ra

641: \mc{S}^n$ and $g_2:\mc{Y}_2^n \ra \mc{T}^n$. The performance measure

642: associated with this transmission system is the probability of

643: decoding error:

644: \begin{equation}

645:   Pr[(S^n,T^n)\neq(g_1(Y_1^n),g_2(Y_2^n))].

646: \end{equation}

647:

648:

649: \subsection{Basic Concepts}

650:

651: \begin{figure}[h]

652: \centering \epsfig{file=bc-block-sc-cc.eps, clip,

653: width=0.9\linewidth} \centering \caption{\small The outputs of

654: sources are first mapped into edges in a bipartite graph, and the

655: edges coming from this graph are reliably transmitted over a

656: broadcast channel} \label{fig:bc-block-sc-cc}

657: \end{figure}

658:

659: We consider a modular approach to this problem, which is shown in

660: Figure \ref{fig:bc-block-sc-cc}. The system has two modules: the

661: source coding module and the channel coding module. The outputs of

662: two correlated sources are first represented efficiently into a

663: triple of messages $(W_0,W_1,W_2)$ in the source coding module.

664: Then, these messages are reliably transmitted over the broadcast

665: channel in the channel coding module.

666: In more detail, the source coding module produces three

667: messages $W_0, W_1$ and $W_2$, where $W_0$ is a common message to

668: both receivers which contains common information about both sources

669: $S$ and $T$, and $W_1$ and $W_2$ are private messages which contain

670: the individually remaining information about the source $S$ and $T$

671: after extracting the common information, respectively. The messages

672: $W_0,W_1$ and $W_2$ belong to integer sets

673: $\mathcal{W}_0=\{1, 2, \ldots, \Delta_0\}$, $\mathcal{W}_1=\{1, 2,

674: \ldots, \Delta_1\}$ and $\mathcal{W}_2=\{1, 2, \ldots, \Delta_2\}$,

675: respectively. In general, $W_0,W_1$ and $W_2$ are not independent.

676: Then, the channel

677: coding module wants to reliably transmit a message pair $(W_0,W_1)$

678: to receiver 1 and $(W_0,W_2)$ to receiver 2.

679:

680: We assume that there is some kind of correlation between two

681: messages $W_1$ and $W_2$, i.e., private messages

682:  for the receivers can not be chosen independently.

683: \begin{dfn}

684: \begin{itemize}

685:   \item The private messages of receivers are said to be \emph{correlated}, if

686: for every $w_0 \in \mc{W}_0$,   there exists a set $A(w_0)$ such that $A(w_0)

687: \subset \mathcal{W}_1 \times

688: \mathcal{W}_2$,  and conditioned on $W_0=w_0$ the message

689:   pairs $(W_1,W_2) \in A(w_0)$ are equally likely with probability

690:   $\frac{1}{|A(w_0)|}$, and the message pairs $(W_1,W_2) \notin A(w_0)$ have

691:   probability zero.

692:   \item The private messages of the receivers are said to be

693:   \emph{independent},

694:   if $A(w_0) = \mathcal{W}_1 \times

695: \mathcal{W}_2$ for all $w_0 \in \mc{W}_0$.  In this case, the message pairs

696:     $(W_1,W_2)$ are equally likely with probability

697:     $\frac{1}{|\mathcal{W}_1

698: \times \mathcal{W}_2|}$.

699: \end{itemize}

700: \end{dfn}

701: We use bipartite graphs to model the correlation of the messages,

702: i.e., the set $A(w_0)$ is taken to be a bipartite graph for all $w_0

703: \in \mc{W}_0$. Let us first

704: define a bipartite graph and related mathematical terms before we

705: discuss the main problem.

706:

707: \begin{dfn}

708: \begin{itemize}

709:   \item A \emph{bipartite graph} $G$ is defined as an ordered tuple

710: $G=(A_1,A_2,B)$ where $A_1$ and $A_2$ are two

711:   non-empty sets of vertexes, and $B$ is a set of edges where every edge

712:    of $B$ joins a vertex in $A_1$ to a vertex in $A_2$, i.e., $B \subseteq A_1 \times A_2$.

713:   \item If $G$ is a bipartite graph, let $V_1(G)$ and $V_2(G)$ denote  the

714:     first and the second vertex sets of $G$, respectively,  and

715:   $E(G)$ denote the edge set of $G$.

716:   \item If $(i,j) \in E(G)$, then $i$ and $j$ are said to be

717:   \emph{adjacent}.

718:   \item If each vertex in one set is adjacent to every vertex in the other

719:   set, then $G$ is said to be a \emph{complete} bipartite graph. In this case, $E(G) = V_1(G) \times

720:   V_2(G)$.

721:   \item The \emph{degree} of a

722: vertex $v \in V_i(G)$ in a graph $G$, denoted by

723: $\mathrm{deg}_{G,i}(v)$, is the number of edges

724: connected to $v$ for $i=1,2$.

725:   \item A \emph{subgraph} of a graph $G$ is a graph whose vertex and

726:   edge sets are subsets of those of $G$.

727: \end{itemize}

728: \end{dfn}

729: Since we consider a specific type of bipartite graphs in our

730: discussion, let us define those bipartite graphs as well. Although our main

731: results deal with nearly semi-regular graphs, for the purpose of

732: illustration, we consider semi-regular graphs for this section

733: alone.

734:

735: \begin{dfn}\label{def:graph-4-parameters}

736: A bipartite graph $G$ is called \emph{semi-regular} \cite{vanlint}

737:  with parameters $(\theta_1, \theta_2, \theta_1',

738: \theta_2')$, if it satisfies:

739: \begin{itemize}

740:   \item $|V_i(G)|=\theta_i$ for $i$=1, 2,

741:   \item $\forall u \in V_1(G)$, $\mathrm{deg}_{G,1}(u)= \theta_2'$,

742:   \item $\forall v \in V_2(G)$, $\mathrm{deg}_{G,2}(v)= \theta_1'$.

743: \end{itemize}

744: \end{dfn}

745:

746: As an example, two semi-regular bipartite graphs with parameters

747: $(4,6,2,3)$ are shown in Figure \ref{fig:semiregular-graph-ex}. Note

748: that there exist many semi-regular bipartite graphs with the same

749: set of parameters.

750:

751: \begin{figure}[h]

752: \centering \epsfig{file=semiregular-graph-ex.eps, clip,

753: width=0.4\linewidth} \centering \caption{\small Examples of

754: semi-regular bipartite graphs with parameters $(4, 6, 2, 3)$}

755: \label{fig:semiregular-graph-ex}

756: \end{figure}

757:

758: \begin{dfn}\label{def:graph-5-parameters}

759:  A bipartite graph $G$ is called \emph{nearly semi-regular} with parameters $(\Delta_1$, $\Delta_2$, $\Delta_1'$,

760: $\Delta_2'$, $\mu)$ for $\mu > 1$ if it satisfies:

761: \begin{itemize}

762:   \item $|V_i(G)|=\Delta_i $ for $i=1, 2$,

763:   \item $\forall u \in V_1(G)$, $\Delta_2' \mu^{-1} \leq

764:   \mathrm{deg}_{G,1}(u) \leq \Delta_2' \mu$, \ \ \ and \ \

765:   $\forall v \in V_2(G)$, $\Delta_1' \mu^{-1}  \leq \mathrm{deg}_{G,2}(v)  \leq \Delta_1' \mu$.

766: \end{itemize}

767: \end{dfn}

768: Note that $\mu$ is a slack parameter, which determines the range of

769: degrees of vertexes.

770:

771: \begin{dfn}\label{def:bc-graph-5-parameters}

772:  A nearly semi-regular

773: bipartite graph $G$ is said to have parameters $(\Delta_0, \Delta_1,

774: \Delta_2, \Delta_1', \Delta_2', \mu)$ for $\mu > 1$, if it

775: satisfies the following conditions:

776: \begin{itemize}

777:   \item $G$ is the union  of $\Delta_0$ disjoint subgraphs

778:   $\tilde{G}_m$ for $m=1,2,\ldots,\Delta_0$ where $\tilde{G}_m$ is

779:   nearly semi-regular with parameters

780: $(\Delta_1, \Delta_2, \Delta_1', \Delta_2', \mu)$ as in Definition

781: \ref{def:graph-5-parameters}.

782:   \item $\forall m \in \{1,2,\ldots, \Delta_0\}$,

783:     $V_i(\tilde{G}_m)=\{(m-1)\Delta_i+1,(m-1)\Delta_i+2,\ldots,m\Delta_i\}$

784:     for

785:     $i$=1, 2,

786:   \item $V_i(G)=\bigcup_{m=1}^{\Delta_0} V_i(\tilde{G}_m)$ for $i$=1, 2,

787:   \ \ and \ \  $E(G)=\bigcup_{m=1}^{\Delta_0} E(\tilde{G}_m)$,

788:  \end{itemize}

789: \end{dfn}

790:

791: A triple of messages (random variables) $(W_0,W_1,W_2)$ can be

792: associated with a graph $G$ with parameters $(\Delta_0$, $\Delta_1$,

793: $\Delta_2$, $\Delta_1'$, $\Delta_2'$, $\mu)$ in the following way.

794: $W_i \in

795: \mc{W}_i$ and $\mc{W}_i=\{1,2,\ldots,\Delta_i\}$ for $i=0,1,2$, and

796: an edge $((m-1)\Delta_1+i,(m-1)\Delta_2+j)\in E(G)$ denotes a

797: realization of the triple

798: of messages $(W_0,W_1,W_2)=(m,i,j)$. In other words,

799: $Pr[(W_0,W_1,W_2)=(m,i,j)]=\frac{1}{|E(G)|}$ if

800: $((m-1)\Delta_1+i,(m-1)\Delta_2+j)\in E(G)$ and $0$ otherwise.

801: So, different correlation

802: structures of the messages can be modeled by varying the

803: structure of the graph. Figure \ref{fig:bc-msg-graph-ex} illustrates

804: an example of a bipartite graph $G$ with parameters

805: $(2, 4, 4, 2, 2, 1)$ composed of two

806: disjoint subgraphs. This graph can be associated with a triple of

807: messages $(W_0,W_1,W_2)$ where $W_i \in \mc{W}_i$ for $i=1,2,3$ such

808: that $\mc{W}_0=\{1,2\}$, and $\mc{W}_1=\mc{W}_2=\{1,2,3,4\}$. Note

809: that in the figure two subgraphs $\tl{G}_1$ and $\tl{G}_2$ have

810: different edge structures. This implies that the correlation of

811: $(W_1,W_2)$ depends on $W_0$. $W_0$ and $W_1$ are independent,

812: and so are $W_0$ and $W_2$.

813:

814: \begin{figure}[h]

815: \centering \epsfig{file=bc-msg-graph-ex.eps, clip,

816: width=0.2\linewidth} \centering \caption{\small An example of

817: bipartite graph $G$, with parameters $(2, 4, 4, 2, 2,1)$, composed of

818: two disjoint  subgraphs $\tl{G}_1$ and $\tl{G}_2$ where $\mc{W}_0=\{1,2\}$,

819: $\mc{W}_1=\mc{W}_2=\{1,2,3,4\}$.} \label{fig:bc-msg-graph-ex}

820: \end{figure}

821:

822:

823: \subsection{Equivalence classes of graphs}\label{sec:bc-equivalence-classes}

824:

825: Let us consider the set of all semi-regular bipartite

826: graphs with fixed parameters

827: $(\theta_1,\theta_2,\theta_1',\theta_2')$. It is well-known that

828: this set can

829: be partitioned into equivalence classes where equivalence relation is

830: permutation and relabeling of the vertexes in the graphs. In more

831: detail, one element (or graph) in a class can be obtained from the

832: other in the same class by permutation and relabeling of the

833: vertexes. However, if two elements (or graphs) belong to different

834: classes, they can not be obtained from each other by permutation and

835: relabeling since they have different correlation structures.

836:

837: This means that if we have a transmission system which can reliably

838: transmit the edges of a graph $G$ (i.e., the correlation of the

839: message pairs are modeled using $G$) then this transmission system can

840: be used to reliably transmit edges of any graph  that belongs to the

841: equivalence class of $G$. Similarly, if one can construct a source

842: representation system that can represent a source pair using a graph

843: $G$ (i.e., the correlation of the index pairs produced by the source

844: encoder is modeled using $G$), then it can be used to represent the

845: source pair using any graph that belongs to the equivalence class of

846: $G$.

847:

848:

849: \section{Broadcast Channels with Correlated

850: Messages}\label{sec:general-bc}

851:

852: In this section we characterize transmissibility of certain

853: correlated messages over a stationary discrete memoryless broadcast

854: channel.

855:

856: \subsection{Summary of Results}

857:

858: \begin{figure}[h]

859: \centering \epsfig{file=bc-cc-block.eps, clip=, width=.6\linewidth}

860: \centering \caption{\small Transmission of correlated messages over

861: the broadcast channel} \label{fig:bc-cc-block}

862: \end{figure}

863:

864: Although, ideally, we would want to use semi-regular graphs for

865: source representation and communication of information over broadcast

866: channels, for the sake

867: of analytical tractability, as is typical in Shannon theory, we will

868: allow some slack with regard to the degrees of the vertexes of these

869: graphs, and consider the asymptotic case when this slack is bounded in

870: some way.

871:

872: \begin{dfn}\label{def:bc-cc-tx-system}

873: An $(n,\tau)$-transmission system for a nearly semi-regular

874: bipartite graph $G$ with parameters $(\Delta_0$, $\Delta_1$,

875: $\Delta_2$, $\Delta_1'$, $\Delta_2'$, $\mu)$ and a broadcast channel

876: $(\mc{X},\mc{Y}_1, \mc{Y}_2,p(y_1,y_2|x))$ is an ordered tuple

877: $(f,g_1,g_2)$, consisting of one encoding mapping $f$ and two

878: decoding mappings $g_1$ and $g_2$ where

879: \begin{itemize}

880: \item $f: E(G) \ra \mc{X}^n$, i.e,\\ $\forall ((m-1)\Delta_1+i,(m-1)\Delta_2+j) \in E(G)$, assign

881:   $x^n=f(m,i,j)$ where $m \in \{1,2,\ldots,\Delta_0\}$, $i \in \{1,2,\ldots,\Delta_1\}$, and $j \in

882:   \{1,2,\ldots,\Delta_2\}$,

883: \item $g_i: \mc{Y}_i^n \ra V_i(G)$ for $i=1,2$, i.e.,

884:   $g_i: ~\mc{Y}_i^n \ra

885:   \{1,2,\ldots,\Delta_0\}\times\{1,2,\ldots,\Delta_i\}$,

886: \item such that a performance measure given by the following average

887: probability of error satisfies: \beq \tau= \frac{1}{|E(G)|} \sum_{

888: ((m-1)\Delta_1+i,(m-1)\Delta_2+j) \in E(G)} \!\! Pr \left[

889: (g_1(Y_1^n),g_2(Y_2^n))\!\!\neq\!\! ((m,i),(m,j))

890: |X^n=f(m,i,j)\right]. \eeq

891: \end{itemize}

892: \end{dfn}

893:

894: In Definition \ref{def:bc-cc-tx-system}, the messages $W_0$, $W_1$

895: and $W_2$ are assumed to have the following distribution:

896: \begin{itemize}

897:   \item Alphabet of $W_i$ is $\{1,2,\ldots,\Delta_i\}$ for $i=0,1,2$,

898:   \item $Pr\{(W_0,W_1,W_2)=(m,i,j)\}=\left\{

899:                                        \begin{array}{cl}

900:                                          \frac{1}{E(G)}, & \hbox{if $((m-1)\Delta_1+i,(m-1)\Delta_2+j)\in E(G)$,} \\

901:                                          0, & \hbox{else.}

902:                                        \end{array}

903:                                      \right.$

904:

905: \end{itemize}

906:

907: \noindent

908: \begin{rem}

909: \emph{In broadcast channels, the goal of the channel encoder is

910: to reliably transmit two pairs of messages $(W_0,W_1)$ and

911: $(W_0,W_2)$ to receiver 1 and receiver 2, respectively, over the

912: channel. In terms of graphs, it is to reliably transmit edges of a

913: graph which is associated with the triple of messages

914: $(W_0,W_1,W_2)$ over the channel.

915:  As an analogy, the conventional Shannon's

916: channel coding theorem in a point-to-point communication scenario

917: can be interpreted as finding the maximum number of codewords

918: (colors, if each codeword has a different color) that are

919: distinguishable at the noisy receiver. In the conventional broadcast

920: channel with one sender and two receivers, the goal is to

921: distinguish colors at the noisy receivers, where the first color,

922: which is common to two receivers, can come from one set and the

923: second and the third colors, which are private to each receiver, can

924: come from other two sets, respectively, and all possible combination

925: of triples in the three sets are allowed. A natural question to ask

926: is: if only a fraction of all possible combination of pairs of

927: colors is permitted depending on the common color, what is the

928: maximum size of the sets of these colors which can be reliably

929: distinguished at the receivers.}

930: \end{rem}

931:

932: \begin{dfn}

933:   A tuple of rates $(R_0,R_1,R_2,R_1',R_2')$ is said to be

934:   \emph{achievable} for a given broadcast channel with

935: correlated messages, if for all $\e > 0$, and for sufficiently

936: large $n$,

937:   there exists a bipartite graph $G$ with parameters $(\Delta_0,

938:   \Delta_1, \Delta_2, \Delta_1', \Delta_2',\mu)$ and an associated

939:   $(n,\tau)$-transmission system as defined above satisfying:

940: $\frac{1}{n}\log \Delta_0 > R_0-\e$,  $\frac{1}{n}\log \Delta_i >

941: R_i-\e$, $\frac{1}{n}\log \Delta_i' > R_i'-\e$ for $i=1, 2$,

942: $\frac{1}{n}\log \mu < \e$ and the corresponding average probability

943: of error $\tau < \e$.

944: \end{dfn}

945:

946:

947: Note that in the above definition, we have taken an optimistic point

948: of view. As long as one can find a sequence of nearly semi-regular

949: graphs where the number of vertexes and the degrees are increasing

950: exponentially with given rates, such that the edges from these

951: graphs are reliably transmitted over the given broadcast channel, we

952: allow the corresponding rate tuple to belong to the achievable rate

953: region. The goal is to find the capacity region $\mc{C}_{BC}$ which

954: is the set of all achievable tuple of rates $(R_0, R_1, R_2, R_1',

955: R_2')$. In the following we provide an information-theoretic

956: characterization of an achievable rate region. This is an inner

957: bound to the capacity region ${\mathcal C_{BC}}$, and is also a

958: per-letter characterization. This is one of the main results of this

959: paper.

960:

961: \begin{thm}\label{thm:rate-region-bc-cor}

962: For a discrete memoryless broadcast channel $(\mc{X}, \mc{Y}_1,

963: \mc{Y}_2, p(y_1,y_2|x))$, $\mc{R}_{BC}^* \subset \mc{C}_{BC}$ where

964:   \begin{align}

965: \mc{R}_{BC}^* =\bigcup_{p(z,u,v,x)}\{(R_0, R_1, R_2, R_1', R_2'):~~~& \notag\\

966: R_0 &\leq \min\{I(Z;Y_1),I(Z;Y_2)\},\\

967:     R_1 &\leq I(U;Y_1|Z), \\

968:     R_2 &\leq I(V;Y_2|Z), \\

969:     R_1 + R_2' = R_1' + R_2& \leq I(U;Y_1|Z)+I(V;Y_2|Z)-I(U;V|Z)\}

970:   \end{align}

971: where $Z$, $U$ and $V$ are auxiliary random variables with finite

972: alphabets ${\mathcal Z}$, ${\mathcal U}$ and ${\mathcal V}$,

973: respectively, and

974: $p(z,u,v,x,y_1,y_2)=p(z)p(u,v|z)p(x|z,u,v)p(y_1,y_2|x)$

975: satisfies a Markov chain $(Z,U,V) \ra X \ra (Y_1,Y_2)$.

976: \end{thm}

977:

978: \begin{rem}

979: \emph{

980: When the private messages are

981:   independent, i.e., when all the elements in the set $\mc{W}_1 \times

982:   \mc{W}_2$ can occur with non-zero and equal probability, the rate region

983:   becomes exactly the same as Marton's  \cite{marton79}.

984:   However, when the private messages are correlated, i.e., only some

985:   elements in the set $\mc{W}_1 \times

986:   \mc{W}_2$ can occur equally likely, the sum rate $R_1+R_2$ can be

987:   larger. As the amount of correlation between the messages increases

988:   the achievable rate region also becomes larger.  }

989: \end{rem}

990:

991: \begin{rem}

992: \emph{

993:   The limitations of this theorem are as follows. Note that this

994:   theorem gives only a partial characterization of the set of all

995:   nearly semi-regular graphs whose edges can be reliably transmitted

996:   over a broadcast channel. In the formulation of the achievable rate

997:   region, we

998:   have the freedom of choosing the correlation of the messages for

999:   every block-length $n$. The theorem characterizes the rate of

1000:   exponential growth (as a function of the number of channel uses) of

1001:   size of certain nearly semi-regular graphs, such that edges

1002:   coming from any such graph can be reliably transmitted over the

1003:   broadcast channel.

1004:    This obviously also means that it is possible to transmit

1005:   edges of every graph that belongs to the equivalence class of any of

1006:   these graphs. However, this fact does not mean that the edges of any

1007:   graph with those parameters can be reliably transmitted.}

1008: \end{rem}

1009:

1010: \subsection{Proof of Theorem

1011: {\ref{thm:rate-region-bc-cor}}}\label{sec:proof-general-bc}

1012:

1013: In this section we prove Theorem \ref{thm:rate-region-bc-cor} by

1014: using the method of random coding, random binning, the concept of joint

1015: typicality of sequence pairs and some concepts from the theory of

1016: random graphs \cite{jansbook}.  In addition to using the

1017: techniques given in \cite{elgamal81}, we devise a concept of a

1018: ``\emph{super-bin}'', which is a group of consecutive bins, to take

1019: into account the correlation between the messages.

1020: Given a broadcast channel with distribution $p(y_1,y_2|x)$,

1021: consider a fixed joint distribution $p(z,u,v,x)$ = $p(z)$

1022: $p(u,v|z)$ $p(x|z,u,v)$ where $z,u$ and $v$ are auxiliary random

1023: variables on $\mc{Z} \times \mc{U} \times \mc{V}$. Also, fix $\e

1024: >0$, an integer $n \geq 1$, and positive real numbers $R_0$,

1025: $R_1$, and $R_2$.

1026:

1027: \vspace{10pt}

1028:

1029: \noindent \textbf{Random sequences and bin generation}: Draw

1030: $2^{nR_0}$ sequences $Z^n(m)$ for  $m \in \{1,2,\ldots,2^{nR_0} \}$, of

1031: length $n$, independently with replacement from $\Ae(Z)$ each with probability

1032: $\frac{1}{|\Ae(Z)|}$ where $\Ae(Z)$ is the strongly $\e$-typical

1033: set with respect to the distribution $p(z)$ which is a marginal of

1034: the joint distribution $p(z,u,v)$ and $|A|$ denotes the

1035: cardinality of a set $A$.

1036:

1037: For each $m \in \{1,2,\ldots, 2^{nR_0}\}$, draw

1038: $2^{n(I(U;Y_1|Z)-\e)}$ $n$-length sequences

1039: $U^n(k,m)$ for $k \in \{1,2,\ldots, 2^{n(I(U;Y_1|Z)-\e)}\}$

1040: independently with replacement from  $A(U|Z^n(m))$ with probability

1041: $\frac{1}{|A(U|Z^n(m))|}$. Call this collection as $\C_1(m)$.

1042: Here, $A(U|z^n)$ is the set of

1043: $n$-sequences $u^n$ which are strongly jointly typical with the

1044: sequence $z^n$, i.e., for $z^n \in A_{\epsilon}^{(n)}(Z)$, the set

1045: $A(U|z^n) \triangleq \{u^n|(u^n,z^n)\in \Ae(U,Z)\}$.

1046: Similarly, generate $2^{n(I(V;Y_2|Z)-\e)}$

1047: sequences $V^n(l,m)$ for $l \in$  $\{1,2,\ldots,$ $2^{n(I(V;Y_2|Z)-\e)}\}$

1048: independently with replacement from  $A(V|Z^n(m))$, where

1049: for $z^n \in A_{\epsilon}^{(n)}$, the set  $A(V|z^n) \triangleq

1050: \{v^n|(v^n,z^n)\in \Ae(V,Z)\}$. Call this collection as $\C_2(m)$.

1051: Without loss of generality $2^{n(I(U;Y_1|Z)-\e)}$ and

1052: $2^{n(I(V;Y_2|Z)-\e)}$ are assumed to be integers.

1053:

1054: Next, for each $m$ in $\{1,2,\ldots, 2^{nR_0}\}$, define random bins

1055: $B(i,m)$ and $C(j,m)$ for

1056: $i \in \{1,2,\ldots, 2^{nR_1} \}$ and $j \in \{1,2,\ldots,

1057: 2^{nR_2} \}$ such that

1058: \begin{align}

1059: B(i,m) &= \{U^n(k,m)~|~ k \in [(i-1)\cdot 2^{n(I(U;Y_1|Z)-R_1-\e)}+1,

1060:   ~i\cdot2^{n(I(U;Y_1|Z)-R_1-\e)}] \} \label{bins1} \\

1061: C(j,m) &= \{V^n(l,m)~|~ l \in [(j-1)\cdot 2^{n(I(V;Y_2|Z)-R_2-\e)}+1,

1062: ~j\cdot2^{n(I(V;Y_2|Z)-R_2-\e)}] \}

1063: \label{bins2}

1064: \end{align}

1065: where without loss of generality $2^{n(I(U;Y_1|Z)-R_1-\e)}$ and

1066: $2^{n(I(V;Y_2|Z)-R_2-\e)}$ are considered to be integers, and

1067: $[a,b]$ denotes the set of integers from $a$ to $b$.

1068: This imposes the following constraints on $R_1$ and $R_2$:

1069: $R_1 \leq I(U;Y_1|Z)-\e$, and  $R_2 \leq I(V;Y_2|Z)-\e$.

1070:

1071:

1072: \begin{figure}[h]

1073: \centering \epsfig{file=sub-graph.eps, clip=,

1074: width=.9\linewidth} \centering \caption{\small A subgraph

1075: $\tl{\mb{G}}_1$ ($m=1$) with parameters $(2^{nR_1}, 2^{nR_2},

1076: 2^{nR_1'}, 2^{nR_2'}, \mu)$. }

1077: \label{fig:bin-index-subgraph}

1078: \end{figure}

1079:

1080: \noindent \textbf{Graph generation}: As shown in Figure

1081: \ref{fig:bin-index-subgraph}, for each $m$ in $\{1,2,\ldots,

1082: 2^{nR_0}\}$, a random graph

1083: $\tl{\mb{G}}_m$ can be associated with the bins as follows. (1)

1084: $V_1(\tl{\mb{G}}_m)=\{(m-1)2^{nR_1}+1, \ldots, m2^{nR_1}\}$ and

1085: $V_2(\tl{\mb{G}}_m)=\{(m-1)2^{nR_2}+1, \ldots, m2^{nR_2}\}$, (2)

1086: $\forall (i,j) \in V_1(\tl{\mb{G}}_m) \times

1087: V_2(\tl{\mb{G}}_m)$, $(i,j) \in E(\tl{\mb{G}}_m)$ if and only if there

1088: exists in $B(i,m) \times C(j,m)$ at least one $\e$-strongly jointly

1089: typical sequence pair

1090: that belongs to $A(U,V|Z^n(m))$, where

1091: $A(U,V|z^n)$ is the set of pairs of sequences

1092: that are jointly $\e$-typical with the  sequence $z^n$, i.e., for $z^n

1093: \in \Ae(Z)$, the set

1094: $A(U,V|z^n)$$ \triangleq \{(u^n,v^n)|(u^n,v^n,z^n)\in \Ae(U,V,Z)\}$.

1095: Let $\mb{G}$ denote the random graph that is the union of

1096: $2^{nR_0}$ random graphs $\tl{\mb{G}}_m$ for $m

1097: \in \{1,2,\ldots,2^{nR_0}\}$.

1098:

1099: \vspace{10pt}

1100:

1101: \noindent \textbf{Codebook generation}: A random channel codebook $\mb{C}$

1102: can be generated from the graph $\mb{G}$ as follows. For every $m

1103: \in \{1,2,\ldots,2^{nR_0}\}$,  and every $(i,j) \in

1104: E(\tl{\mb{G}}_m)$, first find one pair of sequences $(U^n(k,m),V^n(l,m))

1105: \in A(U,V|Z^n(m)) \cap (B(i,m) \times C(j,m))$. Then draw a random

1106: codeword $X^n(m,i,j)=f(m,i,j)$ uniformly from

1107: $A(X|U^n(k),V^n(l),Z^n(m))$, where

1108: $A(X|u^n,v^n,z^n) \triangleq \{x^n| (x^n,u^n,v^n,z^n) \in \Ae (X,U,V,Z)\}$.

1109: Thus the size of the

1110: codebook is equal to the size of the edge set of the graph

1111: $\Bbb{G}$, i.e., $|\mb{C}|=|E(\mb{G})|$.

1112:

1113:

1114: \vspace{10pt} \noindent \textbf{Encoding error events due to the

1115: degree condition}: Before we proceed to the encoding and decoding

1116: procedure, we need to make sure that the generated codebooks satisfy

1117: certain properties. If vertexes of the graph $\mb{G}$ do not satisfy

1118: the degree conditions, the message pairs can not be transmitted with

1119: arbitrarily small probability of error. Let

1120: \begin{equation}

1121: A=I(U;Y_1|Z)+I(V;Y_2|Z)-I(U;V|Z)-2 \epsilon

1122: \label{A},

1123: \end{equation}

1124: and choose $R'_1=A-R_2$ and $R'_2=A-R_1$. Since $0 \leq R'_i \leq R_i$

1125: for $i=1,2$, we have the following conditions on $R_1$ and $R_2$:

1126: \begin{equation}

1127: \max\{R_1,R_2\} \leq A \leq R_1+R_2.

1128: \label{AA}

1129: \end{equation}

1130:

1131: In summary,  the nonnegative tuple $(R_1,R_2,R'_1,R'_2)$ that we use

1132: for the random coding  satisfies:

1133: \beq

1134: R_1 \leq I(U;Y_1|Z)-\e, \ \ \ R_1 \leq I(V;Y_2|Z)-\e,

1135: \label{constraint1}

1136: \eeq

1137: \beq

1138: R_1+R'_2=R_2+R'_1= I(U;Y_1|Z)+I(V;Y_2|Z)-I(U;V|Z)-2 \e.

1139: \label{constraint2}

1140: \eeq

1141:

1142:

1143:

1144: For a precise characterization of the error events, we need a function of

1145: $\epsilon$, and in turn certain properties of typical sets. For any

1146: triple $(U,V,Z)$ of finite-valued random variables, there

1147: exists \cite{csiszar-korner} a continuous positive function

1148: $\epsilon_1(\epsilon)$ (that depends on the triple)

1149: such that (a) $\epsilon_1(\epsilon) \rightarrow 0$ as $\epsilon

1150: \rightarrow 0$ and (b) for all $\epsilon>0$ (sufficiently small),

1151: there exists an integer $N_0(\epsilon)>0$ such that $\forall

1152: n>N_0(\epsilon)$ the following conditions hold simultaneously

1153: \begin{itemize}

1154: \item  for all $z^n \in \Ae(Z)$,

1155: \beq

1156: 2^{n(H(U|Z) -\epsilon_1)} \leq |A(U|z^n)| \leq

1157: 2^{n(H(U|Z)+\epsilon_1)}

1158: \label{eqU}

1159: \eeq

1160: \beq

1161: 2^{n(H(V|Z) -\epsilon_1)} \leq |A(V|z^n)| \leq

1162: 2^{n(H(V|Z)+\epsilon_1)}

1163: \label{eqV}

1164: \eeq

1165: \beq

1166: 2^{n(H(U,V|Z) -\epsilon_1)} \leq |A(U,V|z^n)| \leq

1167: 2^{n(H(U,V|Z)+\epsilon_1)}

1168: \label{eqUV}

1169: \eeq

1170: \item  $\forall (u^n,z^n) \in \Ae(U,Z)$

1171: \beq

1172: 2^{n(H(V|U,Z) -\epsilon_1)} \leq |A(V|u^n,z^n)| \leq

1173: 2^{n(H(V|U,Z)+\epsilon_1)}

1174: \label{eqV|U}

1175: \eeq

1176: \item  $\forall (v^n,z^n) \in \Ae(V,Z)$

1177: \beq

1178: 2^{n(H(U|V,Z) -\epsilon_1)} \leq |A(U|v^n,z^n)| \leq

1179: 2^{n(H(U|V,Z)+\epsilon_1)}.

1180: \label{eqU|V}

1181: \eeq

1182: \end{itemize}

1183:

1184:

1185: Coming back to the encoding error event, an error will be

1186: declared if either one of the following events occur.

1187: Let $\e'>3\e_1>0$.

1188: \begin{itemize}

1189:   \item $E_1$: $\exists i \in V_1(\mb{G})$ such that $\left|

1190:   \frac{1}{n}\log{\rm deg}_{\mb{G},1}(i)-R_2' \right| > \e'$,

1191:   \item $E_2$: $\exists j \in V_2(\mb{G})$ such that $\left|

1192:   \frac{1}{n}\log{\rm deg}_{\mb{G},2}(j)-R_1' \right| >   \e'$.

1193: \end{itemize}

1194:

1195: Now we show that the probability of these error events can be made

1196: arbitrarily small under certain conditions. Let us define four events

1197: as follows.

1198: \begin{itemize}

1199: \item $E_{0,1}$ : $\exists i \in V_1(\mb{G})$ such that

1200: ${\rm deg}_{\mb{G},1}(i) <

1201: 2^{n(R'_2-\e')}$

1202: \item $E_{0,2}$ : $\exists j \in V_2(\mb{G})$ such that ${\rm

1203: deg}_{\mb{G},2}(j) <

1204: 2^{n(R'_1-\e')}$

1205: \item $E^*_{0,1}$ : $\exists i \in V_1(\mb{G})$ such that ${\rm

1206: deg}_{\mb{G},1}(i) >

1207: 2^{n(R'_2+\e')}$

1208: \item $E^*_{0,2}$ : $\exists j \in V_2(\mb{G})$ such that ${\rm

1209: deg}_{\mb{G},2}(j) >

1210: 2^{n(R'_1+\e')}$

1211: \end{itemize}

1212:

1213: Toward proving the required statements, we need a lemma given in

1214: \cite{krithivasan} about certain properties of typical sets.

1215: \begin{lem}\label{lem:bc-cc-0}

1216: For any triple of finite-valued random variables $(U,V,Z)$,

1217: any $\epsilon>0$ (sufficiently small), any two positive

1218: real numbers $R_1$ and $R_2$ such that $R_1+R_2>I(U;V|Z)$,

1219: and any  $z^n \in \Ae(Z)$ consider the following random experiment.

1220: Generate two collections of sequences $\C_U(z^n)$ and $\C_V(z^n)$

1221: of size $2^{nR_1}$ and $2^{nR_2}$ from $A(U|z^n)$ and

1222: $A(V|z^n)$, respectively, with uniform distribution and with

1223: replacement. Let $P_{\epsilon}(z^n,R_1,R_2)$ denote the probability that

1224: $|(\C_U \times \C_V) \cap A(U,V|z^n)|=0$. Then $\forall \epsilon>0$,

1225: $\forall R_1,R_2$ with $R_1+R_2>I(U;V|Z)$ and

1226: $\forall z^n \in \Ae(Z)$,

1227: \beq

1228: \lim_{n \rightarrow \infty} -\frac{1}{n} \log P_{\epsilon}(z^n,R_1,R_2)=

1229: \infty.

1230: \eeq

1231: \end{lem}

1232:

1233: \noindent {\it Proof:} See Theorem 2.1 of \cite{krithivasan}.

1234:

1235: The proofs of the next two lemmas use this result.

1236: \begin{lem}\label{lem:bc-cc-1}

1237:   For any $\e>0$, and sufficiently large $n$:\\

1238: \begin{equation}

1239:  P\{E_{0,i}\} < \frac{\e}{12}, ~~\mbox{for $i=1,2$}

1240: \end{equation}

1241: \end{lem}

1242:

1243: \noindent {\it Proof}: Refer to Appendix \ref{appendix:bc-lem-cc-1}.

1244:

1245:

1246: \begin{lem}\label{lem:bc-cc-2}

1247:   For any $\e>0$, and sufficiently large $n$:\\

1248: \begin{equation}

1249:  P\{E_{0,i}^*\} < \frac{\e}{12}, ~~\mbox{for $i=1,2$}

1250: \end{equation}

1251: \end{lem}

1252:

1253: \noindent {\it Proof}: Refer to Appendix \ref{appendix:bc-lem-cc-2}.

1254:

1255: Note that $E_1=E_{0,1} \cup E_{0,1}^*$ and $E_2=E_{0,2} \cup

1256: E_{0,2}^*$. Thus, according to the Lemma \ref{lem:bc-cc-1} and

1257: \ref{lem:bc-cc-2}, it is easy to see that, for sufficiently large

1258: $n$, $P(E_1) < \frac{\e}{6}$, and  $P(E_2) < \frac{\e}{6}$,

1259: if $R_2'=I(U;Y_1|Z)-R_1+I(V;Y_2|Z)-I(U;V|Z)-2\e$ and

1260: $R_1'=I(U;Y_1|Z)-R_2+I(V;Y_2|Z)-I(U;V|Z)-2\e$, respectively.

1261: So, it is shown that with high probability we can obtain a nearly

1262: semi-regular bipartite graph $\mb{G}$ composed of $2^{nR_0}$

1263: disjoint subgraphs $\tl{\mb{G}}_m$ such that for each  $m$,  each vertex

1264: in $V_1(\tl{\mb{G}}_m)$ has degree nearly equal to $2^{nR'_2}$

1265: and each vertex in $V_2(\tl{\mb{G}}_m)$ has degree nearly equal to

1266: $2^{nR'_1}$. The size of $V_1(\tl{\mb{G}}_m)$ is $2^{nR_1}$ and

1267: that of $V_2(\tl{\mb{G}}_m)$ is $2^{nR_2}$.

1268:

1269: \vspace{10pt}\noindent \textbf{Choosing message correlation}: If

1270: none of the above two error events $E_1$ and $E_2$ occurs, choose $G=\mb{G}$.

1271: Clearly  $G$ has parameters $(2^{nR_0}$,

1272: $2^{nR_1}$, $2^{nR_2}$, $2^{nR_1'}$, $2^{nR_2'}$, $2^{n\e'})$.

1273: If any of the

1274: above two error events occurs, then pick any graph with parameters

1275: $(2^{nR_0}$, $2^{nR_1}$, $2^{nR_2}$, $2^{nR_1'}$, $2^{nR_2'}$,

1276: $2^{n\e'})$ and call it as $G$.

1277: The distribution of $W_0$, $W_1$ and $W_2$ is chosen as follows.

1278: $Pr\{(W_0,W_1,W_2)=(m,i,j)\}=\frac{1}{E(G)}$ if

1279: $((m-1)2^{nR_1}+i,(m-1)2^{nR_2}+j)\in E(G)$ and

1280: $Pr\{(W_0,W_1,W_2)=(m,i,j)\}=0$ else. For this  graph $G$, and the given

1281: broadcast channel, using the above random codebook $\Bbb{C}$, we

1282: construct an $(n,\tau)$-transmission system, where $\tau$ will be

1283: specified in the sequel.

1284:

1285: \vspace{10pt}\noindent \textbf{Encoding}: Sender transmits the

1286: codeword $X^n(m,i,j)$ over the channel to deliver two pair of

1287: messages $(m,i)$ and $(m,j)$ to receiver 1 and receiver 2,

1288: respectively.

1289:

1290: \vspace{10pt}\noindent \textbf{Decoding}: Both Receiver 1 and

1291: Receiver 2

1292: first find the unique index $\hat{m}$ such that $Z^n(\hat{m})$ is

1293: jointly typical with received sequence $Y_1^n$ and $Y_2^n$,

1294: respectively. Then, Receiver 1 finds the unique index $\hat{k}$ such

1295: that $U^n(\hat{k},\hat{m})$ is jointly typical with the received sequence

1296: $Y_1^n$ and $Z^n(\hat{m})$, i.e., $(U^n(\hat{k},\hat{m}), Y_1^n,

1297: Z^n(\hat{m})) \in \Ae(U,Y_1,Z)$. Similarly, Receiver 2 finds the

1298: unique index $\hat{l}$ such that

1299: $(V^n(\hat{l},\hat{m}),

1300: Y_2^n, Z^n(\hat{m})) \in \Ae(V,Y_2,Z)$. Then, each receiver finds

1301: the decoded private messages $\hat{i}$ and $\hat{j}$ such that

1302: $U^n(\hat{k},\hat{m}) \in

1303: B(\hat{i},\hat{m})$ and $V^n(\hat{l},\hat{m}) \in C(\hat{j},\hat{m})$,

1304: respectively. Otherwise, an error will be declared.

1305:

1306: \vspace{10pt}\noindent \textbf{Probability of error analysis}: So,

1307: the probability of error $P(E)$ can be given by

1308: \begin{align}

1309:   P(E)&=P(E_1 \cup E_2)P(E|E_1 \cup E_2)+P(E \cap E_1^c \cap E_2^c)\\

1310:        &\leq P(E_1 \cup E_2)+P(E \cap E_1^c \cap E_2^c)

1311: \end{align}

1312: The second probability in the above equation can be bounded as given

1313: in the following lemma.

1314:

1315: \begin{lem} \label{lem:bc-cc-3}

1316: For any $\epsilon>0$, and sufficiently large $n$, \beq P(E \cap

1317: E_1^c \cap E_2^c) < \frac{2 \epsilon}{3} \eeq provided

1318: \begin{align}

1319:   R_0 &\leq \min\{I(Z;Y_1),I(Z;Y_2) \}-\e. \\

1320: \end{align}

1321: \end{lem}

1322: {\it Proof}: Refer to Appendix \ref{appendix:bc-lem-cc-3}. \\

1323: Therefore, by applying the union bound we have

1324: $P(E) \leq  P(E_1)+P(E_2)+P(E \cap E_1^c \cap E_2^c) \leq \e$.

1325:

1326:

1327: Since in every realization of random codebooks, we have chosen a

1328: nearly semi-regular graph $G$ with parameters $(2^{nR_0}$,

1329: $2^{nR_1}$, $2^{nR_2}$, $2^{nR_1'}$, $2^{nR_2'}$, $2^{n\e'})$, and

1330: averaged over the ensemble of random codebooks, the average

1331: probability of error is smaller than $\e$, there must exist a graph

1332: $G$ with parameters $(2^{nR_0}$, $2^{nR_1}$, $2^{nR_2}$,

1333: $2^{nR_1'}$, $2^{nR_2'}$, $2^{n\e'})$ and a codebook such that the

1334: average probability of error is smaller than $\e$. This is true only

1335: under the condition given by the statement of the theorem. Hence,

1336: the proof of Theorem \ref{thm:rate-region-bc-cor} has been

1337: completed. \hfill $\blacksquare$

1338:

1339:

1340: \subsection{The Capacity Region of a Broadcast Channel with One

1341: Deterministic Component with Correlated

1342: Messages}\label{sec:one-deterministic-bc-cor}

1343:

1344: In this section, we consider the

1345: capacity region of broadcast channels with one deterministic

1346: component with correlated

1347: messages, $\mc{\tilde{C}}_{BC}$, when there is no common message,

1348: i.e., when $R_0=0$. In this case, as expected, we can provide a

1349: converse coding theorem.

1350:

1351: \begin{thm}\label{thm:one-det-bc-cor}

1352: For a discrete memoryless semi-deterministic broadcast channel where

1353: $y_1=f_1(x)$ and the probability of $y_2$ given $x$ is $p(y_2|x)$,

1354: $\mc{\tilde{R}}_{BC}^* = \mc{\tilde{C}}_{BC}$ where

1355: \begin{align}

1356: \mc{\tilde{R}}_{BC}^* = \bigcup_{p(v,x)}\{(R_1, R_2, R_1',

1357:   R_2'):~~~~~& \notag\\

1358:   R_1 &\leq H(Y_1) \label{eq:12}\\

1359:   R_2 &\leq I(V;Y_2) \label{eq:13}\\

1360:   R_1 + R_2'= R_1'+R_2 &\leq H(Y_1|V)+I(V;Y_2)\}\label{eq:14}

1361: \end{align}

1362: for some $p(v,x)$ on ${\mathcal V} \times

1363: {\mathcal X}$, where $V$ is an

1364: auxiliary random variable with finite alphabet

1365: ${\mathcal V}$ such

1366: that $V \rightarrow X \rightarrow (Y_1,Y_2)$,

1367: $|{\mathcal V}| \leq |{\mathcal X}|+2$.

1368: \end{thm}

1369: {\it Proof:} See Appendix \ref{appendix:one-det-bc-cor}.

1370:

1371:

1372: \section{Representation of Correlated Sources into Graphs}\label{sec:bc-sc}

1373:

1374: In this section, we consider the problem of  representation of

1375: correlated sources into graphs for transmission over  broadcast

1376: channels, i.e., source

1377: coding module  in broadcast channels with correlated sources.

1378: This problem can be interpreted as Gray-Wyner problem with correlated

1379: messages. In in this setup, the  source encoder can access both sources to be

1380: transmitted, but the source decoders can not collaborate with each

1381: other.  We consider two  correlated sources $(\mc{S},\mc{T},p(s,t))$.

1382:

1383: \subsection{Summary of Results}

1384:

1385: In this problem, the goal is to reliably represent two correlated

1386: sources into a triple of messages $(W_0,W_1,W_2)$ which can be associated

1387: with a nearly semi-regular bipartite graph $G$ with parameters

1388: $(\Delta_0, \Delta_1, \Delta_2, \Delta_1',\Delta_2', \mu)$ as

1389: defined in Definition \ref{def:bc-graph-5-parameters}. As shown in

1390: Figure \ref{fig:bc-sc-block}, the output of source encoder is the triple

1391: $W_0$, $W_1$ and $W_2$. We assume that two pairs of messages

1392: $(W_0,W_1)$ and $(W_0,W_2)$ are sent to receiver 1 and receiver 2,

1393: respectively, over the channel without error. From the received

1394: message pairs, the two source decoders wish to reliably reconstruct the

1395: original source sequences $S^n$ and $T^n$, respectively, without

1396: communicating with each other.

1397: For ease of exposition let is consider two simple functions

1398: $I_1$ and $I_2$ both having domain as $\Bbb{Z} \times \Bbb{Z}$ and

1399: range as $\Bbb{Z}$ given by $I_1(a,b)=a$ and $I_2(a,b)=b$ for all

1400: $(a,b) \in \Bbb{Z} \times \Bbb{Z}$.

1401:

1402: \begin{figure}[h]

1403: \centering \epsfig{file=bc-sc-block.eps, clip=, width=.5\linewidth}

1404: \centering \caption{\small  Gray-Wyner problem with  correlated

1405: messages} \label{fig:bc-sc-block}

1406: \end{figure}

1407:

1408: \begin{dfn}

1409: An $(n,\tau)$-transmission system for a nearly semi-regular

1410: bipartite graph $G$ with parameters $(\Delta_0$, $\Delta_1$,

1411: $\Delta_2$, $\Delta_1'$, $\Delta_2'$, $\mu)$ and a pair of

1412: correlated sources $({\mathcal S},{\mathcal T},p(s,t))$

1413: is an ordered

1414: tuple $(f,g_1,g_2)$, consisting of one encoding mapping $f$ and two

1415: decoding mappings $g_1$ and $g_2$ where

1416: \begin{itemize}

1417: \item $f: \mc{S}^n \times \mc{T}^n \ra E(G)$,

1418: \item $ g_1: V_1(G) \ra \mc{S}^n$, $g_2: V_2(G) \ra \mc{T}^n$,

1419:  \item such that a performance measure given by the

1420: probability of error satisfies:

1421: \begin{align}

1422: \tau&=Pr\{(g_1(I_1(f(S^n, T^n))),g_2(I_2(f(S^n, T^n)))) \!\neq\! (S^n,T^n) \}.

1423: \end{align}

1424: \end{itemize}

1425: \end{dfn}

1426:

1427: \noindent Now we define achievable rates for this problem as follows.

1428: \begin{dfn}

1429:   A tuple of rates $(R_0,R_1,R_2,R_1',R_2')$ is said to be

1430:   \emph{achievable} for a pair of correlated sources

1431:   $(\mc{S},\mc{T},p(s,t))$ (for

1432:   transmission over broadcast channels), if for all

1433: $\e > 0$, and for all sufficiently large $n$,

1434:   there exists a bipartite graph $G$ with parameters

1435:   $(\Delta_0,\Delta_1, \Delta_2, \Delta_1', \Delta_2',\mu)$ and an

1436:   associated

1437:   $(n,\tau)$-transmission system as defined above satisfying:

1438:   $\frac{1}{n}\log \Delta_i < R_i + \e$ for $i=0,1,2$,

1439:   $\frac{1}{n}\log \Delta_j' < R_j' + \e$ for $j=1, 2$,

1440:   $\frac{1}{n}\log \mu < \e$

1441:   and the corresponding probability of error $\tau < \e$.

1442: \end{dfn}

1443:

1444: The goal is to find the achievable rate region $\mc{R}_{GW}$ which

1445: is the set of all achievable tuple of rates $(R_0, R_1, R_2, R_1',

1446: R_2')$. We have obtained an inner bound to the achievable rate region. It

1447: is one of the main results of this paper and is given by the following theorem.

1448:

1449: \begin{thm}\label{thm:bc-sc-main}

1450: $\mc{R}_{GW}^* \subset \mc{R}_{GW}$ where

1451: \begin{align}

1452: \mc{R}_{GW}^* =\bigcup_{p(z|s,t)}\{(R_0, R_1, R_2, R_1', R_2')&: \notag\\

1453:  R_0 &\geq I(S,T;Z), \label{eq:bc-sc-main1}\\

1454:  R_1 &\geq H(S|Z),  \ \ \

1455:  R_2 \geq H(T|Z), \label{eq:bc-sc-main3}\\

1456:  R_1' &\geq H(S|T,Z), \ \ \

1457:  R_2' \geq H(T|S,Z), \label{eq:bc-sc-main5}

1458: \end{align}

1459: where $Z$ is an auxiliary random variable with a finite alphabet

1460: $\mc{Z}$ such that $p(z,s,t)=p(s,t)p(z|s,t)$.

1461:

1462: \end{thm}

1463:

1464: \begin{rem}

1465: Note that we can obtain $R_0+R_1+R_2'= R_0+R_1'+R_2 \geq H(S,T)$ by

1466: combining the conditions in Theorem \ref{thm:bc-sc-main}. This

1467: means that in every nearly semi-regular bipartite graph

1468: used to  represent the source, as expected,

1469: the total number of edges  must be greater than or equal to $2^{nH(S,T)}$.

1470: \end{rem}

1471:

1472: \subsection{Proof of Theorem

1473: \ref{thm:bc-sc-main}}\label{sec:bc-sc-thm-proof}

1474:

1475: In this section, we present a proof of Theorem

1476: \ref{thm:bc-sc-main}. We use a random coding procedure and the notion of

1477: strongly jointly typical sequences.

1478: Let us consider a fixed finite set ${\mathcal Z}$, and a

1479: joint probability distribution $p(z,s,t)$ on

1480: $\mc{Z} \times \mc{S} \times \mc{T}$.  Also fix $\e>0$, and real

1481: numbers $R_0$, $R_1$, $R_2$, $R'_i=R_i-I(S;T|Z)$ for $i=1,2$.

1482: Without loss of generality we assume that

1483: $R_i>I(S;T|Z)$ for $i=1,2$.

1484:

1485:

1486: \vspace{10pt} \noindent \textbf{Random sequence generation}: First,

1487: draw $2^{nR_0}$ sequences $Z^n(m)$, $m \in \{1, 2$, $\ldots,

1488: 2^{nR_0}\}$, of length $n$, independently from the strongly

1489: $\e$-typical set $\Ae(Z)$. That is,

1490: $P\{Z_m^n=z^n\}=\frac{1}{|\Ae(Z)|}$ if $z^n \in \Ae(Z)$, and

1491: $P\{Z_m^n=z^n\}=0$ if $z^n \notin \Ae(Z)$. For every $m \in \{1,2,

1492: \ldots, 2^{nR_0}\}$, draw $2^{nR_1}$  sequence  from $\Ae(S|Z^n(m))$

1493: independently, equally likely and with replacement. Call this

1494: collection $B(m)$. Denote the $i$th sequence as $S^n(i,m)$.

1495: By collecting  these $2^{nR_0}$ bins the first codebook $\Bbb{C}_1$ can be

1496: obtained. In other words, $\Bbb{C}_1=\{B(1),B(2), \ldots,

1497: B(2^{nR_0})\}$.

1498:

1499: Similarly, for every  $m \in \{1,2,\ldots,2^{nR_0}\}$,

1500: construct  a bin $C(m)$ containing $2^{nR_2}$ sequences from

1501: $\Ae(T|Z^n(m))$. Let the $j$th sequence in $C(m)$ be denoted by $T^n(j,m)$.

1502: The second codebook $\Bbb{C}_2$ can be

1503: similarly generated, i.e., $\Bbb{C}_2=\{C(1),C(2), \ldots,

1504: C(2^{nR_0})\}$.

1505:

1506: \vspace{10pt} \noindent \textbf{Graph generation}: Now we generate a

1507: bipartite graph $\Bbb{G}$ from the codebooks $\Bbb{C}_1$ and

1508: $\Bbb{C}_2$ as follows:

1509: \begin{itemize}

1510:   \item $\Bbb{G}$ is composed of $2^{nR_0}$ disjoint subgraphs

1511:   $\tl{\Bbb{G}}_m$ for $m \in \{1, 2, \ldots, 2^{nR_0}\}$,

1512:   \item $V_i(\Bbb{G})\!=\!\cup_{m=1}^{2^{nR_0}}V_i(\tl{\Bbb{G}}_m)$

1513:   for $i=1,2$, \ \ \

1514:   $E(\Bbb{G})=\cup_{m=1}^{2^{nR_0}}E(\tl{\Bbb{G}}_m)$,

1515:   \item $\forall m \!\in \!\{1, 2, \ldots, 2^{nR_0}\}$,

1516:   $V_1(\tl{\Bbb{G}}_m)\!=\!\{(m-1)2^{nR_1}-1, \ldots, m2^{nR_1}\}$,

1517: $V_2(\tl{\Bbb{G}}_m)$=$\{(m-1)2^{nR_2}-1, \ldots, m2^{nR_2}\}$,

1518:   \item $\forall m \!\in \!\{1, 2, \ldots, 2^{nR_0}\}$,

1519:                $(i,j) \in E(\tl{\Bbb{G}}_m)$ if and only if

1520: $(S^n(i,m)$, $T^n(j,m)$, $Z^n(m))$ $\in

1521:          \Ae(S,T,Z)$.

1522: \end{itemize}

1523:

1524: \vspace{10pt} \noindent \textbf{Encoding error events}: Before we

1525: proceed further, let us make sure that the generated codebooks

1526: satisfy certain properties. If the vertexes of $\Bbb{G}$ do not

1527: satisfy certain degree requirements, we may not be able to reliably

1528: represent the sources using this graph.

1529: For the triple $(S,T,Z)$, consider the function $\epsilon_1(\e)$ as

1530: defined in (\ref{eqU})-(\ref{eqU|V}). Let $\e'>3\e_1>0$.

1531: An encoding error will be declared if either one of the following

1532: events occurs. \\

1533: $E_1: \exists i \in V_1(\Bbb{G}) \mbox{ such } \mbox{ that }

1534: | \frac{1}{n} \log {\rm deg}_{\Bbb{G},1}-R'_2 | > \e'$, \\

1535: $E_2: \exists j \in V_2(\Bbb{G}) \mbox{ such } \mbox{ that }

1536: | \frac{1}{n} \log {\rm deg}_{\Bbb{G},2}-R'_1 | > \e'$.

1537:

1538: Using the following lemma it can be shown that the probability of

1539: these two events can be made arbitrarily small for large $n$.

1540:

1541: \begin{lem}\label{lem:bc-sc-1}

1542: For any $\e>0$, and sufficiently large $n$,

1543: we have

1544: $P(E_i)< \frac{\e}{8}, \ \ {\rm for } \ \ i=1,2.$

1545: \end{lem}

1546: {\it Proof:} The proofs of these two results are similar,

1547: respectively, to those of Lemma

1548: \ref{lem:bc-cc-1} and Lemma \ref{lem:bc-cc-2}. Hence for conciseness

1549: we omit the proof.

1550:

1551: So by Lemma \ref{lem:bc-sc-1}, $P(E_1)+P(E_2) < \frac{\e}{4}$.

1552: Hence we can obtain a bipartite graph

1553: $\Bbb{G}$ where each vertex in $V_1(\Bbb{G})$ has degree nearly

1554: equal to $ 2^{nR'_2}$ and each vertex in $V_2(\Bbb{G})$ has

1555: degree nearly equal to $ 2^{nR'_1}$.

1556:

1557: \vspace{10pt} \noindent \textbf{Choosing message correlation}:

1558: If none of the above error events $E_1$ and $E_2$ occurs, then choose

1559: $G=\Bbb{G}$. If any of the above two error events occurs, then pick

1560: any graph with parameters

1561: $(2^{nR_0},2^{nR_1},2^{nR_2},2^{nR'_1},2^{nR'_2},2^{n\e'})$

1562: and call it as $G$ and no guarantee will be given regarding the

1563: probability of decoding error.

1564: For this graph $G$, and the given correlated sources $S$ and $T$, using

1565: the above random codebooks $\Bbb{C}_1$ and $\Bbb{C}_2$, we construct

1566: an $(n,\tau)$-transmission system, where $\tau$ will be specified in

1567: the sequel.

1568:

1569: \vspace{10pt} \noindent \textbf{Encoding}: If $(E_1\cup E_2)$ occurs,

1570: then the encoder is some arbitrary mapping $\mc{S}^n \times \mc{T}^n

1571: \rightarrow E(G)$. Otherwise,

1572: for a given $(S^n,T^n)

1573: \!\in\! \Ae(S,T)$, find an index $m$, for $m \in \{1, 2,$ $\ldots,

1574: 2^{nR_0}\}$, such that $(S^n$, $T^n$, $Z^n(m))$ $\in \Ae(S,T,Z)$. If

1575: there is no such index $m$, let $m$ be a random index chosen uniformly

1576: from $\{1,2,\ldots,2^{nR_0}\}$. Also, find a pair of

1577: indexes $(i,j)$ where $i$ (and $j$, respectively) is the index such

1578: that $S^n(i,m)=S^n$ (and $T^n(j,m)=T^n$, respectively). If there is no

1579: such sequence pair let $(i,j)$ be a random edge from the corresponding

1580: sub-graph of $G$. The encoder sends $(m,i)$ and $(m,j)$ to

1581: receiver 1 and receiver 2, respectively.

1582:

1583: \vspace{10pt} \noindent \textbf{Decoding}:

1584: Given the received

1585: index pair $(m,i)$, receiver 1 declares $\h{S}^n=S^n(i,m)$. Similarly,

1586: given $(m,j)$, receiver 2 declares

1587: $\h{T}^n=T^n(j,m)$.

1588:

1589: \vspace{10pt} \noindent \textbf{Probability of error analysis}: Let

1590: $E$ denote the event $\{ (g_1(m,i),g_2(m,j)) \!\neq \!(S^n, T^n)\}$,

1591: that the reconstruction vector pair is not equal to the source

1592: vector pair. The probability of error $P(E)$ can be given by

1593: \begin{align}

1594:   P(E)&=P(E_1 \!\cup\! E_2)P(E|E_1 \! \cup \! E_2)\!+\!P(E \cap E_1^c

1595:        \cap E_2^c)\\

1596:        &\leq P(E_1 \!\cup\! E_2)+P(E \cap E_1^c \cap E_2^c).

1597: \end{align}

1598:

1599: The second probability in the above equation can be bounded as

1600: given in the following lemma.

1601:

1602: \begin{lem}\label{lem:bc-sc-3}

1603:   For any $\e>0$, and sufficiently large $n$,

1604:   \beq P(E \cap E_1^c \cap E_2^c) < \frac{3\e}{4} \eeq

1605:   provided

1606:   \beq

1607:     R_0 > I(S,T;Z) + \e_2(\e), \ \

1608:     R_1 > H(S|Z) + \e_1(\e), \ \

1609:     R_2 > H(T|Z) + \e_1(\e),

1610:   \eeq

1611:   where $\e_2(\e) \ra 0$ as $\e \ra 0$ and $\e_2(\e)>0$.

1612: \end{lem}

1613: {\it Proof}: Refer to Appendix \ref{appendix:bc-lem-sc-3}.

1614:

1615: Thus, $P(E)  \leq P(E_1)\!+\!P(E_2)\!+\! P(E \cap E_1^c \cap E_2^c) < \e$.

1616: Therefore $P(E) < \e$ for sufficiently large $n$ and under the

1617: conditions given by the theorem. In every realization of random

1618: codebooks we have obtained a graph $G$ with the same set of parameters, and

1619: averaged over this ensemble, we have made sure that the probability

1620: of error is within the tolerance level of $\e$. Hence, the proof of

1621: the direct coding theorem is completed. \hfill $\blacksquare$

1622:

1623:

1624: \subsection{Outer bound}\label{sec:bc-sc-thm-proof}

1625:

1626: In this section we provide a  partial converse for  Theorem

1627: \ref{thm:bc-sc-main}.

1628:

1629: \begin{thm}\label{thm:bc-sc-main_c}

1630: $\mc{R}_{GW} \subset  \mc{R}_{GW}^{**}$ where

1631: \begin{align}

1632: \mc{R}_{GW}^{**} =\bigcup_{p(w|s,t)}\{(R_0, R_1, R_2, R_1', R_2')&: \notag\\

1633:  R_0 &\geq I(S,T;Z), \label{eq:bc-sc-main1a}\\

1634:  R_1 &\geq H(S|Z), \ \ \

1635:  R_2 \geq H(T|Z), \label{eq:bc-sc-main3a}\\

1636: R_1+R_2'&= R_1'+R_2 \geq H(S,T|Z)\label{eq:bc-sc-main6a}\},

1637: \end{align}

1638: where $Z$ is an auxiliary random variable such that $p(z,s,t)=p(s,t)p(z|s,t)$.

1639: \end{thm}

1640: {\it Proof:} Refer to Appendix \ref{appendix:bc-sc-main_c}.

1641:

1642:

1643: \subsection{An interpretation of the rate region}\label{sec:different}

1644:

1645: We have shown that for sufficiently large

1646: block length $n$, a pair correlated sources $(S,T)$ can be

1647: represented into  a nearly semi-regular graph $G$ with parameters

1648: $(2^{nR_0}$, $2^{nR_1}$, $2^{nR_2}$, $2^{nR_1'}$, $2^{nR_2'}$,

1649: $2^{n\e'}$) as shown in Figure \ref{fig:bc-message-graph}.

1650: \begin{figure}[!h]

1651: \begin{center}

1652: \psfig{figure=sub-graph_source.eps,height=2.0in,width=3.6in}

1653: \end{center}

1654: \centering \caption{\small Graph $G$ with parameters $(2^{nR_0}$,

1655: $2^{nR_1}$, $2^{nR_2}$, $2^{nR_1'}$, $2^{nR_2'}$, $2^{n\e'})$

1656: composed of $2^{nR_0}$ subgraphs $\tl{G}_m$ with parameters

1657: ($2^{nR_1}$, $2^{nR_2}$,

1658: $2^{nR_1'}$, $2^{nR_2'}$, $2^{n\e'}$)}\label{fig:bc-message-graph}

1659: \end{figure}

1660: Note that many different graphs can be used to represent a pair

1661: correlated sources without increasing the amount of ``global''

1662: redundancy, i.e., the sum rate satisfies:

1663: $R_0+R_1+R_2'=H(S,T)$. This can be seen from the fact that the total

1664: number of edges in

1665: the graph satisfies:  $|E(G)|\approx 2^{n(I(S,T;Z)+H(S,T|Z))}=2^{nH(S,T)}$.

1666: Further, the rate of the common message can vary from $0$ to $H(S,T)$.

1667: As examples, let us consider some special cases as follows.

1668: \begin{itemize}

1669:   \item Case $A$: If $I(S,T;Z)=0$, then $R_0=0$, $R_1=H(S)$, $R_2=H(T)$,

1670:   $R_1'=H(S|T)$, $R_2'=H(T|S)$. Roughly speaking, this

1671:   corresponds to the typicality-graph of $(S,T)$.

1672:   \item Case $B$: If $I(S,T;Z)=H(S,T)$, then $R_0=H(S,T)$, $R_1=0$, $R_2=0$,

1673:   $R_1'=0$, $R_2'=0$.

1674: \end{itemize}

1675: In particular, consider the case when  $R_0=C(S;T)$, where $C(S;T)$

1676: denotes the common information of  Wyner

1677: \cite{wyner75}.

1678: In this case, $R_0=C(S;T)$, $R_1=H(S|Z)$, $R_2=H(T|Z)$,

1679:   $R_1'=H(S|Z)$, $R_2'=H(T|Z)$ since $S$ and $T$ are conditionally

1680: independent given $Z$. So, each subgraph $\tl{G}_m$ becomes

1681:  nearly complete. This also means that the private messages for receiver 1

1682: and 2 become nearly independent. Note that a subgraph $\tl{G}_m$ can

1683: not be complete if $R_0 < C(S;T)$ because, by the definition,

1684: $C(S;T)$ is the infimum of $I(S,T;Z)$ such that $S \ra Z \ra T$.

1685:

1686: In summary, Case $A$ can be thought of as situated at one end of the

1687: spectrum, and Case $B$ as situated on the other end of the spectrum.

1688: For every value of $I(S,T;Z)$, we get an equally efficient

1689: representation of the sources into a nearly semi-regular graph.

1690: Here, an efficient representation means that the cardinality

1691: of the edge set of a graph used for representation (of $n$ source

1692: samples) is nearly equal to  $2^{nH(S,T)}$.

1693:

1694:

1695: \section{Interpretation of the Example in Section

1696:   \ref{sec:bc-ex}}\label{sec:bc-example}

1697:

1698: In this section, we revisit the example of Section \ref{sec:bc-ex}.

1699: We analyze and

1700: interpret this example from the perspective  of graphs and the

1701: proposed coding scheme. This example deals with  transmission of

1702: two correlated sources over a deterministic broadcast channel. This

1703: example can be considered as a special case where the

1704: typicality-graph of the sources and that of channel outputs can

1705: match exactly. So, we can apply our coding theorem to this case.

1706:

1707: Consider the pair of correlated sources of Section \ref{sec:bc-ex}:

1708: $\mc{S}=\mc{T}=\{0, 1\}$ with distribution  $p(0,0)=p(0,1)=p(1,1)=1/3$, and the

1709: Blackwell channel with ${\mathcal X}=\{1,2,3\}$, and ${\mathcal

1710:   Y}_i=\{0,1\}$ for $i=1,2$ with conditional distribution

1711: $p(0,0|1)=p(0,1|2)=p(1,1|3)=1$.  Let $\alpha=h_2(1/3)$ and $\beta=\log 3

1712: -h_2(1/3)$. It follows that $H(S)=H(T)=\alpha$

1713: and $H(S|T)=H(T|S)=\beta$. Consider the rate tuple

1714: $(0,\alpha,\alpha,\beta,\beta)$. Clearly this point belongs to

1715: ${\mathcal R}^*_{GW}$ as this tuple corresponds to the

1716: typicality-graph of $({\mathcal S},{\mathcal T},p(s,t))$.

1717: Note that the above broadcast channel is a deterministic broadcast

1718: channel. Using the arguments presented in Section

1719: \ref{sec:one-deterministic-bc-cor}, it can be shown that for any

1720: deterministic broadcast channel, for the special case when $R_0=0$,

1721: the capacity region $\tilde{{\mathcal C}}_{BC}$ is given by

1722:

1723: \beq

1724: \bigcup_{p(x)} \{(R_1,R_2,R'_1,R'_2): \ \

1725: R_i \leq H(Y_i) \ \ \mbox{for} \ \ i=1,2, \ \

1726: R_1+R'_2=R_2+R'_1 \leq H(Y_1,Y_2) \}.

1727: \eeq

1728:

1729: Choosing $X$ equally likely over $\{1,2,3\}$, it follows that

1730: the rate tuple $(\alpha,\alpha,\beta,\beta)$ belongs to

1731: $\tilde{{\mathcal C}}_{BC}$, which, of course,  implies that

1732: $(0,\alpha,\alpha,\beta,\beta)$ belongs to

1733: ${\mathcal C}_{BC}$. In this case the distribution of the channel

1734: output $(Y_1,Y_2)$ is the same as the distribution of the sources

1735: $(S,T)$. Further, the graph corresponding to the tuple

1736: $(0,\alpha,\alpha,\beta,\beta)$ in source coding is the same as the

1737: graph corresponding to the tuple $(0,\alpha,\alpha,\beta,\beta)$ in channel

1738: coding.

1739:

1740: In summary: (1) there is a

1741: nonempty intersection between the achievable rate regions of the source

1742: coding module and the channel coding module, and (2) the graph $G$

1743: associated with the source coding module matches (is identical) with the

1744: graph $G$ of channel coding module.

1745: Hence the given sources $(S,T)$

1746: can be reliably sent over the given broadcast channel.

1747:

1748: \section{Conclusion}\label{sec:bc-conclusion}

1749: We have considered the problem of transmission of correlated sources over

1750: broadcast channels. We have considered a graph-based modular architecture

1751: involving two components: a channel coding component and a source

1752: coding component.  The graphs are used to model the correlation between the

1753: messages. Correlated sources

1754: are first mapped into such graphs, and the edges coming from these

1755: graphs are reliably transmitted over a broadcast channel.

1756:

1757: We have given a partial characterization of the set of all graphs

1758: that can be used to represent a given pair of correlated sources,

1759: and similarly given a partial characterization of the set of all

1760: graphs such that edges coming those graphs are reliably transmitted

1761: over a given broadcast channel.

1762: We have also considered  special cases such as

1763:  deterministic broadcast channels and broadcast channels with one

1764:  deterministic component, where converse results are provided. We

1765: have applied this analysis to the case of transmission of a triangular source

1766: over a Blackwell channel as an example. The goal of

1767: this work is to show that graphs may be used as discrete interface

1768: in this modular approach to multiterminal communication problems.

1769:

1770:

1771: \appendix

1772:

1773:

1774: \section*{Appendix}

1775:

1776: \section{Proof of Lemma \ref{lem:bc-cc-1}}\label{appendix:bc-lem-cc-1}

1777: The event $E_{0,1}$ can be considered as

1778: \begin{equation}

1779: E_{0,1}=\bigcup_{i=1}^{2^{nR_1}} \bigcup_{m=1}^{2^{nR_0}} E_{0,1}(i,m)

1780: \end{equation}

1781: where  $E_{0,1}(i,m)$ is the event that

1782: ${\rm deg}_{\mb{G},1}((m-1)2^{nR_1}+i) < 2^{n(R'_2-\e')}$.

1783:

1784: Note that  each vertex in $V_1(\mb{G})$ is only connected with a

1785: subset of $V_2(\tl{\mb{G}}_m)$ for some $m$, $m \in

1786: \{1,2,\ldots,2^{nR_0}\}$.

1787: We define super-bins $\mf{B}(p,m)$ and $\mf{C}(q,m)$ for

1788: $p\in [1,2^{n(R'_1-\epsilon')}]$ and $q\in[1,2^{n(R'_2-\epsilon')}]$, each of

1789: which is a union of $2^{n(R_1+R_2-A+\epsilon')}$ consecutive bins $B(i,m)$ and

1790: $C(j,m)$, respectively, i.e.,

1791: \begin{align}

1792:   \mf{B}(p,m)=\bigcup_{i=(p-1)2^{n(R_1+R_2-A+\epsilon')}+1}

1793: ^{p\,2^{n(R_1+R_2-A+\epsilon')}} B(i,m),\\

1794:   \mf{C}(q,m)=\bigcup_{j=(q-1)2^{n(R_1+R_2-A+\epsilon')}+1}

1795: ^{q2^{n(R_1+R_2-A+\epsilon')}} C(j,m)

1796: \end{align}

1797: The size of each super-bin $\mf{B}(p,m)$ and

1798: $\mf{C}(q,m)$ is $2^{n(I(U;Y_1|Z)-\epsilon-R'_1+\epsilon')}$ and

1799: $2^{n(I(V;Y_2|Z)-\epsilon-R'_2+\epsilon')}$, respectively. Recall that

1800: $R_1+R'_2=R_2+R'_1=A$.

1801:

1802: Before we proceed further, let us observe that

1803:  $B(i,m)$, $\mf{B}(p,m)$ and $\C_1(m)$ are

1804: collections of $2^{n(I(U;Y_1|Z)-\epsilon-R_1)}$,

1805: $2^{n(I(U;Y_1|Z)-\epsilon-R'_1+\epsilon')}$ and

1806: $2^{n(I(U;Y_1|Z)-\epsilon)}$, respectively, of random sequences.

1807: Then, the event $E_{0,1}(i,m)$ can be expressed as

1808: \begin{equation}

1809: E_{0,1}(i,m) \subset \bigcup_{q=1}^{2^{n(R'_2-\e')}} E_{0,1}(i,q,m)

1810: \end{equation}

1811: where  $E_{0,1}(i,q,m)$ is the event that $(B(i,m) \times

1812: \mf{C}(q,m)) \cap A(U,V|Z^n(m))$ is empty.  So,

1813: by using the union bound, the probability of this event

1814: $P(E_{0,1}(i,m))$ can be bounded as:

1815: \begin{align}

1816:   P(E_{0,1}(i,m)) &\leq P \left( \bigcup_{q=1}^{2^{n(R'_2-\e')}}

1817:   E_{0,1}(i,q,m) \right)\\

1818:   &\leq \sum_{q=1}^{2^{n(R'_2-\e')}} P(E_{0,1}(i,q,m))\\

1819:   &\leq \sum_{q=1}^{2^{n(R'_2-\e')}} \sum_{z^n \in \Ae(Z)}

1820:   \frac{1}{|\Ae(Z)|} P(E_{0,1}(i,q,m)|Z^n(m)=z^n)

1821: \end{align}

1822: Therefore, for sufficiently large $n$, the probability of the event

1823: $E_{0,1}$ can be bounded by applying the union bound:

1824: \begin{align}

1825:  P(E_{0,1}) &\leq \sum_{i=1}^{2^{nR_1}} \sum_{m=1}^{2^{nR_0}} P(E_{0,1}(i,m))\\

1826:             &\leq \sum_{i=1}^{2^{nR_1}} \sum_{m=1}^{2^{nR_0}}

1827:  \sum_{q=1}^{2^{n(R'_2-\e')}} \sum_{z^n \in \Ae(Z)}

1828:   \frac{1}{|\Ae(Z)|} P(E_{0,1}(i,q,m)|Z^n(m)=z^n) \\

1829:               &\overset{(a)}\leq 2^{n(R_0+R_1+R'_2-\e')} 2^{-nM}\\

1830:               &{\leq} \frac{\e}{12},

1831: \end{align}

1832: where (a) is from Lemma \ref{lem:bc-cc-0} because

1833: \beq

1834: \frac{1}{n} \log |B(i,m)||\mf{C}(q,m)|=I(U;V|Z)+\epsilon',

1835: \eeq

1836:  and $M > 0$ is a

1837: sufficiently large number satisfying $M > R_0+R_1+R'_2-\e'$.

1838:

1839: In a similar way, we can also show that $P(E_{0,2})

1840: <\frac{\e}{12}$ for sufficiently large $n$.

1841:  \hfill $\blacksquare$

1842:

1843:

1844:

1845: \section{Proof of Lemma \ref{lem:bc-cc-2}}\label{appendix:bc-lem-cc-2}

1846:

1847: The event $E_{0,1}^*$ can be expressed as

1848: \begin{equation}

1849: E_{0,1}^*=\bigcup_{i=1}^{2^{nR_1}} \bigcup_{m=1}^{2^{nR_0}} E_{0,1}^*(i,m)

1850: \end{equation}

1851: where  $E_{0,1}^*(i,m)$ is the event that

1852: ${\rm deg}_{\mb{G},1}((m-1)2^{nR_1}+i) > 2^{n(R'_2+\e')}$.

1853:

1854: Let $\mc{D}(i,m)=\{j:  |A(U,V|Z^n(m))\cap (B(i,m) \times C(j,m))|

1855: \neq 0 \}$. Then,

1856: \begin{align}

1857:   |\mc{D}(i,m)|=\sum_{j=1}^{2^{nR_2}} \psi(j),

1858:   \mbox{  where} ~~\psi(j)=\left\{%

1859: \begin{array}{ll}

1860:     1, & \hbox{if} \ \ |A(U,V|Z^n(m))\cap (B(i,m) \times C(j,m))| \neq 0, \\

1861:     0, & \hbox{otherwise.}

1862: \end{array}%

1863: \right.

1864: \end{align}

1865:

1866: In particular,

1867: \begin{align}

1868:   P\{\psi(j)=1\}&= P \{ |A(U,V|Z^n(m)) \cap

1869:   (B(i,m) \times C(j,m))| \neq 0 \}\\

1870:  &= \sum_{z^n \in \Ae(Z)} \frac{1}{|\Ae(Z)|} P \{ |A(U,V|Z^n(m)) \cap

1871:   (B(i,m) \times C(j,m))| \neq 0 | Z^n(m)=z^n\}\\

1872:    &\overset{(a)}{\leq} \sum_{z^n \in \Ae(Z)} \frac{1}{|\Ae(Z)|}

1873: 2^{n(I(U;Y_1|Z)-R_1-\e)}

1874:   2^{n(I(V;Y_2|Z)-R_2-\e)} 2^{-n(I(U;V|Z)-3\e_1)}\\

1875:   &= 2^{n(A-R_1-R_2+3\e_1)}

1876: \end{align}

1877: where (a) is obtained by applying the union bound, and

1878: from the property of strongly jointly $\e$-typical sequences

1879: \cite{cover-thomas}: for all $z^n \in \Ae(Z)$, for a randomly and

1880: independently chosen $U^n

1881: \in A(U|z^n)$ and $V^n \in A(V|z^n)$, for sufficiently large

1882: $n$, the probability that

1883:  $(U^n, V^n) \in A(U,V|z^n)$ is bounded by

1884: \begin{equation}\label{eq:bc-cc-3}

1885: 2^{-n(I(U,V|Z)+3\e_1)} \leq P\{(U^n, V^n) \in A(U,V|z^n)\} \leq

1886: 2^{-n(I(U,V|Z)-3\e_1)}

1887: \end{equation}

1888: where we have used (\ref{eqU}), (\ref{eqV}) and (\ref{eqUV}).

1889:

1890:

1891: So, the expectation of $|\mc{D}(i,m)|$ can be bounded as follows.

1892: \begin{align}

1893:   E|\mc{D}(i,m)| &=\sum_{j=1}^{2^{nR_2}} P\{\psi(j)=1\}\\

1894:              &\leq 2^{nR_2}2^{n(A-R_1-R_2+3\e_1)}\\

1895:              &\leq 2^{n(R'_2+3\e_1)} \label{eq:bc-cc-5}

1896: \end{align}

1897:

1898: Now,

1899: \begin{align}

1900:   P\{E_{0,1}^*(i,m)\}&=P\{|\mc{D}(i,m)|>

1901:                     \underbrace{2^{n(A-R_1+\e')}}_{\triangleq a}\}\\

1902:                     &\overset{(b)}{<}e^{-at}E\{ e^{t|\mc{D}|} \}

1903: \end{align}

1904: for any $t>0$ where $(b)$ follows from the Chernoff bound

1905: \cite{cover-thomas}.

1906:

1907: Now we calculate an upper bound for $E\{ e^{t|\mc{D}|}\}$. Consider an

1908: arbitrary sequence $z^n \in \Ae(Z)$.   Let $u^n[l]$ be the

1909: $l$-th sequence (using some ordering) in $A(U|z^n)$.

1910: Recalling the definition given in (\ref{bins1}) and (\ref{bins2}),

1911: let us denote $|B(i,m)|=\beta$, and

1912: $i_k=(i-1)\beta+k$ for $k \in \{1,2,\ldots,\beta\}$ for notational

1913: simplicity. Now consider the following sequence of arguments:

1914: \begin{align}

1915:   E\{ e^{t|\mc{D}(i,m)|}|Z^n(m)=z^n \}

1916:   &= E \bigg\{ \prod_{j=1}^{2^{nR_2}} e^{t\psi(j)} \bigg| Z^n(m)=z^n

1917:  \bigg\}\\

1918:   &= \sum_{l_1=1}^{|A(U|z^n)|}\!\!\!\!p\{U^n(i_1,m)=u^n[l_1]\} \!\!

1919:   \sum_{l_2=1}^{|A(U|z^n)|}

1920:   \!\!\!\!p\{U^n(i_2,m)=u^n[l_2]\}\ldots

1921:   \!\!\sum_{l_{\beta}=1}^{|A(U|z^n)|}\!\!\!\!p\{U^n(i_\beta,m)=u^n[l_\beta]\}

1922:   \notag\\

1923:   &~~~~~~~~~~~E \bigg\{ \prod_{j=1}^{2^{nR_2}} e^{t\psi(j)}\bigg|

1924:   ~\mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1925:   for $\theta=1,2,\ldots,\beta$ }, \mbox{and }  Z^n(m)=z^n \bigg\}\\

1926:   &= \sum_{l_1=1}^{|A(U|z^n)|}\sum_{l_2=1}^{|A(U|z^n)|}\ldots

1927:   \sum_{l_{\beta}=1}^{|A(U|z^n)|}\frac{1}{|A(U|z^n)|^{\beta}}

1928:   \notag\\

1929:   &~~~~~~~~~~~E \bigg\{ \prod_{j=1}^{2^{nR_2}}

1930:   e^{t\psi(j)}\bigg|~\mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1931:   for $\theta=1,2,\ldots,\beta$ }, \mbox{ and } Z^n(m)=z^n

1932:   \bigg\}\\

1933:   &\overset{(c)}{=}

1934:   \sum_{l_1=1}^{|A(U|z^n)|}\sum_{l_2=1}^{|A(U|z^n)|}\ldots

1935:   \sum_{l_{\beta}=1}^{|A(U|z^n)|}\frac{1}{|A(U|z^n)|^{\beta}}

1936:   \notag\\

1937:   &~~~~~~~~~~~\prod_{j=1}^{2^{nR_2}} E \bigg\{

1938:   e^{t\psi(j)}\bigg|~~\mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1939:   for $\theta=1,2,\ldots,\beta$ }, \mbox{ and } Z^n(m)=z^n

1940:   \bigg\}

1941: \end{align}

1942: where $(c)$ is from the fact that $\psi(j)$'s are independent when

1943: the outcomes of $U^n(i_1,m),U^n(i_2,m),\ldots,U^n(i_\beta,m)$ and

1944: $Z^n(m)$ are fixed. Let

1945: us denote

1946: \beq

1947: p_j=P\{\psi(j)=1 \big|~\mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1948:   for $\theta=1,2,\ldots,\beta$ }, \mbox{ and } Z^n(m)=z^n \}.

1949: \label{pj}

1950: \eeq

1951:

1952: Then,

1953: \begin{align}

1954:   E \{e^{t\psi(j)} \big|~\mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1955:   for $\theta=1,2,\ldots,\beta$ }, \mbox{ and } Z^n(m)=z^n  \}&=e^t

1956:   p_j+1\cdot (1-p_j)\\

1957:  &=1-p_j(1-e^t)\\

1958:                     &\leq e^{-p_j(1-e^t)}~~~ \mbox{since $1-x \leq e^{-x}$} .

1959: \end{align}

1960: So,

1961: \begin{align}

1962: &\prod_{j=1}^{2^{nR_2}} E

1963: \{e^{t\psi(j)}|~\mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1964:   for $\theta=1,2,\ldots,\beta$ }, \mbox{ and } Z^n(m)=z^n  \}\\

1965:   &\leq \prod_{j=1}^{2^{nR_2}} e^{p_j(e^t-1)}\\

1966: &={\rm exp} \bigg\{ (e^t-1) \sum_{j=1}^{2^{nR_2}} p_j\bigg\}\\

1967: &={\rm exp} \bigg\{ (e^t-1) E\left\{|\mc{D}(i,m)| \big|

1968: ~\mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1969:   for $\theta=1,2,\ldots,\beta$ },  \mbox{ and } Z^n(m)=z^n \right\} \bigg\}

1970: \end{align}

1971:

1972: Then,

1973: \begin{align}

1974:  E\{ e^{t|\mc{D}(i,m)|} | Z^n(m)=z^n \}

1975:  &\leq

1976: \sum_{l_1=1}^{|A(U|z^n)|}\!\!\!\!\ldots\!\!\!\!

1977: \sum_{l_{\beta}=1}^{|A(U|z^n)|}\!\!\!\!\frac{1}{|A(U|z^n)|^{\beta}}

1978:  \notag\\

1979:   &~~~~~~~~{\rm exp} \bigg\{ (e^t-1) E\left\{|\mc{D}(i,m)| \big|

1980: \mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1981:   for $\theta=1,2,\ldots,\beta$ }, \mbox{ and } Z^n(m)=z^n  \right\}

1982:  \!\! \bigg\}\\

1983:   &\overset{(d)}{\leq} {\rm exp} \bigg\{ (e^t-1) 2^{n(R'_2+3\e_1)}  \bigg\}

1984: \end{align}

1985: where $(d)$ is obtained because $E\left\{|\mc{D}(i,m)| \big|

1986: ~\mbox{$U^n(i_\theta,m)=u^n[l_\theta]$,

1987:   for $\theta=1,2,\ldots,\beta$ }, \mbox{ and } Z^n(m)=z^n  \right\}$

1988: is bounded by $2^{n(R'_2+3 \e_1)}$ (the same as the bound on the

1989: unconditional expectation as in inequality

1990: (\ref{eq:bc-cc-5}))  regardless of the particular sequence collection

1991: $u^n[l_1],u^n[l_2],\ldots,u^n[l_\beta]$ and $z^n$. To see this crucial step

1992: clearly let us rewrite  $p_j$ (see (\ref{pj})), using the union bound

1993: on the outcome of the bin $C(j,m)$, as follows:

1994: \beq

1995: p_j \leq  2^{n(I(V;Y_2|Z)-R_2-\epsilon)}

1996:   P \left[  \bigcup_{\theta=1}^{\beta} F_{\theta}  \right],

1997: \eeq

1998: where $F_{\theta}$ is the event that a sequence drawn from $A(V|z^n)$

1999: belongs to $A(V|u^n[l_{\theta}],z^n)$. Since all sequences in

2000: $B(i,m)$, given that $Z^n(m)=z^n$,

2001: are drawn from $A(U|z^n)$, we have using (\ref{eqV}) and (\ref{eqV|U})

2002: \beq

2003: p_j \leq  2^{n(I(V;Y_2|Z)-R_2-\epsilon)}  2^{n(I(U;Y_1|Z)-R_1-\epsilon)}

2004: 2^{-n(I(U;V|Z)-3 \epsilon_1)}= 2^{n(A-R_1-R_2+3\e_1)}.

2005: \eeq

2006:

2007:

2008: Therefore, for $t>0$,

2009: \begin{align}

2010:   P\{E_{0,1}^*(i,m)\} &\leq e^{-at} {\rm exp} \bigg\{ (e^t-1)

2011:     2^{n(R'_2+3\e_1)}  \bigg\}\\

2012:     &={\rm exp} \bigg\{-at+(e^t-1)\underbrace{2^{n(R'_2+3\e_1)}}_{\triangleq

2013:     ~b}\bigg\}

2014: \end{align}

2015:

2016: To get a tighter upper bound, let us denote $f(t)=-at+b(e^t-1)$,

2017: for $t>0$. Then, $f'(t)=-a+be^t$ and $f''(t)=be^t

2018: > 0$. So, $f(t)$ has the minimum value when $t=\ln

2019: \left(\frac{a}{b} \right)$.

2020:

2021: Thus, $P\{E_{0,1}^*(i,m)\}$ is bounded as

2022: \begin{align}

2023:   P\{E_{0,1}^*(i,m)\}& \leq {\rm exp} \bigg\{-a\ln \left( \frac{a}{b}

2024:   \right)+a-b \bigg\}

2025: \end{align}

2026: where $a=2^{n(R'_2+\e')}$ and $b=2^{n(R'_2+3\e_1)}$.

2027:

2028: So,

2029: \begin{align}

2030:   P\{E_{0,1}^*(i,m)\} &\leq {\rm exp} \left\{-2^{n(R'_2+3\e_1)}

2031:   [2^{n(\e'-3\e_1)}\ln(2^{n(\e'-3\e_1)})-2^{n(\e'-3\e_1)}+1 ]

2032:   \right\}\\*

2033:   &= {\rm exp} \big\{-2^{n(R'_2+3\e_1)}\eta \big\} \\

2034:   &\leq  {\rm exp} \big\{-2^{n(R'_2+3\e_1)} \big\}

2035: \end{align}

2036: where

2037: $\eta=2^{n(\e'-3\e_1)}\ln(2^{n(\e'-3\e_1)})-2^{n(\e'-3\e_1)}+1$. Note

2038: that  $\eta >0$ since (i) $\e'>3\e_1$ and (ii) for $x>1$, $g(x)=x\ln(x)-x+1$ is

2039: increasing function of $x$ and $g(x)>0$. Further $g(e)=1$.

2040:

2041: Therefore, for sufficiently large $n$, by applying the union bound,

2042: \begin{align}

2043: P\{E_{0,1}^*\} &=P\Big\{ \bigcup_{i=1}^{2^{nR_1}} \bigcup_{m=1}^{2^{nR_0}}

2044: E_{0,1}^*(i,m) \Big\}\\

2045:                &\leq \sum_{i=1}^{2^{nR_1}}\sum_{m=1}^{2^{nR_0}}

2046:  P\{E_{0,1}^*(i)\}\\

2047:                &\leq 2^{n(R_0+R_1)}{\rm exp} \big\{-2^{n(R'_2+3\e_1)}

2048:  \big\} \\

2049:                &={\rm exp} \big\{n(R_0+R_1)\ln2-2^{n(R'_2+3\e_1)}

2050:   \big\} \\

2051:                &\overset{(e)}{<} \frac{\e}{12}

2052: \end{align}

2053: where (e) is from the fact that

2054: $n(R_0+R_1)\ln2$ is linearly increasing but $2^{n(R'_2+3\e_1)}$ is

2055: exponentially increasing as $n$ increases.

2056:

2057: In a similar way, we can also show that

2058: $P\{E_{0,2}^*\}<\frac{\e}{12}$ for sufficiently large $n$.

2059:  \hfill $\blacksquare$

2060:

2061: \section{Proof of Lemma \ref{lem:bc-cc-3}}\label{appendix:bc-lem-cc-3}

2062: Now let us calculate the probability $P(E \cap E_1^c \cap E_2^c)$.

2063: Without loss of generality, let us assume that the outcome of the

2064: message triple is given by: $(W_0,W_1,W_2)=(m,i,j)$. Let

2065: $(U^n(k_i,m),V^n(l_j,m))$ be the pair of sequences that are jointly

2066: typical in the bin $B(i,m)$ and $C(j,m)$.

2067: Consider the following error events.

2068: \begin{itemize}

2069:   \item[$E_3$:] $(Z^n(m),U^n(k_i,m),V^n(l_j,m),X^n, Y_1^n, Y_2^n) \notin

2070:   \Ae(Z,U,V,X,Y_1,Y_2)$

2071:   \item[$E_4$:] Decoding step fails at receiver 1, i.e., $ \exists

2072:   ~\hat{m} \neq m$ such that $(Z^n(\hat{m}), Y_1^n) \in

2073:   \Ae(Z,Y_1)$ or

2074:   $ \exists ~\hat{k} \neq k_i$ such that $(U^n(\hat{k},m), Y_1^n, Z^n(m))

2075:   \in A_{\e}^{(n)}(U,Y_1,Z)$

2076:   \item[$E_5$:] Decoding step fails at receiver 2, i.e., $ \exists

2077:   ~\hat{m} \neq m$ such that $(Z^n(\hat{m}), Y_2^n) \in

2078:   \Ae(Z,Y_2)$ or

2079:   $ \exists ~\hat{l} \neq l_j$ such that $(V^n(\hat{l},m), Y_2^n, Z^n(m))

2080:   \in A_{\e}^{(n)}(V,Y_2,Z)$

2081: \end{itemize}

2082:

2083: It is easy to see that, for sufficiently large $n$,

2084: $P(E_3) \leq \frac{2\e}{9}$, using  the Markov lemma

2085: \cite{berger,cover-thomas};

2086: $P(E_4) \leq \frac{2\e}{9}$, if $R_0 \leq I(Z;Y_1)-\e$ and

2087: $P(E_5) \leq \frac{2\e}{9}$, if $R_0 \leq I(Z;Y_2)-\e$.

2088: Recall that for all $m \in \{1,2,\ldots,2^{nR_0}\}$,

2089: $|\C_1(m)|=2^{n(I(U;Y_1|Z)-\e)}$ and

2090: $|\C_2(m)|=2^{n(I(V;Y_2|Z)-\e)}$.

2091:

2092: So,

2093: \begin{align}

2094:   P(E \cap E_1^c \cap E_2^c) &= P(\cup_{i=3}^5 E_i \cap E_1^c \cap

2095:   E_2^c)\\

2096:   &\leq \sum_{i=3}^5 P\left(E_i \cap E_1^c \cap E_2^c \right) <

2097:   \frac{2\e}{3}

2098: \end{align}

2099: Hence, the proof of Lemma \ref{lem:bc-cc-3} has been completed.

2100: \hfill $\blacksquare$

2101:

2102:

2103: \section{Proof of Theorem \ref{thm:one-det-bc-cor}}

2104: \label{appendix:one-det-bc-cor}

2105:

2106: Here we provide the converse part, as the direct part follows from

2107: Theorem \ref{thm:rate-region-bc-cor}. The proof is very similar to

2108: Marton's proof \cite{marton79} of the

2109: outer bound of the capacity region of semi-deterministic broadcast

2110: channels. Consider any sequence (indexed by $n$) of $(n,\tau(n))$

2111: transmission systems for a sequence of bipartite graphs $G_n$,

2112: respectively, with parameters

2113: $(1,2^{nR_1},2^{nR_2},2^{nR'_1},2^{nR'_2},\mu(n))$ and the

2114: broadcast channel with one deterministic component such that

2115: $\tau(n) \rightarrow 0$, and $\frac{1}{n} \log \mu(n) \rightarrow 0$

2116: as $n \rightarrow \infty$.

2117:

2118: By Fano's Inequality,

2119: \begin{align}

2120:   H(W_1,W_2|Y_1^n,Y_2^n) ~&\leq ~ \tau(n) \cdot n(R_1+R_2) +

2121:   H(\tau(n)) ~\leq~n\e_n \\

2122:   H(W_1|Y_1^n) ~&\leq ~ \tau(n) \cdot n R_1 + H(\tau(n)) ~\leq~n\e_n \\

2123:   H(W_2|Y_2^n) ~&\leq ~ \tau(n) \cdot n R_2 + H(\tau(n)) ~\leq~n\e_n

2124: \end{align} where $\e_n \rightarrow 0$ as $\tau(n) \rightarrow 0 $.

2125: We can bound the rate $R_1$ as

2126: \begin{align}

2127:   n R_1 &= H(W_1) = I(W_1;Y_1^n)+H(W_1|Y_1^n)\\

2128:         &\leq I(W_1;Y_1^n)+ n\e_n

2129:         \leq H(Y_1^n) + n \e_n \\

2130:         &\leq \sum_{i=1}^{n} H(Y_{1i}) + n\e_n\\

2131:         \Rightarrow R_1 &\leq \frac{1}{n}\sum_{i=1}^{n} H(Y_{1i}) +

2132:         \e_n. \label{eq:15}

2133: \end{align}

2134: $R_2$ can be bounded as

2135: \begin{align}

2136:   nR_2 &= H(W_2) = I(W_2;Y_2^n)+H(W_2|Y_2^n)\\

2137:         &\leq I(W_2;Y_2^n)+ n\e_n

2138:         =\sum_{i=1}^{n} I(W_2;Y_{2i}|Y_{2}^{(i-1)}) + n\e_n\\

2139: 	 &\leq \sum_{i=1}^{n}

2140:         [H(Y_{2i})-H(Y_{2i}|W_2,Y_2^{i-1},Y_{1(i+1)}^n)] + n\e_n\\

2141:         &=\sum_{i=1}^{n} I(V_i;Y_{2i}) + n\e_n, ~~\text{by defining

2142:         $V_i=(W_2,Y_2^{i-1},Y_{1(i+1)}^n)$}\\

2143: \Rightarrow R_2 &\leq \frac{1}{n}\sum_{i=1}^{n} I(V_i;Y_{2i}) +

2144: \e_n. \label{eq:16}

2145: \end{align}

2146:

2147: Using the fact that (a) the channel is discrete memoryless and is used

2148: without feedback, and (b) $Y_1$ is a deterministic function of the

2149: channel input, it can be easily shown that $V_i \rightarrow X_i

2150: \rightarrow (Y_{1i},Y_{2i})$.

2151: Before we bound the sum of rates $R_1+R_2'$, let us recall the

2152: following identity from \cite{gel80}: for any two sequence of

2153: random variables $A^n$ and $B^n$,

2154: \beq

2155: \sum_{i=1}^n I(A^{(i-1)};B_i|B_{(i+1)}^n)=

2156: \sum_{i=1}^n I(B_{(i+1)}^n;A_i|A^{(i-1)}).

2157: \label{specialequality}

2158: \eeq

2159:

2160: The number of all possible pairs of messages $(W_1,W_2)$ which have

2161: non-zero probability is lower

2162: bounded by $2^{n(R_1+R_2')}\mu^{-1}=2^{n(R_1'+R_2)}\mu^{-1}$.

2163: So, we can bound the sum rate $n(R_1+R_2')$ as

2164: \begin{align}

2165:   n(R_1+R_2') &\leq H(W_1,W_2) + \log \mu\\

2166:         &=H(W_2)+H(W_1|W_2) + \log \mu\\

2167:         &=I(W_2;Y_2^n)+\underbrace{H(W_2|Y_2^n)}_{\leq n\e_n}

2168:         +I(W_1;Y_1^n|W_2)+\underbrace{H(W_1|Y_1^n,W_2)}_{\leq

2169:         H(W_1|Y_1^n) \leq ~n\e^n} + \log \mu\\

2170:         &\leq I(W_1;Y_1^n|W_2)+I(W_2;Y_2^n)+2n\e_n + \log \mu\\

2171:         &= H(Y_1^n)-I(W_2;Y_1^n)+I(W_2;Y_2^n)+2n\e_n + \log \mu\\

2172:         &\overset{(a)}{\leq} H(Y_1^n) -\left[ \sum_{i=1}^n (I(W_2,

2173:         Y_{1(i+1)}^n; Y_{1i}) - I(Y_{1(i+1)}^n;Y_{1i})) \right] +

2174:         \sum_{i=1}^n I(W_2,Y_2^{(i-1)}; Y_{2i})+2n\e_n + \log \mu\\

2175:         &\overset{(b)}= \sum_{i=1}^n H(Y_{1i})+ \sum_{i=1}^n \left[

2176:         I(V_i;Y_{2i})-I(V_i;Y_{1i}) \right] + 2n\e_n + \log \mu\\

2177:         &= \sum_{i=1}^n \left[H(Y_{1i}|V_i)+I(V_i;Y_{2i}) \right] +

2178:         2n\e_n + \log \mu\\

2179:        \Rightarrow R_1 + R_2' &\leq \frac{1}{n}\sum_{i=1}^{n}

2180: (H(Y_{1i}|V_i)+I(V_i;Y_{2i})) + 2\e_n + \frac{1}{n}\log \mu \label{eq:17}

2181: \end{align}

2182: where\\

2183: $(a)$ follows from the fact that $I(W_2;Y_{2i}|Y_{2}^{(i-1)}) \leq

2184: I(W_2,Y_{2}^{(i-1)};Y_{2i})$. \\

2185: $(b)$ from the fact that $H(Y_1^n)+\sum_{i=1}^n I(Y_{1(i+1)}^n;Y_{1i})=

2186: \sum_{i=1}^n H(Y_{1i})$, and (\ref{specialequality}).

2187:

2188: To complete the proof, let us define a new random variable $Q$

2189: uniformly distributed over the set $Q=\{1,2,\ldots,n\}$ with

2190: probability $\frac{1}{n}$ as shown in \cite{cover-thomas,gelfand80}. By

2191: defining $V=V_Q$, $Y_1=Y_{1Q}$, $Y_2=Y_{2Q}$, it can be shown that

2192: the inequalities (\ref{eq:12}), (\ref{eq:13}), and (\ref{eq:14})

2193: are equivalent to (\ref{eq:15}), (\ref{eq:16}), and (\ref{eq:17}),

2194: respectively. \hfill $\blacksquare$

2195:

2196:

2197:

2198: \section{Proof of Lemma \ref{lem:bc-sc-3}}\label{appendix:bc-lem-sc-3}

2199: Let us calculate the probability $P(E \cap E_1^c \cap E_2^c)$. If

2200: previous error events $E_1$ or $E_2$ do not occur, we define other

2201: error events as follows.

2202: \begin{itemize}

2203:   \item[$E_3$]: $(S^n,T^n) \notin \Ae$,

2204:   \item[$E_4$]: $\nexists m \in \{1,2,\ldots,2^{nR_0}\}$ such that

2205:   $(S^n,T^n,Z^n(m))\in \Ae(S,T,Z)$,

2206:   \item[$E_5$]: $\nexists i \in \{1,2,\ldots,2^{nR_1}\}$ such that

2207:     $S^n(i,m)=S^n$ and

2208: $S^n(i,m) \in B(m)$,

2209:   \item[$E_6$]: $\nexists j \in \{1,2,\ldots,2^{nR_2}\}$ such that

2210:     $T^n(j,m)=T^n$ and

2211: $T^n(j,m) \in C(m)$,

2212: \end{itemize}

2213: Then,

2214: \begin{align}

2215:   P(E \cap E_1^c \cap E_2^c)&=P\left( \cup_{i=3}^6 E_i \cap (E_1^c \cap

2216:                                    E_2^c) \right)\\

2217:      &\leq  P\left[E_3 \cap (E_1^c \cap E_2^c) \right] +

2218: \sum_{i=4}^6 P\left(E_i \cap E^c_3 \cap  (E_1^c \cap E_2^c) \right)

2219: \end{align}

2220:

2221: By the property of jointly typical sequences \cite{cover-thomas},

2222: $P[E_3 \cap (E_1^c \cap E_2^c)]< \frac{\e}{8}$ for sufficiently large

2223: $n$. Using arguments of Chapter 13 of \cite{cover-thomas}, it can be

2224: shown that  for sufficiently large $n$, if $ R_0 > I(S,T;Z) + \e_2(\e)$ where

2225: $\e_2(\e) \ra 0$ as $\e \ra 0$, then

2226: $P[E_4 \cap E_3^c \cap (E_1^c \cap E_2^c)] \leq \frac{\e}{8}$.

2227:  See Chapter 13 of \cite{cover-thomas}

2228: for a characterization of $\e_2$. Now

2229: \begin{align}

2230:   P[E_5 \cap E_3^c \cap E_4^c \cap (E_1^c \cap E^c_2)]

2231:   &\overset{(a)}{=}P[\forall S^n(i,m) \in B(m), S^n(i,m) \neq S^n | S^n

2232:   \in A(S|Z^n(m))]\\

2233:          &\overset{(b)}{\leq} \left[1-2^{-n(H(S|Z)+\e_1(\e))}

2234:   \right]^{2^{nR_1}}\\

2235:          &\overset{(c)}{\leq} {\rm exp}\{-2^{n(R_1-H(S|Z)-\e_1(\e))}\}\\

2236:          &< \frac{\e}{8}

2237: \end{align}

2238: for sufficiently large $n$, if $R_1>H(S|Z)+\e_1(\e)$ where\\

2239: (a) is from the fact that $(S^n,T^n,Z^n(m))\in \Ae(S,T,Z)$ implies

2240: $S^n \in A(S|Z^n(m))$,\\

2241: (b) is obtained using the fact that each $S^n(i,m)$ has the same chance of

2242: equaling $S^n$ and independently chosen, and

2243: the fact that  $|A(S|Z^n(m))|\leq 2^{n(H(S|Z)+\e_1(\e))}$,\\

2244: (c) follows from $(1-x)^n \leq e^{-xn}$ for $0 \leq x \leq 1$ and

2245: $n>0$.

2246:

2247: So,

2248: \begin{align}

2249:   P[E_5 \cap E_3^c \cap (E_1^c\cap E^c_2)]&=P[E_5 \cap E_3^c \cap E_4

2250:                                    \cap (E_1^c \cap E^c_2)]

2251:                                    + P[E_5 \cap E_3^c \cap E_4^c \cap

2252:                                    (E_1^c \cap E^c_2)]\\

2253:                                    &\leq P[E_4 \cap E_3^c \cap (E_1^c

2254:                                    \cap E^c_2)]

2255:                                    + P[E_5 \cap E_3^c \cap E_4^c \cap

2256:                                    (E_1^c \cap E_2^c)]\\

2257:                                    &< \frac{\e}{4}

2258: \end{align}

2259:

2260: Similarly, it can be shown that $P[E_6 \cap E_3^c \cap (E_1^c \cap E^c_2)] <

2261: \frac{\e}{4}$ if $R_2>H(T|Z)+\e_1(\e)$ for sufficiently large $n$.

2262: Hence $P[E \cap(E_1^c \cap E_2^c)] \leq \frac{3 \e}{4}$.

2263: \hfill $\blacksquare$

2264:

2265:

2266: \section{Proof of Theorem \ref{thm:bc-sc-main_c}}

2267: \label{appendix:bc-sc-main_c}

2268:

2269: Let $f(n),g_1(n),g_2(n)$ be a fixed sequence of encoder and

2270: decoders. Also, let $f(S^n,T^n)=(W_0,W_1,W_2)$.

2271: We can bound the rate $R_0$ as

2272: \begin{align}

2273:   nR_0 &\geq H(W_0)\geq I(S^n,T^n;W_0)\\

2274:                &=\sum_{i=1}^n [H(S_i,T_i|S^{i-1},T^{i-1})-

2275:                H(S_i,T_i|\underbrace{S^{i-1},T^{i-1},W_0}_{\triangleq

2276:                Z_i})]\\

2277:               &= \sum_{i=1}^n [H(S_i,T_i)- H(S_i,T_i|Z_i)]

2278:               = \sum_{i=1}^n I(S_i,T_i;Z_i) \label{eq:bc-sc-12}

2279: \end{align}

2280: Using the constraints on the degrees of the vertexes in the graph

2281: associated with messages, we can also have

2282: \begin{align}

2283: n(R_2+R'_1)=  n(R_1+R_2') &\overset{(a)}{\geq} H(W_1,W_2|W_0) - \log \mu\\

2284:               &= I(S^n,T^n;W_1,W_2|W_0)+H(W_1,W_2|S^n,T^n,W_0) - \log \mu\\

2285:               &\overset{(b)}{=} I(S^n,T^n;W_1,W_2|W_0) - \log \mu\\

2286:               &=H(S^n,T^n|W_0)-H(S^n,T^n|W_1,W_2,W_0) - \log \mu\\

2287:               &\overset{(c)}{\geq} H(S^n,T^n|W_0)-n\e_n - \log \mu \\

2288:               &= \sum_{i=1}^n

2289:               H(S_i,T_i|\underbrace{S^{i-1},T^{i-1},W_0}_{=Z_i})-n\e_n

2290:               - \log

2291:               \mu,\\

2292:               &= \sum_{i=1}^n H(S_i,T_i|Z_i)-n\e_n - \log \mu,

2293:               \label{eq:bc-sc-10}

2294: \end{align}

2295: where

2296: (a) is from the constraints on the degrees of the graph,

2297: (b) is obtained since $W_1$ and $W_2$ is a function of $S^n$ and

2298: $T^n$,

2299: (c) follows from Fano's inequality.

2300: Also, we can also bound the rate $R_1$ as

2301: \begin{align}

2302:   nR_1 &\geq H(W_1) \geq H(W_1|W_0)

2303:               = I(S^n;W_1|W_0)+H(W_1|S^n,W_0) \\

2304:               &\overset{(a)}{\geq} I(S^n;W_1|W_0)

2305:               = H(S^n|W_0)-H(S^n|W_1,W_0)\\

2306:               &\overset{(b)}{\geq} H(S^n|W_0)-n\e_n

2307:               = \sum_{i=1}^n

2308:               H(S_i|S^{i-1},W_0)-n\e_n,\\

2309:               &\overset{(c)}{\geq} \sum_{i=1}^n

2310:               H(S_i|\underbrace{S^{i-1},T^{i-1},W_0}_{=Z_i})-n\e_n,\\

2311:               &= \sum_{i=1}^n H(S_i|Z_i)-n\e_n, \label{eq:bc-sc-13}

2312: \end{align}

2313: where

2314: (a) is obtained since $H(W_1|S^n,W_0) \geq 0$,

2315: (b) follows from Fano's inequality, and

2316: (c) follows from adding conditioning.

2317:

2318: Similarly, we also can obtain

2319: \begin{align}

2320:   nR_2 &\geq \sum_{i=1}^n H(T_i|Z_i) - n\e_n. \label{eq:bc-sc-14}

2321: \end{align}

2322:

2323: Therefore, using arguments similar to those of \cite{wyner75},

2324: we can have the partial converse by dividing the

2325: inequalities (\ref{eq:bc-sc-10}),

2326: (\ref{eq:bc-sc-12}), (\ref{eq:bc-sc-13}), and (\ref{eq:bc-sc-14})

2327:  by $n$, and taking the limit as $n\ra\infty$.

2328:  \hfill $\blacksquare$

2329:

2330:

2331:

2332: \begin{thebibliography}{10}

2333: \bibitem{han-costa87}

2334: T.~S. Han and H.~M. Costa, ``Broadcast channels with arbitrarily correlated

2335:   sources,'' \emph{IEEE Trans. Inform. Theory}, vol. IT-33, no.~5, pp.

2336:   641--650, Sep. 1987.

2337:

2338: \bibitem{cover72}

2339: T.~M. Cover, ``Broadcast channels,'' \emph{IEEE Trans. Inform. Theory}, vol.

2340:   IT-18, no.~1, pp. 2--14, Jan. 1972.

2341:

2342: \bibitem{cover75}

2343: ------, ``An achievable rate region for the broadcast channel,'' \emph{IEEE

2344:   Trans. Inform. Theory}, vol. IT-21, no.~4, pp. 399--404, Jul. 1975.

2345:

2346: \bibitem{bergmans73}

2347: P.~P. Bergmans, ``Random coding theorems for the broadcast channels with

2348:   degraded components,'' \emph{IEEE Trans. Inform. Theory}, vol. IT-15, pp.

2349:   197--207, Mar. 1973.

2350:

2351: \bibitem{bergmans74}

2352: ------, ``A simple converse for broadcast channels with additive white

2353:   {G}aussian noise,'' \emph{IEEE Trans. Inform. Theory}, vol. IT-20, pp.

2354:   279--280, Mar. 1974.

2355:

2356: \bibitem{gallager74}

2357: R.~G. Gallager, ``Capacity and coding for degraded broadcast channels,''

2358:   \emph{Probl. Pered. Inform.}, vol.~10, no.~3, pp. 3--14, Jul.-Sep. 1974,

2359:   ;translated in \emph{Probl. Inform. Transm.}, pp. 185-193, Jul.-Sep. 1974.

2360:

2361: \bibitem{ahlswede-korner75}

2362: R.~Ahlswede and J.~{K\"{o}rner}, ``Source coding with side information and a

2363:   converse for degraded broadcast channels,'' \emph{IEEE Trans. Inform.

2364:   Theory}, vol. IT-21, pp. 629--637, Nov. 1975.

2365:

2366: \bibitem{korner-marton77}

2367: J.~{K\"{o}rner} and K.~Marton, ``General broadcast channels with degraded

2368:   message sets,'' \emph{IEEE Trans. Inform. Theory}, vol. IT-23, pp. 60--64,

2369:   Jan. 1977.

2370:

2371: \bibitem{elgamal79}

2372: A.~{El Gamal}, ``The capacity of a class of broadcast channels,'' \emph{IEEE

2373:   Trans. Inform. Theory}, vol. IT-25, pp. 166--169, Mar. 1979.

2374:

2375: \bibitem{marton77}

2376: K.~Marton, ``The capacity region of deterministic broadcast channels,'' in

2377:   \emph{Trans. Int. Symp. Inform. Theory}, Paris-Cachan, France, 1977.

2378:

2379: \bibitem{pinsker78}

2380: M.~S. Pinsker, ``Capacity of noiseless broadcast channels,'' \emph{Probl.

2381:   Pered. Inform.}, vol.~14, no.~2, pp. 28--34, Apr.-Jun. 1978, translated in

2382:   \emph{Probl. Inform. Transm.}, pp. 97-102, Apr.-June 1978.

2383:

2384: \bibitem{marton79}

2385: K.~Marton, ``A coding theorem for the discrete memoryless broadcast channel,''

2386:   \emph{IEEE Trans. Inform. Theory}, vol. IT-25, no.~3, pp. 306--311, May 1979.

2387:

2388: \bibitem{gelfand80}

2389: S.~I. Gelfand and M.~S. Pinsker, ``Capacity of a broadcast channel with one

2390:   deterministic component,'' \emph{Probl. Pered. Inform.}, vol.~16, no.~1, pp.

2391:   24--34, Jan.-Mar. 1980, ; translated in \emph{Probl. Inform. Transm.}, vol.

2392:   16, no. 1, pp. 17-25, Jan.-Mar. 1980.

2393:

2394: \bibitem{elgamal}

2395: A.~{El Gamal}, ``The capacity of the product and sum of two reversely degraded

2396:   broadcast channels,'' \emph{Probl. Pered. Inform.}, vol.~16, pp. 3--23,

2397:   Jan.-Mar. 1980.

2398:

2399: \bibitem{tse97}

2400: D.~Tse, ``Optimal power allocation over parallel {G}aussian broadcast

2401:   channels,'' \emph{Proc. Int. symp. on Inform. Theory (ISIT)}, July 1997.

2402:

2403: \bibitem{li01}

2404: L.~Li and A.~Goldsmith, ``Capacity and optimal resource allocation for fading

2405:   broadcast channels-{P}art {I}: Ergodic capacity,'' \emph{IEEE Trans. on

2406:   Inform. Theory}, vol.~47, pp. 1083--1102, March 2001.

2407:

2408: \bibitem{caire03}

2409: G.~Caire and S.~Shamai, ``{O}n the achievable throughput of a multiantenna

2410:   gaussian broadcast channel hannel,'' \emph{IEEE Trans. on Inform. Theory},

2411:   vol.~49, no.~7, pp. 1691--1706, July 2003.

2412:

2413: \bibitem{viswanathtsc03}

2414: P.~Viswanath and D.~N.~C. Tse, ``{S}um capacity of the vector {G}aussian

2415:   channel and uplink-downlink duality,'' \emph{IEEE Trans. on Inform. Theory},

2416:   vol.~49, no.~8, pp. 1912--1921, August 2003.

2417:

2418: \bibitem{vishwanathjgda03}

2419: S.~Vishwanath, N.~Jindal, and A.~Goldsmith, ``{D}uality, achievable rates and

2420:   sum-rate capacity of {G}aussian {M}{I}{M}{O} broadcast channel,'' \emph{IEEE

2421:   Trans. on Inform. Theory}, vol.~49, pp. 2658--2668, October 2003.

2422:

2423: \bibitem{yu04}

2424: W.~Yu and J.~M. Cioffi, ``Sum capacity of {G}aussian vector broadcast

2425:   channels,'' \emph{IEEE Trans. on Inform. Theory}, vol.~50, no.~9, pp.

2426:   1875--1892, September 2004.

2427:

2428: \bibitem{weingarten04}

2429: H.~Weingarten, Y.~Steinberg, and S.~{Shamai(Shitz)}, ``The capacity region of

2430:   the {Gaussian MIMO} broadcast channel,'' \emph{Proc. Conf. on Inform.

2431:   Sciences and Systems (CISS)}, Mar. 2004.

2432:

2433: \bibitem{meulen75}

2434: E.~C. {Van der Meulen}, ``Random coding theorems for the general discrete

2435:   memoryless broadcast channel,'' \emph{IEEE Trans. Inform. Theory}, vol.

2436:   IT-21, no.~2, pp. 180--190, Mar. 1975.

2437:

2438: \bibitem{hajek79}

2439: B.~E. Hajek and M.~B. Pursley, ``Evaluation of an achievable rate region for

2440:   the broadcast channel,'' \emph{IEEE Trans. Inform. Theory}, vol. IT-25,

2441:   no.~1, pp. 36--46, Jan. 1979.

2442:

2443: \bibitem{elgamal81}

2444: A.~{El Gamal} and E.~{Van der Meulen}, ``A proof of marton's coding theorem for

2445:   the discrete memoryless broadcast channel,'' \emph{IEEE Trans. Inform.

2446:   Theory}, vol. IT-27, no.~1, pp. 120--122, Jan. 1981.

2447:

2448: \bibitem{sato78}

2449: H.~Sato, ``{A}n outer bound on the capacity region of broadcast channel,''

2450:   \emph{IEEE Trans. on Inform. Theory}, vol.~24, pp. 374--377, May 1978.

2451:

2452: \bibitem{nair06}

2453: C.~Nair and A.~E. Gamal, ``An outer bound to the capacity region of the

2454:   broadcast channel,'' in \emph{Proc. IEEE Int. Symp. on Inform. Theory

2455:   (ISIT)}, July 2006.

2456:

2457: \bibitem{cover98}

2458: T.~M. Cover, ``Comments on broadcast channels,'' \emph{{I}{E}{E}{E} Trans. on

2459:   Inform. Theory}, vol.~44, no.~6, pp. 2524--2530, October 1998.

2460:

2461: \bibitem{gray74}

2462: R.~M. Gray and A.~D. Wyner, ``Source coding for a simple network,'' \emph{Bell

2463:   Syst. Tech. J.}, vol.~53, pp. 1681--1721, Nov. 1974.

2464:

2465: \bibitem{cover-elgamal-salehi80}

2466: T.~M. Cover, A.~{El Gamal}, and M.~{Salehi}, ``Multiple-access channel with

2467:   arbitrarily correlated sources,'' \emph{IEEE Trans. Inform. Theory}, vol.

2468:   IT-26, no.~6, pp. 648--657, Nov. 1980.

2469:

2470: \bibitem{pradhan-choi-isit}

2471: S.~S. Pradhan, S.~Choi, and K.~Ramchandran, ``Achievable rates for

2472:   multiple-access channels with correlated messages,'' in \emph{Proc. IEEE Int.

2473:   Symp. on Inform. Theory (ISIT)}, Jun. 2004.

2474:

2475: \bibitem{pradhan-choi06}

2476: ------, ``A graph-based framework for transmission of correlated sources over

2477:   multiple access channel,'' \emph{\emph{submitted to} IEEE Trans. Inform.

2478:   Theory}, Jan. 2006.

2479:

2480: \bibitem{ahlswede-han83}

2481: R.~Ahlswede and T.~S. Han, ``On source coding with side information via a

2482:   multiple-access channel and related problems in multi-user information

2483:   theory,'' \emph{IEEE Trans. Inform. Theory}, vol. IT-29, pp. 396--412, May

2484:   1983.

2485:

2486: \bibitem{cover-thomas}

2487: T.~M. Cover and J.~A. Thomas, \emph{Elements of Information Theory}.\hskip 1em

2488:   plus 0.5em minus 0.4em\relax New York:Wiley, 1991.

2489:

2490: \bibitem{vanlint}

2491: J.~H. van Lint and R.~M. Wilson, \emph{A course in combinatorics}.\hskip 1em

2492:   plus 0.5em minus 0.4em\relax Cambrigde University Press, 2003.

2493:

2494: \bibitem{csiszar-korner}

2495: I.~Csisz\'{a}r and {J. K\"{o}rner}, \emph{Information Theory: Coding Theorems

2496:   for Discrete Memoryless Systems}.\hskip 1em plus 0.5em minus 0.4em\relax

2497:   Academic Press, New York, 1981.

2498:

2499: \bibitem{wyner75}

2500: A.~D. Wyner, ``The common information of two dependent random variables,''

2501:   \emph{IEEE Trans. Inform. Theory}, vol. IT-21, no.~2, pp. 163--179, Mar.

2502:   1975.

2503:

2504: \bibitem{gacs}

2505: P.~{G\'{a}cs} and J.~{K\"{o}rner}, ``Common information is much less than

2506:   mutual information,'' \emph{Problems of Control and Information Theory},

2507:   vol.~2, pp. 149--162, 1973.

2508:

2509: \bibitem{wit}

2510: H.~S. Witsenhausen, ``On sequences of pairs of dependent random variables,''

2511:   \emph{SIAM J. Appl. Math.}, vol.~28, pp. 100--113, Jan. 1975.

2512:

2513: \bibitem{tuncel05}

2514: E.~Tuncel, ``Lossless joint source-channel coding across broadcast channels

2515:   with decoder side information,'' in \emph{Proc. IEEE Int. Symp. on Inform.

2516:   Theory (ISIT)}, September 2005.

2517:

2518: \bibitem{coleman06}

2519: T.~Coleman, E.~Martinian, and E.~Ordentlich, ``Joint source-channel decoding

2520:   for transmitting correlated sources over broadcast channels,'' in \emph{Proc.

2521:   IEEE Int. Symp. on Inform. Theory (ISIT)}, July 2006.

2522:

2523: \bibitem{jansbook}

2524: S.~Janson, T.~Luczak, and A.~Ruci\'{n}ski, \emph{Random Graphs}.\hskip 1em plus

2525:   0.5em minus 0.4em\relax John Wiley and Sons, Inc., 2000.

2526:

2527: \bibitem{krithivasan}

2528: D.~Krithivasan and S.~S. Pradhan, ``On certain large deviation analysis of

2529:   sampling from typical sets,'' Univ. of Michigan, Ann Arbor, Communication and

2530:   Signal Processing Laboratory (CSPL) Report No. 341, July 2006.

2531:

2532: \bibitem{berger}

2533: T.~Berger, \emph{Multiterminal Source Coding. In: The Information Theory

2534:   Approach to Communications (ed. G. Longo), CISM Courses and Lecture Notes No.

2535:   229}.\hskip 1em plus 0.5em minus 0.4em\relax Springer, Wien-New York, 1977.

2536:

2537: \bibitem{gel80}

2538: S.~Gel'fand and M.~Pinsker, ``Coding for channel with random parameters,''

2539:   \emph{Problems of Control and Information Theory}, vol.~9, pp. 19--31, 1980.

2540:

2541: \end{thebibliography}

2542:

2543:

2544:

2545: \end{document}

2546: