cs0701196/TSPv4.tex
1: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
2: %2345678901234567890123456789012345678901234567890123456789012345678901234567890
3: %        1         2         3         4         5         6         7         8
4: 
5: \documentclass[11pt,journal,draftcls,letterpaper,onecolumn,oneside]{IEEEtran}
6: %\documentclass[10pt,journal,final,letterpaper,twocolumn,twoside]{IEEEtran}
7: 
8: \IEEEoverridecommandlockouts                              % This command is only
9:                                                           % needed if you want to
10:                                                           % use the \thanks command
11: \overrideIEEEmargins
12: % See the \addtolength command later in the file to balance the column lengths
13: % on the last page of the document
14: 
15: \usepackage{epsfig,latexsym,amsfonts,amsmath,amssymb,verbatim,cite,mathrsfs}
16: \usepackage{graphicx}
17: \usepackage[dvips]{color}
18: 
19: \bibliographystyle{IEEEtran}
20: 
21: \def\nN{{\mathbb N}}
22: \def\zZ{{\mathbb Z}}
23: \def\rR{{\mathbb R}}
24: \def\eE{{\mathbb E}}
25: \def\pP{{\mathbb P}}
26: \def\cC{{\mathbb C}}
27: 
28: \def\QED{\mbox{\rule[0pt]{1.5ex}{1.5ex}}}
29: \def\proof{\noindent\hspace{2em}{\it Proof: }}
30: \def\endproof{\hspace*{\fill}~\QED\par\endtrivlist\unskip}
31: 
32: \def\@begintheorem#1#2{\tmpitemindent\itemindent\topsep 0pt\rm\trivlist
33:     \item[\hskip \labelsep{\indent\it #1\ #2:}]\itemindent\tmpitemindent}
34: \def\@opargbegintheorem#1#2#3{\tmpitemindent\itemindent\topsep 0pt\rm \trivlist
35:     \item[\hskip\labelsep{\indent\it #1\ #2\
36:     \rm(#3):}]\itemindent\tmpitemindent}
37: \def\@endtheorem{\endtrivlist\unskip}
38: 
39: \newtheorem{definition}{Definition}[section]
40: \newtheorem{remark}{Remark}[section]
41: \newtheorem{theorem}{Theorem}[section]
42: \newtheorem{fact}{Fact}[section]
43: \newtheorem{proposition}{Proposition}[section]
44: \newtheorem{corollary}{Corollary}[section]
45: %\newtheorem{fact}[theorem]{Fact}
46: %\newtheorem{proposition}[theorem]{Proposition}
47: %\newtheorem{corollary}[theorem]{Corollary}
48: \renewcommand{\theequation}{\arabic{section}.\arabic{equation}}
49: \newcommand{\Section}[1]{\section{#1}
50: \setcounter{equation}{0}}
51: \newcommand{\proofover}{\hfill\vrule height8pt width8pt depth 0pt}
52: 
53: %% I would rather use the label command since it is recognized by WinEdt
54: %% and makes it easier to call up a label   ~YW
55: % \newcommand{\seclbl}[1]{\label{sec:#1}}
56: \newcommand{\secref}[1]{Section~\ref{#1}}
57: \newcommand{\appref}[1]{Appendix~\ref{#1}}
58: % \newcommand{\eqnlbl}[1]{\label{eq:#1}}
59: \newcommand{\eqnref}[1]{(\ref{#1})}
60: % \newcommand{\figlbl}[1]{\label{fig:#1}}
61: \newcommand{\figref}[1]{Figure~\ref{#1}}
62: \newcommand{\thmref}[1]{Theorem~\ref{#1}}
63: \newcommand{\defref}[1]{Definition~\ref{#1}}
64: \newcommand{\remref}[1]{Remark~\ref{#1}}
65: \newcommand{\corref}[1]{Corollary~\ref{#1}}
66: 
67: \setcounter{page}{1}
68: 
69: \title{One--bit Distributed Sensing and Coding for \\Field Estimation in
70: Sensor Networks
71: %$^*$\thanks{$^*$ This material is based upon work supported by the US
72: %National Science Foundation (NSF) under award (CAREER)
73: %CCF--0546598. Any opinions, findings, and conclusions or
74: %recommendations expressed in this material are those of the authors
75: %and do not necessarily reflect the views of the NSF.}
76: }
77: 
78: \author{Ye Wang, Prakash Ishwar, and Venkatesh
79: Saligrama$^\dagger$
80: \thanks{$^\dagger$ Y.~Wang, P.~Ishwar, and V.~Saligrama are with the
81:         Department of Electrical and Computer Engineering,
82:         Boston University, Boston, MA 02215. Email:
83:         {\tt\small \{yw,pi,srv\}@bu.edu}.}
84: }
85: 
86: 
87: \begin{document}
88: 
89: \maketitle
90: %\thispagestyle{empty}
91: %\pagestyle{empty}
92: \thispagestyle{plain}
93: \pagestyle{plain}
94: 
95: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
96: \begin{abstract}
97: This paper formulates and studies a general distributed field
98: reconstruction problem using a dense network of noisy one--bit
99: randomized scalar quantizers in the presence of additive observation
100: noise of unknown distribution.  A constructive quantization, coding,
101: and field reconstruction scheme is developed and an upper--bound to
102: the associated mean squared error (MSE) at any point and any snapshot
103: is derived in terms of the local spatio--temporal smoothness
104: properties of the underlying field. It is shown that when the noise,
105: sensor placement pattern, and the sensor schedule satisfy certain weak
106: technical requirements, it is possible to drive the MSE to zero with
107: increasing sensor density at points of field continuity while ensuring
108: that the per--sensor bitrate and sensing--related network overhead
109: rate simultaneously go to zero.  The proposed scheme achieves the
110: order--optimal MSE versus sensor density scaling behavior for the
111: class of spatially constant spatio--temporal fields.
112: \end{abstract}
113: 
114: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
115: \Section{\label{sec:intro}Introduction and Overview} We study the
116: problem of reconstructing, at a data fusion center, a temporal
117: sequence of spatial data fields, in a bounded geographical region of
118: interest, from finite bit--rate messages generated by a dense
119: noncooperative network of sensors. The data--gathering sensor network
120: is made up of noisy low--resolution sensors at known locations that
121: are statistically identical (exchangeable) with respect to the sensing
122: operation.  The exchangeability
123: % interchangeability
124: assumption reflects the property of an unsorted collection of
125: inexpensive mass--produced sensors that behave in a statistically
126: identical fashion. We view each data field as an unknown deterministic
127: function over the geographical space of interest and make only the
128: weak assumption that they have a known bounded maximum dynamic
129: range. The sensor observations are corrupted by bounded, zero--mean,
130: additive noise which is independent across sensors with arbitrary
131: dependencies across field snapshots. This {\em noise has an arbitrary,
132: unknown distribution} but a known maximum dynamic range. The sensors
133: are equipped with binary analog--to--digital converters (ADCs) in the
134: form of comparators with random thresholds which are uniformly
135: distributed over the (known) sensor dynamic range. These thresholds
136: are assumed to be independent across sensors with arbitrary
137: dependencies across snapshots. These modeling assumptions partially
138: account for certain real--world scenarios that include (i) the
139: unavailability of good initial statistical models for data fields in
140: yet to be well studied natural phenomena, (ii) unknown additive
141: sensing/observation noise sources, (iii) additive model perturbation
142: errors, (iv) substantial variation of preset comparator thresholds
143: accompanying the mass--manufacture of low--precision sensors, (v)
144: significant temperature fluctuations across snapshots affecting
145: hardware characteristics, and (vi) the use of intentional dither
146: signals for randomized scalar quantization.
147: 
148: Building upon prior results in
149: \cite{MasryC-IT1981-BPCNT,Masry-IT1981-RASFS}, and
150: \cite{Luo-IT2005-UDEBCSN}, we develop a simple coding and field
151: reconstruction scheme based on one--bit scalar quantized samples of
152: noisy observations. We characterize the associated scaling behavior of
153: the MSE of field reconstruction with sensor density in terms of the
154: local and global moduli of continuity of the underlying sequence of
155: fields. This MSE characterization is for fixed, positive, and equal
156: sensor coding rates (bits per sensor per snapshot).  These achievable
157: results reveal that for bounded, zero--mean, additive observation
158: noise of unknown distribution, the MSE at every point of continuity of
159: every field snapshot can be made to go to zero as sensor density
160: increases while simultaneously sending the per--sensor bitrate and any
161: sensing--related network rate overheads (e.g., sensor addresses) to
162: zero.  This is possible if the sensor placement and sampling schedule
163: satisfy a certain uniformity property. This property ensures that the
164: field estimate at any given spatial location is formed using the
165: observations from increasingly many sensors that are located within a
166: vanishingly smaller neighborhood of the location.
167: 
168: The MSE results of this work pertain to uniform pointwise convergence
169: to zero, that is, for every spatial location of every field, unlike
170: results pertaining to spatially and temporally averaged MSE which are
171: more commonly encountered.  The rate of decay of field reconstruction
172: MSE at a given location is related to the local modulus of continuity
173: of the field at the given location and time.  Specializing these
174: results to the case of spatially constant fields yields an achievable
175: MSE decay rate of $O(1/N)$ where $N$ is the sensor network
176: size.\footnote{Landau's asymptotic notation: $f(N) = O(g(N))
177: \Leftrightarrow \lim\sup_{N\rightarrow \infty}|f(N)/g(N)| < \infty$;
178: $f(N) = \Omega(g(N)) \Leftrightarrow g(N) = O(f(N))$; $f(N) =
179: \Theta(g(N)) \Leftrightarrow f(N) = O(g(N))\ \text{and}\ g(N) =
180: O(f(N))$.}  A Cram\'{e}r--Rao lower--bound on the MSE for parameter
181: estimation establishes that the $O(1/N)$ MSE scaling behavior is
182: order--optimal in a minimax sense. Since in our problem formulation,
183: the per--sensor bitrate is held fixed and equal across sensors, in a
184: scaling sense, the MSE decreases inversely with the total network
185: rate.
186: 
187: Previous estimation--theoretic studies of one--bit distributed field
188: reconstruction have focused on reconstructing a single field snapshot
189: and have either (i) assumed zero observation noise
190: \cite{MasryC-IT1981-BPCNT,Masry-IT1981-RASFS}, or (ii) assumed a
191: spatially constant field (equivalent to scalar parameter estimation)
192: with a one--bit communication as opposed to a one--bit sensing
193: constraint \cite{Luo-IT2005-UDEBCSN}. The system proposed in this work
194: integrates the desirable field sensing and reconstruction properties
195: of these apparently different one--bit field estimation schemes and
196: establishes the statistical and performance equivalence of these
197: approaches. An important hardware implication of this paper is that
198: noisy op--amps (noisy threshold comparators) are adequate for
199: high--resolution distributed field reconstruction. This should be be
200: contrasted with the framework in \cite{Luo-IT2005-UDEBCSN} which
201: implicitly requires sensors to have the ability to quantize their
202: observations to an arbitrarily high bit resolution. A side
203: contribution of this paper is the holistic treatment of the general
204: distributed field--reconstruction problem in terms of (i) the field
205: characteristics, (ii) sensor placement characteristics, (iii) sensor
206: observation, quantization, and coding constraints with associated
207: sensing hardware implications, (iv) transmission and sensing--related
208: network overhead rates, and (v) reconstruction and performance
209: criteria. We have attempted to explicitly indicate and keep track of
210: what information is known, available, and used where and what is not.
211: 
212: The randomized scalar quantization model for the sensor comparators
213: not only captures poor sensing capabilities but is also an enabling
214: factor in the high--fidelity reconstruction of signals from quantized
215: noisy observations. As shown in \cite{MarcoDLN-IPSN2003-MTCDWSN} in an
216: information--theoretic setting, and alluded to in
217: \cite{Masry-IT1981-RASFS}, the use of {\em identical} deterministic
218: scalar--quantization (SQ) in all sensors will result in the MSE
219: performance being fundamentally limited by the precision of SQ, {\em
220: irrespective of increasing sensor density}, even in the absence of
221: sensor observation noise.\footnote{The problem will persist even for
222: identical block vector--quantization (VQ) with identical binning
223: (hashing) operations.} However, our results further clarify that
224: having ``diversity'' in the scalar quantizers, achieved, for example,
225: through the means of an intentional random dither, noisy threshold, or
226: other mechanisms, can achieve MSE performance that tends to zero as
227: the density of sensors goes to infinity (\secref{sec:MSEanalysis},
228: Implications).  Randomization enables high--precision signal
229: reconstruction because zero--mean positive and negative fluctuations
230: around a signal value can be reliably ``averaged out'' when there are
231: enough independent noisy observations of the signal value.  This
232: observation is also corroborated by the findings reported in the
233: following related studies \cite{Masry-IT1981-RASFS,
234: MasryC-IT1981-BPCNT,
235: Luo-IT2005-UDEBCSN,zorands2000,zorandli2002,IshwarKR-2003-DSDSNBCP,KumarIR-2004-DSSNBLF}.
236: 
237: The results of this work are also aligned with the
238: information--theoretic, total network rate versus MSE scaling results
239: for the CEO problem which was first introduced in
240: \cite{BergerZV-IT1996-TCP} and thereafter studied extensively in the
241: information theory literature (see
242: \cite{ViswanathanB-IT1997-QGCP,PrabhakaranTR-ISIT04-RQGCP} and
243: references therein). However, it should be noted that
244: information--theoretic rate--distortion studies of this and related
245: distributed field reconstruction (multiterminal source coding)
246: problems typically consider stationary ergodic stochastic fields with
247: complete knowledge of the field and observation--noise statistics,
248: block--VQ and binning operations, and time and space--averaged (as
249: opposed to worst--case) expected distortion criteria. In VQ, sensors
250: are allowed to collect long blocks of real--valued field samples (of
251: infinite resolution) from multiple field snapshots before a discrete,
252: finite bit--rate VQ operation.  The fields are often assumed to be
253: spatially constant and independent and identically distributed (iid)
254: across time (frequently Gaussian) and the observation noise is often
255: assumed to be additive with a known distribution (frequently Gaussian)
256: as in the CEO problem. It should also be noted that the MSE scaling
257: results for the CEO problem in \cite{ViswanathanB-IT1997-QGCP} are
258: with respect to the total network rate where the number of agents (or
259: sensors) has already been sent to infinity while maintaining the total
260: network rate a finite value. Recent information--theoretic results for
261: stationary fields under zero observation noise have been developed in
262: \cite{KashyapLXL-2005-DSCDSN,NeuhoffP-2006-UPRIDLSC}.  There is also a
263: large body of work on centralized oversampled A--D conversion, e.g.,
264: see \cite{Cvetkovic-IT2003-RPREUANQ} and references therein.  Our work
265: does not explicitly address physical--layer network data transport
266: issues. In particular, we do not consider joint source--channel coding
267: strategies (however see remark before
268: Section~\ref{sec:deploymentStrats}). For certain types of joint
269: source--channel coding aspects of this and related problems, we refer
270: the reader to the following references
271: \cite{GastparV-2003-SCCSN,GastparRV-2003-TCNCLSCCR,NowakMW-2004-EIFWSN,LiuES-2005-OPFESN,BajwaSN-2005-MSCCFEWSN,LiuU-2006-ODPTGSN}.
272: Networking issues such as sensor scheduling, quality of service, and
273: energy efficiency may be found in \cite{ZhaoST-2006-IBSPNSN} and
274: references therein.
275: 
276: The rest of this paper is organized as follows. The main problem
277: description with all the associated technical modeling assumptions is
278: presented in \secref{sec:problemsetup}. The main technical results of
279: this paper are then crisply summarized and their implications are
280: discussed in \secref{sec:results}.  \secref{sec:ourscheme} describes
281: the proposed constructive distributed coding and field reconstruction
282: scheme and the analysis of MSE performance which leads to the
283: technical results of \secref{sec:results}. For completeness, in
284: Section~\ref{sec:deploymentStrats} we also briefly discuss sensor
285: deployment issues but this is not the focus of this work. In
286: \secref{sec:prevresults}, we discuss the close connections between the
287: work in \cite{Masry-IT1981-RASFS}, \cite{Luo-IT2005-UDEBCSN}, and the
288: present work, and establish the fundamental statistical and
289: performance equivalence of the core techniques in these studies. We
290: also discuss how the scenario of arbitrary unbounded noise and
291: threshold distributions can be accommodated when the statistics are
292: known.  We conclude in \secref{sec:conclusions} by summarizing the
293: main findings of this work. The proofs of two main results are
294: presented in the appendices.
295: % and comment on ongoing work and future research directions.
296: 
297: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
298: \Section{\label{sec:problemsetup}Distributed Field Reconstruction
299: Sensor--network (DFRS) Setup}
300: 
301: \begin{figure*}
302: \centering
303: \includegraphics[width=7.0in]{Figs/MainFig.eps}
304: \caption{\label{fig:probSetup} {\bf Block diagram of a distributed
305: field reconstruction sensor--network using randomized $1$--bit SQ
306: with block--coding.}  Sensor $i$ quantizes its noisy observations,
307: $Y_{i1}, \ldots, Y_{iT}$, to the binary values $B_{i1}, \ldots,
308: B_{iT}$. The sensor then generates the message $M_i \in \{1, \ldots,
309: 2^{rT}\}$ based on these quantized values. These messages $\{M_i\}$
310: are then relayed to the fusion center where the field estimates
311: $\widehat{S}_t$ are produced.}
312: \end{figure*}
313: 
314: \subsection{\label{sec:fieldModel}Field Model}
315: 
316: We consider a sequence of $T$ discrete--time snapshots of a
317: spatio--temporal field.\footnote{If the spatio--temporal field is
318: temporally bandlimited then the field values at intermediate time
319: points can be interpolated from the estimates at discrete time
320: snapshots if the temporal sampling rate is (strictly) higher than
321: the temporal Nyquist rate of the field. The associated MSE will be
322: no larger than the maximum MSE of the estimates across the
323: discrete--time snapshots times a proportionality constant.} Each
324: snapshot is modeled as a continuous\footnote{More generally, our
325: results can be extended to arbitrary, amplitude--bounded, measurable
326: functions. For such functions the pointwise MSE bounds given in
327: \secref{sec:MSEanalysis} still hold. The estimates at the points of
328: continuity will have MSE tending to 0 as the network size scales.
329: However, the points of discontinuity may have a finite, but
330: non--zero MSE floor.} bounded function,
331: \[
332: s_t:G \rightarrow \rR:\ \forall x \in G,\ \forall t \in
333: \{1,\ldots,T\},\ |s_t(x)| \leq a < +\infty,
334: \]
335: where $G \subseteq \rR^d$ is a known geographical region of interest
336: in $d$--dimensional real space and $a$ is a known bound on the
337: maximum field dynamic range. Although the results of this paper hold
338: for any $G$ which is bounded and is the closure of its nonempty
339: interior, for simplicity and clarity of exposition, we will assume
340: $G = [0,1]^d$, the $d$--dimensional unit--hypercube, in the sequel.
341: Distances are measured with respect to a norm\footnote{For
342: asymptotic results in which distance $\longrightarrow 0$, any norm
343: on $\rR^d$ would suffice since all norms on any finite--dimensional
344: Banach space are equivalent \cite[Theorem~23.6,
345: p.~177]{AliprantisB-AP90-PRA}.} $\|\cdot\|$, which for this work
346: will be assumed to be the Euclidean $2$--norm. Since the fields are
347: continuous functions on the compact set $G$, they are in fact
348: uniformly continuous on $G$ \cite{AliprantisB-AP90-PRA}.
349: 
350: Results on the fidelity of the field reconstruction will be
351: described in terms of the local and global moduli of continuity
352: associated with the field:
353: 
354: \begin{definition}\label{def:localMod}\emph{(Local modulus of
355: continuity)} The local modulus of continuity $\omega_t:[0,\infty)
356: \times G \rightarrow [0,\infty)$ of the function $s_t(x)$ at the
357: point $x \in G$ is defined as
358: \[
359: \omega_t(\delta,x) \triangleq \sup_{\{x^\prime \in G:\|x-x^\prime\|
360: \leq \delta\}} |s_t(x) - s_t(x^\prime)|.
361: \]
362: Note that for all $x \in G$, $\omega_t(\delta,x)$ is a nondecreasing
363: function of $\delta$ and that it $\longrightarrow 0$ as $\delta
364: \longrightarrow 0$ since $s_t(x)$ is continuous at each point $x$ in
365: $G$.
366: \end{definition}
367: 
368: \begin{definition}\label{def:globalMod} \emph{(Global modulus of continuity)}
369: The global modulus of continuity $\widetilde{\omega}_t: [0,\infty)
370: \rightarrow [0,\infty)$ of the function $s_t(x)$ is defined as
371: \[
372: \widetilde{\omega}_t(\delta) \triangleq \sup_{x \in G}
373: \omega_t(\delta,x).
374: \]
375: Again note that $\widetilde{\omega}_t(\delta)$ is a nondecreasing
376: function of $\delta$ and that it $\longrightarrow 0$ as $\delta
377: \longrightarrow 0$ since $s_t(x)$ is uniformly continuous over $G$.
378: \end{definition}
379: 
380: The global and local moduli of continuity of a spatial field
381: respectively reflect the degree of global and local spatial
382: smoothness of the field with smaller values, for a fixed value of
383: $\delta$, corresponding to greater smoothness. For example, for a
384: spatially constant field, that is, for all $x\in G$, $s_t(x) = s_t
385: \text{ (a constant)}$, we have $\widetilde{\omega}_t(\delta) = 0$
386: for all $\delta \geq 0$. For $d=1$ and fields with a uniformly
387: bounded derivative, that is, for all $x \in G$, $\sup_{x\in
388: G}|d(s_t(x))/dx| = \Delta < +\infty$, $\widetilde{\omega}_t(\delta)
389: \leq \Delta \cdot \delta$. More generally, for a Lipschitz--$\gamma$
390: spatial function (see \cite{MasryC-IT1981-BPCNT}) $s_t(x)$, we have
391: $\widetilde{\omega}_t(\delta) \propto \delta^\gamma$. Closed--form
392: analytical expressions of moduli of continuity may not be available
393: for arbitrary fields but bounds often are. Sometimes bounds that are
394: tight in the limit as $\delta \longrightarrow 0$ are also available.
395: From Definitions~\ref{def:localMod}, \ref{def:globalMod}, and the
396: boundedness of the field dynamic range, it also follows that for all
397: $\delta \geq 0$, for all $x \in G$, and for all $t \in
398: \{1,\ldots,T\}$, we have
399: \begin{eqnarray*}
400: %% The following label has not been referenced anywhere!
401: %\label{eqn:modulusBounds}
402: 0 \leq \omega_t(\delta,x) \leq \widetilde{\omega}_t(\delta) \leq 2a
403: < +\infty.
404: \end{eqnarray*}
405: 
406: \subsection{\label{sec:sensePlace}Sensor Placement}
407: 
408: We assume that we have a dense, noncooperative network of $N$ sensors
409: distributed uniformly over a hypercube partitioning of $G =
410: [0,1]^d$. The space $G = [0,1]^d$ is uniformly partitioned into $L =
411: l^d$ (where $l$ is an integer) disjoint, hypercube supercells of
412: side--length $(1/l)$. Each supercell is then further uniformly
413: partitioned into $M = m^d$ (where $m$ is an integer) hypercube
414: subcells of side--length $(1/(lm))$, giving a total of $LM$
415: subcells. In our distributed field coding and reconstruction scheme,
416: described in \secref{sec:ourscheme}, the field estimate for each
417: snapshot is constant over each supercell and is formed by averaging
418: the measurements from a partial set of the sensors determined by the
419: subcells. This field reconstruction scheme requires knowledge of the
420: sensor locations only up to supercell (not subcell)
421: membership. Therefore, it has some natural robustness against sensor
422: location uncertainty or error. The significance of the super and
423: subcells will become clear in the sequel (Sections~\ref{sec:results}
424: and \ref{sec:ourscheme}).
425: %Issues related to randomness in sensor deployment are discussed in
426: %\secref{sec:deploymentStrats}.
427: 
428: We assume that the sensor deployment mechanism is able to uniformly
429: distribute the sensors over the subcells. We define this uniform
430: sensor deployment condition with:
431: 
432: \begin{definition}\label{def:UnifPlacement} \emph{(Uniform sensor deployment)}
433: We say that a sensor deployment method is uniform if exactly $n
434: \triangleq (N/(LM))$ sensors are located in each subcell.
435: \end{definition}
436: 
437: \defref{def:UnifPlacement} describes ideal sensor deployment
438: conditions and can be achieved by locating the sensors over a uniform
439: grid. However, precise control of sensor locations may not be possible
440: in practice.
441: %Since incorporating uncertainty in the deployment models will not
442: %drastically change performance results and
443: Since we are not primarily concerned about the details of deployment,
444: we defer discussion of such issues to \secref{sec:deploymentStrats},
445: where we introduce a stochastic deployment model in order to capture
446: the uncertainty of realistic deployment mechanisms. In
447: \secref{sec:deploymentStrats}, we show that this deployment method
448: satisfies a relaxed version of \defref{def:UnifPlacement}, the
449: asymptotic nearly uniform deployment condition given by
450: \defref{def:NearUnifPlacement}, which does not significantly change
451: the estimator performance.
452: 
453: For clarity of presentation, we will assume that the deployment scheme
454: being used satisfies the uniform sensor deployment condition given in
455: \defref{def:UnifPlacement}. We also assume that each sensor is aware
456: of which subcell it is in. \figref{fig:deployment} illustrates the
457: cell hierarchy and an example sensor deployment for the $d = 2$
458: dimensional case.
459: 
460: \begin{figure}
461: \centering
462: \includegraphics[width=3.0in]{Figs/SensorPlace.eps}
463: \caption{{\bf Example uniform sensor deployment and cell hierarchy
464: over $[0,1]^2$ ($d = 2$).} Here, $N = 864$ sensors are deployed over
465: $L = 4^2$ supercells of side--length $(1/4)$ and $M = 3^2$ subcells
466: per supercell of side--length $(1/(3 \cdot 4))$, resulting in $6$
467: sensors per subcell.} \label{fig:deployment}
468: \end{figure}
469: 
470: \subsection{Sensor Observation and Coding Models}
471: 
472: \subsubsection{\label{sec:obsModel}Sensor Observation Noise}
473: 
474: The sensor observations are corrupted by bounded, zero--mean
475: additive noise which is independent across sensors, but can be
476: arbitrarily correlated across field snapshots\footnote{The
477: measurement snapshot timers of all the participating sensors are
478: assumed to be synchronized.}. Let $Z_{it}$ denote the noise
479: affecting the observation of the $t^{\mathrm{th}}$ snapshot by the
480: $i^{\mathrm{th}}$ sensor, and define the $\mathbf{Z} \triangleq
481: \{Z_{it}\}_{i=1,t=1}^{N,T}$ (the collection of all of the noise
482: random variables) and $\mathbf{Z}_i \triangleq \{Z_{it}\}_{t=1}^{T}$
483: (the collection of all of the noise random variables for a given
484: sensor $i$). The noise $\mathbf{Z}$ has an unknown joint cumulative
485: distribution function (cdf) $F_{\mathbf{Z}}(\mathbf{z})$ that can be
486: arbitrary within the zero--mean, boundedness and independence
487: constraints already stated. The maximum dynamic range of the noise
488: $b \in [0,+\infty)$ is known. The noisy observation of field
489: snapshot $t \in \{1, \ldots, T \}$ made by sensor $i \in \{1,
490: \ldots, N\}$ is given by
491: \[
492: Y_{it} = s_t(x_i) + Z_{it},
493: \]
494: where $x_i$ is the location of the $i^\mathrm{th}$ sensor and
495: $\mathbf{Z} \sim \mbox{cdf } F_{\mathbf{Z}}(\mathbf{z})$. We use
496: $\mathcal{F}$ to denote the set of all joint cdfs that are
497: factorizable into $N$ zero--mean joint cdfs on $\rR^T$ with support
498: within $[-b,+b]^T$, that is, $F_{\mathbf{Z}}(\mathbf{z}) =
499: \prod_{i=1}^N F_{\mathbf{Z_i}}(\mathbf{z_i})$ where
500: $F_{\mathbf{Z_i}}(\mathbf{z_i})$ is a zero--mean joint cdf
501: (corresponding to the noise random variables for sensor $i$) with
502: support within $[-b,+b]^T$. Note that $\mathcal{F}$ captures the
503: feasible set of joint noise cdfs for the bounded--amplitude,
504: zero--mean, and independence assumptions. Also note that $|Y_{it}|
505: \leq |s_t(x_i)| + |Z_{it}| \leq c \triangleq (a+b)$.
506: 
507: \subsubsection{\label{sec:1bitSQ}Randomized $1$--bit SQ with Block Coding}
508: 
509: Due to severe precision and reliability limitations, each sensor $i
510: \in \{1, \ldots, N\}$, has access to only to a vector of unreliable
511: binary quantized samples $\mathbf{B}_i \triangleq (B_{i1}, \ldots,
512: B_{iT})$ for processing and coding and not direct access to the
513: real--valued noisy observations $Y_{i1}, \ldots, Y_{iT}$. The
514: quantized binary sample $B_{it}$ is generated from the corresponding
515: noisy observation $Y_{it}$ through a randomized mapping $Q_{it}:
516: [-c,c] \rightarrow \{0,1\}$: for each $i \in \{1, \ldots, N\}$ and
517: each $t \in \{1, \ldots, T \}$,
518: \[
519: B_{it} = Q_{it}(Y_{it}),
520: \]
521: where we assume that the mappings $Q_{it}$ are independent across
522: sensors $i$, but can be arbitrarily correlated across snapshots $t$.
523: We denote the conditional marginal statistics of the quantized
524: samples by $p_{B_{it}|Y_{it}}(y) \triangleq \pP(B_{it} = 1|Y_{it} =
525: y)$. We are specifically interested in cases where
526: $p_{B_{it}|Y_{it}}(y)$ is an affine function of $y$ since it allows
527: estimates of the fields to be made from the $B_{it}$'s without
528: knowledge of the noise distribution (see \appref{app:MSEperfProof}).
529: Specifically we consider the conditional distribution
530: \[
531: p_{B_{it}|Y_{it}}(y) = \left(\frac{y + c}{2c}\right).
532: \]
533: 
534: This conditional distribution can be achieved by a quantization
535: method which is based on comparing the noisy observation with a
536: random uniformly distributed threshold given by
537: \begin{equation}\label{eqn:ourQFunc}
538: B_{it} = Q_{it}^{Th}(Y_{it}) \triangleq \mathbf{1}(Y_{it} > R_{it}),
539: \end{equation}
540: where the $R_{it}$'s are $\mathrm{Unif}[-c,c]$ random thresholds
541: which are independent across sensors $i$, but arbitrarily correlated
542: across snapshots $t$, and $\mathbf{1}(\cdot)$ denotes the indicator
543: function:
544: \[
545: \mathbf{1}(Y_{it} > R_{it}) =
546: \begin{cases} 1 & \mbox{if } Y_{it} > R_{it}, \\
547: 0 & \mbox{otherwise}.
548: \end{cases}
549: \]
550: 
551: This uniform random--threshold $1$--bit SQ model partially accounts
552: for some practical scenarios that include (i) comparators with a
553: floating threshold voltage, (ii) substantial variation of preset
554: comparator thresholds accompanying the mass--manufacture of
555: low--precision sensors, (iii) significant environmental fluctuations
556: that affect the precision of the comparator hardware, or generally
557: (iv) unreliable comparators with considerable sensing noise and
558: jitter. An alternative justification is that the random thresholds
559: are intentionally inserted as a random dither. Scenario (i) can be
560: accommodated by independence across snapshots, scenario (ii) can be
561: accommodated by complete correlation (fixed) across snapshots, and
562: scenarios (iii) and (iv) can be accommodated by arbitrary
563: correlation across snapshots.
564: 
565: \begin{figure}[!htb]
566: \centering
567: \includegraphics[width=2.5in]{Figs/quantizer.eps}
568: \caption{\label{fig:QthModel}{\bf Quantizer hardware example.} The
569: sensing model described by the $Q_{it}^{Th}(\cdot)$ function in
570: \eqnref{eqn:ourQFunc} can be implemented by a comparator with a
571: uniformly distributed threshold. These thresholds are independent
572: across sensors, but arbitrarily correlated across snapshots,
573: allowing many scenarios to be accommodated.}
574: \end{figure}
575: 
576: Each sensor $i$ utilizes a block encoder to ``compress'' its vector
577: of $T$ quantized samples $\mathbf{B}_i$ to a message $M_i \in \{1,
578: 2, \ldots, 2^{rT}\}$ before transmitting to the fusion center.  The
579: block encoder and message for sensor $i$ are given by
580: \[
581: f_i:\{0,1\}^T \rightarrow \{1, 2, \ldots, 2^{rT}\}, \quad M_i =
582: f_i(B_{i1}, \ldots, B_{iT}),
583: \]
584: where $r$ is the coding rate in bits per sensor per snapshot. For $r
585: \geq 1$ compression is trivial since $\mathbf{B}_i$ can assume no
586: more than $2^T$ distinct values which can be indexed using $T$~bits.
587: 
588: \subsection{\label{sec:transModel}Transmission and Field Reconstruction}
589: 
590: In this work, a data fusion center is any point of data aggregation
591: and/or processing in the sensor network and can be real or virtual.
592: For instance, sensors can be dynamically organized into clusters with
593: different sensors assuming the role of a fusion center at different
594: times \cite{ChouPR-Asilomar2002-TECDSN}.  To conform with the existing
595: base of digital communication architectures, our problem setup
596: abstracts the underlying transmission network of sensors effectively
597: as a network of bit pipes. These bit pipes are capable of reliably
598: delivering these $N$ messages (the payloads) and the network addresses
599: of the message origination nodes (the headers) to the fusion
600: center. This enables the fusion center to correctly associate the
601: spatial location information with the corresponding sensor
602: field--measurement information for reliable field reconstruction. In
603: practice, sensor data can be moved to the fusion center through a
604: variety of physical--layer transport mechanisms, example, a stationary
605: base--station with directional antenna arrays, a mobile data
606: collector, and passive sensor querying mechanisms involving, for
607: instance, laser--beams and modulating mirrors
608: \cite{KahnKP-MOBICOM99-NCCMNSD}.
609: 
610: Separating the distributed field reconstruction problem into efficient
611: data acquisition and efficient data transport parts through a
612: finite--rate reliable bit--pipe abstraction may be suboptimal
613: \cite[p.~449]{CoverJ-1991-EoIT}, \cite{GastparV-2003-SCCSN,
614: GastparRV-2003-TCNCLSCCR}. For instance, in some scenarios multihop
615: communication is not needed and the characteristics of the field, the
616: communication channel, and the distortion--metric are ``matched'' to
617: one another. In such a scenario, uncoded ``analog'' transmission can
618: offer huge performance gains if the synchronization of sensor
619: transmissions can be orchestrated at the physical layer to achieve
620: beamforming gains and the network channel state information is
621: available to the transmitting sensors \cite{GastparV-2003-SCCSN}.
622: Certain aspects of this analog transmission can be incorporated within
623: our field reconstruction framework and is briefly discussed in the
624: remark just before Section~\ref{sec:deploymentStrats}.
625: 
626: For our reconstruction scheme, described in \secref{sec:ourscheme},
627: the fusion center only needs to be able to spatially localize the
628: origin of each message to within the supercell resolution.
629: % differentiate which supercell each message originated from.
630: This can be achieved by having each sensor append a $\log(LM)$ bits
631: long label to its message. This results in a total sensor--location
632: rate--overhead of $r_{ohd} = (N/T)\log(LM)$ bits per snapshot on the
633: network information transport costs. This overhead will be negligible
634: if $T \gg N\log(LM)$. If the underlying sequence of fields are
635: spatially constant, then, the sensor location information is not
636: needed at the fusion center (see \corref{cor:constFieldCase} and
637: \secref{sec:ourscheme}).
638: 
639: The fusion center forms the estimates of the $T$ fields based on the
640: sensor messages using the reconstruction functions
641: \[
642: g_t: G \times \{1, 2, \ldots, 2^{rT}\}^N \rightarrow [-a,a], \
643: \forall t \in \{1, \ldots, T\}.
644: \]
645: The estimate of field $t$ at point $x \in G$ is denoted by
646: \[
647: \widehat{S}_t(x) = g_t(x, M_1, \ldots, M_N).
648: \]
649: \begin{definition}\emph{(Rate--$r$ DFRS)}
650: A rate--$r$ DFRS based on randomized $1$--bit SQ with block coding
651: is defined by the set of rate--$r$ encoder functions
652: $\{f_i(\cdot)\}_{i=1}^N$ and the set of reconstruction functions
653: $\{g_t(\cdot)\}_{t=1}^T$.
654: \end{definition}
655: 
656: Figure~\ref{fig:probSetup} depicts a rate--$r$ DFRS using randomized
657: $1$--bit SQ with block coding.
658: 
659: \subsubsection{\label{sec:perfCriteria}Performance Criterion}
660: 
661: \begin{definition} \emph{(Pointwise MSE)}
662: The pointwise MSE of the estimate of field $t$ at location $x \in
663: G$, for a given rate--$r$ DFRS and a specific noise joint cdf
664: $F_\mathbf{Z}(\mathbf{z}) \in \mathcal{F}$, is given by
665: \[
666: D_t(x;F_\mathbf{Z}) = \eE[(\widehat{S}_t(x) - s_t(x))^2].
667: \]
668: \end{definition}
669: 
670: Since we are interested in schemes that will work for {\em any}
671: noise cdf in $\mathcal{F}$, we consider the worst--case
672: $D_t(x;F_\mathbf{Z})$ over all possible $F_\mathbf{Z} \in
673: \mathcal{F}$. We also consider the maximization over all fields and
674: all locations in $G$ since we want to reconstruct every point of
675: every field with high fidelity.
676: \begin{definition}\label{def:worsecaseMSE} \emph{(Worst--case MSE)}
677: The worst--case MSE $D$ is given by
678: \[
679: D = \max_{t \in \{1, \ldots, T\}} \sup_{x \in G} \sup_{F_\mathbf{Z}
680: \in \mathcal{F}} D_t(x;F_\mathbf{Z}).
681: \]
682: \end{definition}
683: 
684: Our objective is to understand the scaling behavior of MSE with $N$,
685: $T$, and $r$. The next section summarizes our partial results in
686: this direction.
687: 
688: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
689: \Section{\label{sec:results}Main Results}
690: 
691: \subsection{\label{sec:MSEanalysis}Achievable MSE Performance}
692: 
693: Our first result gives an upper bound on the MSE achievable through
694: a constructive DFRS based on randomized $1$--bit SQ with block
695: coding for rate $r = 1/M$, where $M$ is the number of subcells per
696: supercell. The actual scheme will be described in
697: \secref{sec:ourscheme}. The MSE analysis appears within the proof of
698: the theorem detailed in \appref{app:MSEperfProof}. This achievable
699: MSE upper bound can be made to decrease to zero as sensor--density
700: goes to infinity (see \eqnref{eqn:LNScaling}) without knowledge of
701: the local or global smoothness properties of the sequence of fields.
702: Furthermore, this scheme is universal in the sense that it does not
703: assume knowledge of $F_\mathbf{Z}(\mathbf{z})$ beyond membership to
704: $\mathcal{F}$.
705: 
706: \begin{theorem}\label{thm:MSEperf} \emph{(Achievable MSE performance:
707: Randomized $1$--bit SQ and $r = 1/M$)} There exists a rate--$r = 1/M$
708: DFRS based on randomized $1$--bit SQ with block coding (e.g., the
709: scheme of \secref{sec:ourscheme}) such that for all $x \in G$, $t \in
710: \{1, \ldots, T\}$, and $F_\mathbf{Z}(\mathbf{z}) \in \mathcal{F}$,
711: \begin{eqnarray*}
712: D_t(x;F_\mathbf{Z}) &\leq&
713: \omega_t^2\left(\frac{\sqrt{d}}{\sqrt[d]{L}} ,x\right) +
714: \left(\frac{LMc^2}{N}\right) \\
715: &\leq&
716: \widetilde{\omega}_t^2\left(\frac{\sqrt{d}}{\sqrt[d]{L}}\right) +
717: \left(\frac{LMc^2}{N}\right).
718: \end{eqnarray*}
719: \end{theorem}
720: 
721: \begin{proof}
722: See Section~\ref{sec:ourscheme} and \appref{app:MSEperfProof}.
723: \end{proof}
724: 
725: Note that Theorem~\ref{thm:MSEperf} holds for arbitrary fields. The
726: modulus of continuity terms in the local (first) and global (second)
727: upper bounds of Theorem~\ref{thm:MSEperf} are due to the bias of the
728: field estimates and the $\left(\frac{LMc^2}{N}\right)$ term is due
729: to the variance of the field estimates (see \eqnref{eqn:1bitreconst}
730: in \secref{sec:ourscheme}). From \thmref{thm:MSEperf} and the
731: properties of moduli of continuity (see \secref{sec:fieldModel}), it
732: follows that for the coding and reconstruction scheme of
733: \secref{sec:ourscheme}, as $N \longrightarrow \infty$, the estimate
734: $\widehat{S}_t(x)$ uniformly converges, in a mean square sense, to
735: $s_t(x)$ for all $x \in G$, provided that
736: \begin{equation} \label{eqn:LNScaling}
737: \mbox{(i) }\left(\frac{N}{L}\right) \longrightarrow \infty, \mbox{
738: and (ii) } L \longrightarrow \infty.
739: \end{equation}
740: It also follows that the worst--case MSE scaling behavior (see
741: Definition~\ref{def:worsecaseMSE}) is bounded by
742: \begin{equation} \label{eqn:MSE-WC-Result}
743: D \leq \max_{t \in \{1, \ldots, T\}} \left\{
744: \widetilde{\omega}_t^2\left(\frac{\sqrt{d}}{\sqrt[d]{L}}\right) +
745: \left(\frac{LMc^2}{N}\right)\right\}
746: \end{equation}
747: and that $D \longrightarrow 0$ as $N$ and $L$ scale as in
748: \eqnref{eqn:LNScaling}.
749: 
750: \noindent {\bf Implications:} These results allow us to make the per
751: sensor per snapshot bit rate $r$, worst--case MSE $D$, and sensor
752: message ID overheads (given by $(N/T) \log(LM)$ bits) simultaneously
753: smaller than any arbitrarily small desired values $r^*, D^*, \epsilon
754: > 0$, respectively. First, we can choose a sufficiently large number
755: of subcells per supercell $M^*$ such that the rate $r = 1/M^* < r^*$.
756: Then we can choose a sufficiently large number of sensors $N^*$ and
757: number of supercells $L^*$ such that the bound on $D$ given by
758: \eqnref{eqn:MSE-WC-Result} is made less than $D^*$. Note that both
759: $N^*$ and $M^*$ can be further increased while keeping the ratio
760: $M^*/N^*$ fixed without changing the bound on $D$. This corresponds to
761: increasing the total number of sensors $N$, decreasing the per sensor
762: rate $r = 1/M$, but keeping the total network per snapshot rate $Nr =
763: N/M$ and distortion $D$ fixed. Finally, we can look at a sufficiently
764: large number of snapshots $T^*$ such that network message overheads
765: $(N^*/T^*) \log(L^*M^*) < \epsilon$.
766: 
767: In the constructive coding and field reconstruction scheme of
768: \secref{sec:ourscheme}, the field estimates are piecewise constant
769: over the supercells. The estimate in each supercell is formed from
770: only $n = (N/(LM))$ of the $Mn = (N/L)$ quantized observed values
771: coming from the sensors located in that supercell. Since only
772: $(1/M)$ of the total available quantized observed values for each
773: snapshot are used, the transmission rate of $(1/M)$ is achievable by
774: indexing only the necessary values (see \secref{sec:ourscheme} for
775: details). As the number of supercells $L$ increases, the piecewise
776: constant estimate becomes finer and the bias is decreased. Also, as
777: the number of sensors per supercell is increased, more observations
778: are used thus decreasing the variance of the estimate.
779: 
780: Since the variance term $\frac{LMc^2}{N}$ in the upper bound of
781: Theorem~\ref{thm:MSEperf} can decrease no faster than $O(1/N)$, the
782: decay of the global MSE upper bound, in the proposed constructive
783: scheme, can be no faster than $O(1/N)$. However, the decay rate of
784: $\frac{LMc^2}{N}$ is hindered by the fact that $L$ simultaneously
785: needs to approach infinity for the bias term
786: $\widetilde{\omega}_t^2\left(\frac{\sqrt{d}}{\sqrt[d]{L}}\right)$ to
787: decay to $0$. When $\widetilde{\omega}_t(\cdot)$ is not identically
788: zero, a bias--variance tradeoff exists and the appropriate relative
789: growth rate for $L$ with $N$ that minimizes the decay rate of the
790: global MSE upper bound of Theorem~\ref{thm:MSEperf} is determined by
791: the following condition
792: \[
793: \widetilde{\omega}_t^2\left(\frac{\sqrt{d}}{\sqrt[d]{L}}\right) =
794: \Theta\left(\frac{L}{N}\right).
795: \]
796: For certain classes of signals for which the global modulus of
797: continuity has a closed form, the optimum growth rate can be
798: explicitly determined. For instance, if $d=1$ and
799: $\widetilde{\omega}_t(\delta) = \Delta \cdot \delta$ (Lipschitz--$1$
800: fields), $L_{opt}(N) = \Theta(N^{1/3})$ for which $\mathrm{MSE} =
801: O(N^{-2/3})$.
802: 
803: \begin{corollary}\label{cor:constFieldCase} \emph{(Achievable MSE
804: performance: Randomized $1$--bit SQ, $r = 1/M$, and constant
805: fields)} If for all $x \in G$ and all $t \in \{1, \ldots, T\}$, we
806: have $s_t(x) = s_t$, or equivalently, for all $\delta \geq 0$ and
807: all $t \in \{1, \ldots, T\}$, $\widetilde{\omega}_t(\delta) = 0$,
808: then the result given by \eqnref{eqn:MSE-WC-Result} reduces to
809: \[
810: D \leq \left(\frac{Mc^2}{N}\right),
811: \]
812: where we can set $L = 1$ to minimize the bound.
813: \end{corollary}
814: Only $L = 1$ supercell is needed for an accurate piecewise constant
815: reconstruction of a constant field. Furthermore, all
816: snapshot--estimates given by the scheme from \secref{sec:ourscheme}
817: are unbiased in this case. Also, the spatial locations of sensors are
818: irrelevant: the MSE behavior is governed purely by the number of
819: sensors $N$ regardless of how they are distributed over the
820: subcells. The $N$ sensors must still be uniformly assigned to one of
821: $M$ groups (for the purpose of transmission coordination to achieve
822: the compression factor of $1/M$), however these groups need not have
823: any geographical significance.
824: 
825: The MSE results given by \thmref{thm:MSEperf} show that the field
826: snapshot estimates converge uniformly in MSE and upper bound the MSE
827: decay rate. Every point of every estimate, in fact, converges almost
828: surely to the true value. We also state a central limit theorem
829: (CLT) result regarding the estimation error.
830: 
831: \begin{theorem}\label{thm:ASConv} \emph{(Almost--sure convergence of field
832: estimates)} There exists a rate--$r = 1/M$ DFRS based on randomized
833: $1$--bit SQ with block coding (described in \secref{sec:ourscheme})
834: such that for all $x \in G$, $t \in \{1, \ldots, T\}$, and
835: $F_\mathbf{Z}(\mathbf{z}) \in \mathcal{F}$,
836: \begin{eqnarray*}
837: \widehat{S}_t(x) \xrightarrow{\mathrm{a.s.}} s_t(x),
838: \end{eqnarray*}
839: as $N$ and $L$ scale as given in \eqnref{eqn:LNScaling}.
840: \end{theorem}
841: \begin{proof}
842: See Section~\ref{sec:ourscheme} and \appref{app:ASConvProof}.
843: \end{proof}
844: 
845: \begin{corollary}\label{cor:errorCLT} \emph{(Central limit theorem for
846: estimation errors)} For the rate $r = 1/M$ DFRS of
847: \secref{sec:ourscheme}, the normalized error at point $x \in G$ for
848: the estimate of field snapshot $t \in \{1, \ldots, T\}$, given by
849: \begin{eqnarray*}
850: \frac{\widehat{S}_t(x)-s_t(x)}{\sqrt{\mathrm{var}[\widehat{S}_t(x)-s_t(x)]}},
851: \end{eqnarray*}
852: is asymptotically zero--mean, unit--variance, normal as $N$ and $L$
853: scale as given in \eqnref{eqn:LNScaling}, for any
854: $F_\mathbf{Z}(\mathbf{z}) \in \mathcal{F}$.
855: \end{corollary}
856: \begin{proof}
857: The proof is similar to and follows directly from the proof of
858: Theorem~2.4 in \cite{Masry-IT1981-RASFS}.
859: \end{proof}
860: 
861: \subsection{Order--Optimal Minimax MSE for Constant Fields}
862: \label{sec:converse}
863: 
864: The minimax reconstruction MSE over the class of constant fields is
865: given by
866: \begin{equation*}
867: \inf_{\{g_t\}_{t=1}^{t=T}} \sup_{F_{\mathbf{Z}} \in \mathcal{F}, s_t
868: \in \mathcal{S}} D,
869: \end{equation*}
870: where the infimum is taken over all possible estimators and the
871: supremum is taken over all noise distributions and fields from the
872: class of constant fields which is denoted by $\mathcal{S}$. The
873: achievable MSE result given by \corref{cor:constFieldCase}
874: establishes an upper bound on the minimax reconstruction MSE.
875: \thmref{thm:converse} lower bounds the minimax reconstruction MSE
876: for any rate $r$ DFRS that produces unbiased estimates for the case
877: of spatially constant fields.
878: 
879: \begin{theorem}\label{thm:converse}
880: \emph{(Lower bound on MSE: Unbiased estimators for constant fields)}
881: For a sequence of spatially constant fields and any DFRS which
882: produces unbiased field estimates, there exists a joint cdf
883: $F_{\mathbf{Z}} \in \mathcal{F}$ such that for noise distributed
884: according to $F_{\mathbf{Z}}$ the MSE is lower bounded by
885: \[
886: \eE[(\widehat{S}_t - s_t)^2] \geq \left(\frac{C_t}{N}\right), \quad
887: \text{for all } t \in \{1, \ldots, T\},
888: \]
889: where $C_t$ is finite, non--zero, and does not depend on $N$.
890: Therefore,
891: \begin{equation*}
892: \inf_{\{g_t\}_{t=1}^{t=T}} \sup_{F_{\mathbf{Z}} \in \mathcal{F}, s_t
893: \in \mathcal{S}} D \geq \max_{t \in \{1, \ldots, T\}}
894: \left(\frac{C_t}{N}\right).
895: \end{equation*}
896: \end{theorem}
897: 
898: \begin{proof}
899: Since $\{s_t\} \rightarrow \{Y_{it}\} \rightarrow \{B_{it}\}
900: \rightarrow \{M_i\}$ forms a Markov chain, the estimates based on
901: the sensor messages $\{M_1, \ldots, M_N\}$ cannot have a lower MSE
902: than estimates based on the noisy observations $\{Y_{it}\}$. Let
903: $F_{\mathbf{Z}} \in \mathcal{F}$ be any well--behaved, non--trivial,
904: joint cdf such that the $Z_{it}$ are iid and the conditional
905: probabilities of $Y_{it}$ given the fields satisfy the regularity
906: conditions necessary for the Cram\'{e}r--Rao bound
907: \cite{Kay-1993-FSSPET} to be applied. By the Cram\'{e}r--Rao bound,
908: the MSE of each field estimate based on $\{Y_{it}\}$ is lower
909: bounded by $\frac{C_t}{N}$ where $C_t$ is finite, non--zero, and
910: depends on $F_{\mathbf{Z}}$, but does not depend on $N$. Note that
911: the bound also applies to general randomized $1$--bit SQ functions
912: $Q_{it}(\cdot)$ including those based on uniform random thresholds
913: $Q_{it}^{Th}(\cdot)$ (see \eqnref{eqn:ourQFunc}).
914: \end{proof}
915: 
916: Combining the results of \corref{cor:constFieldCase} and
917: \thmref{thm:converse} establishes that the order--optimal minimax
918: MSE for spatially constant fields is $\Theta(1/N)$ and that the
919: scheme of \secref{sec:ourscheme} achieves this order optimal
920: performance.
921: 
922: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
923: \Section{\label{sec:ourscheme}Proposed Constructive Distributed
924: Coding and Field Reconstruction Scheme}
925: 
926: In this section we present the proposed DFRS scheme that was alluded
927: to in \secref{sec:results}. In this scheme, sensors create the
928: quantized binary samples $\{B_{it}\}$ from their observations
929: $\{Y_{it}\}$ through comparisons with the random thresholds
930: $\{R_{it}\}$, as described in \eqnref{eqn:ourQFunc} of
931: \secref{sec:1bitSQ}. The field estimates are piecewise constant over
932: the supercells, where the estimate formed in each supercell is a
933: function of only $(N/(LM))$ of the $(N/L)$ quantized observed values
934: coming from the sensors located in that supercell. This allows
935: fractional transmission rates of $r = 1/M$ through a simple
936: time--sharing based compression method. Note that there can be
937: uncertainty in the sensor locations, within a degree given by the
938: size of a supercell, at the fusion center, since it is only
939: necessary for the fusion center to know which supercell each sensor
940: is located in.
941: 
942: Each sensor $i$, instead of transmitting all of its $T$ bits (the
943: vector of its binary quantized observations $\mathbf{B}_i = (B_{i1},
944: \ldots, B_{iT})$), transmits only $rT = T/M$ of them and the remaining
945: observations are dropped. Or alternatively, the sensor may sleep and
946: not record the remaining measurements. The two--level hierarchy of
947: supercells and subcells described in \secref{sec:sensePlace} is used
948: in order to properly determine which bits sensors should drop or
949: keep. Within each supercell, each sensor $i$ from subcell $k \in \{1,
950: \ldots, M\}$ communicates only every $M^\mathrm{th}$ bit (offset by
951: $k$), that is $\{B_{i,k+Ml} \}_{l = 0}^{l = (T/M)-1}$.  These $rT$
952: bits can be uniquely represented by the message $M_i \in \{1, \ldots,
953: 2^{rT}\}$ and losslessly communicated to the fusion center. Thus for
954: snapshot $t \in \{1,\ldots,T\}$, only the bits from senors in the
955: $[((t-1) \mbox{ mod } M) + 1]^{\mathrm{th}}$ subcell of each supercell
956: are communicated to the fusion center.  The set of all sensor indices
957: corresponding to the $n = (N/(LM))$ sensors belonging to the $[((t-1)
958: \mbox{ mod } M) + 1]^{\mathrm{th}}$ subcell of supercell $j$ will be
959: denoted by $I(j,t)$.  In other words, this set of indices corresponds
960: to all those sensors which are located in supercell $j$ and are
961: responsible for recording and encoding a bit in the $t$-th snapshot.
962: 
963: For notational simplicity, the reconstruction function
964: $\widehat{S}_t(x) = g_t(x, M_1, \ldots, M_N)$ will be described
965: directly in terms of the available binary quantized observations
966: $B_{it}$\footnote{The set of binary quantized observations for
967: snapshot $t$ which are available at the fusion center is given by
968: $\{B_{it}\}_{\{i \in \cup_{j=1}^L I(j,t)\}}$} and not the encoded
969: messages $\{M_i\}$ which are information equivalent. The
970: reconstruction function $\widehat{S}_t(x)$ is piecewise constant and
971: is described as follows. The field $s_t(x)$ is reconstructed as a
972: constant over each supercell $j$. The constant estimate is given by
973: \begin{eqnarray}
974: \widehat{S}_{tj} &\triangleq& 2c \left[ \frac{1}{n} \sum_{i \in
975: I(j,t)} B_{it} \right] - c, \label{eqn:simpleavg}
976: \end{eqnarray}
977: which is the simple average (shifted and scaled into $[-c,+c]$) of the
978: available quantized binary observations of snapshot $t$ from sensors
979: located in supercell $j$. The overall piecewise--constant estimate for
980: $s_t(x)$ can be then described as
981: \begin{eqnarray}
982: \widehat{S}_t(x) &=& g_t(x, M_1, \ldots, M_N) \nonumber \\
983: &\triangleq& \sum_{j=1}^L \widehat{S}_{tj} \mathbf{1}(x \in
984: \mathcal{X}_j), \label{eqn:1bitreconst}
985: \end{eqnarray}
986: where $\mathcal{X}_j \subseteq [0,1]^d$ is the set of points within
987: the $j^{\mathrm{th}}$ hypercube supercell and $\mathbf{1}(x \in
988: \mathcal{X}_j)$ given by
989: \[
990: \mathbf{1}(x \in \mathcal{X}_j) =
991: \begin{cases} 1 & \mbox{if } x \in \mathcal{X}_j, \\
992: 0 & \mbox{otherwise},
993: \end{cases}
994: \]
995: is the indicator function of the set $\mathcal{X}_j$. Other more
996: sophisticated reconstruction algorithms are possible. For instance,
997: instead of the simple average used in (\ref{eqn:simpleavg}), one may
998: use a weighted average with convex weights, and for the overall
999: reconstruction in (\ref{eqn:1bitreconst}), one may use a
1000: piecewise--linear or other higher--order interpolation algorithms such
1001: as those based on cubic B--splines (see
1002: \cite{Masry-IT1981-RASFS}). The resulting MSE will be of the same
1003: order. We use the former (simple average, piecewise--constant)
1004: reconstruction because its description and analysis is more compact.
1005: \appref{app:MSEperfProof} proves that the MSE of this constructive
1006: coding and reconstruction scheme is upper bounded by the result
1007: described in Theorem~\ref{thm:MSEperf}.
1008: 
1009: \noindent{\bf Remark:} As discussed at the beginning of
1010: Section~\ref{sec:transModel}, physical--layer network data transport
1011: issues are not the focus of this work. However, if synchronized analog
1012: beamforming from the sensors within each subcell to the fusion center
1013: can be achieved, then the summation in the reconstruction given by
1014: equation (\ref{eqn:simpleavg}) can be realized directly in the analog
1015: physical layer, by ``adding signals in the air'', using a simple
1016: binary pulse amplitude modulation signaling scheme at each sensor. The
1017: additional estimation error variance due to the receiver amplifier
1018: noise at the fusion center will decrease as $1/n$ by scaling the
1019: sampled received waveform by $1/n$ as in (\ref{eqn:simpleavg}). This
1020: will lead to corresponding achievable power versus distortion
1021: tradeoffs (as opposed to bits versus MSE or sensors versus MSE) which
1022: can be quantified.
1023: 
1024: \subsection{\label{sec:deploymentStrats}{Sensor Deployment Considerations}}
1025: 
1026: The conditions given by \defref{def:UnifPlacement} correspond to
1027: exactly $(N/(LM))$ sensors uniformly falling into each subcell with
1028: probability one for all $N$, $L$, and $M$ (ignoring integer
1029: affects). In the regime of perfect sensor placement control (or when
1030: placement error is negligible compared to the width of the cells),
1031: this condition is trivially realized by locating the sensors on a
1032: uniform grid. However, such precise sensor placement control might not
1033: be achievable in practice. In order to address this issue we introduce
1034: a stochastic sensor deployment model, one that captures an extreme
1035: case of (uncontrollable) sensor placement uncertainty where each
1036: sensor is deployed according to a uniform distribution over
1037: $[0,1]^d$. We also relax the uniform sensor placement to an asymptotic
1038: nearly uniform sensor deployment given by
1039: \defref{def:NearUnifPlacement}.
1040: 
1041: %Another deterministic approach would be to apply the theory of
1042: %low--discrepancy sampling sequences (LDSS) that has been used in
1043: %pseudo--random number generation, quasi--Monte Carlo simulation, and
1044: %computer graphics
1045: %\cite{Niederreiter-SIAM92-RNGQMCM,DrmotaT-Springer97-SDA}. This LDSS
1046: %approach has the benefit that number of sensors can be increased
1047: %without relocating previously deployed sensors, however this method
1048: %requires an over--provisioning of sensors in order to guarantee the
1049: %same number of sensors per cell.
1050: 
1051: %In practical situations, it may not be possible to precisely control
1052: %the placement of the sensors unless the sensors have mobility.  See
1053: %\cite{LeociniRS-2005-AWSDPWAEM} for a study of a simple
1054: %probabilistic sensor placement model. We assume that sensors have
1055: %been scattered in a manner that guarantees the nearly uniform
1056: %placement property, at least for all $L$ sufficiently large, and
1057: %that the fusion center has knowledge of the sensor locations. The
1058: %results of Sections~\ref{sec:MSEanalysis} and~\ref{sec:ourscheme}
1059: %will show how the design parameter $N$ can be picked in relation to
1060: %$L$ so that the field reconstruction MSE goes to zero as $L$
1061: %increases to infinity.
1062: 
1063: \begin{definition}\label{def:NearUnifPlacement}
1064: \emph{(Asymptotic nearly uniform sensor deployment)} We say that a
1065: sensor deployment method is asymptotically nearly uniform with
1066: parameters $(\gamma,\epsilon,N^*)$ if at least $\gamma n \triangleq
1067: \gamma(N/(LM))$ are located in each subcell with probability at
1068: least $1-\epsilon$ for all $N > N^*$, where $\gamma \in (0,1]$
1069: represents the inverse of the ``over--provisioning'' factor for the
1070: number of sensors needed to be deployed.
1071: \end{definition}
1072: 
1073: This relaxation does not significantly impact our results since we
1074: are interested in the asymptotic results (as $L$ and $N$ scale as in
1075: \eqnref{eqn:LNScaling}) where the $\gamma$ and $\epsilon$ parameters
1076: of \defref{def:NearUnifPlacement} can be made negligible. Our
1077: stochastic deployment scheme satisfies this asymptotic nearly
1078: uniform condition given in \defref{def:NearUnifPlacement}, and also
1079: almost surely satisfies the uniform deployment condition given by
1080: \defref{def:UnifPlacement} for $N \longrightarrow \infty$.
1081: 
1082: Consider the scenario where $N$ sensors are deployed iid and uniformly
1083: over $G = [0,1]^d$. The corresponding indices of the subcells (the
1084: total $LM$ subcells can be indexed by an integer from $1$ to $LM$)
1085: that the $N$ sensors fall is denoted by the random sequence
1086: $\mathbf{J} = (J_1, \ldots, J_N)$.
1087: %\[
1088: %\mathbf{J} = (J_1, \ldots, J_N),
1089: %\]
1090: Here, $J_i \sim \mbox{iid } U$, where $U \triangleq (1/(LM), \ldots, 1/(LM))$
1091: %where
1092: %\[
1093: %U \triangleq \left(\frac{1}{LM}, \ldots, \frac{1}{LM}\right),
1094: %\]
1095: %is
1096: is the uniform probability mass function over $LM$ discrete values. We
1097: examine the $N$--type (empirical distribution) $P_{\bf J}^{(N)}$ of
1098: $\mathbf{J}$ in order to examine the level of uniformity in the sensor
1099: deployment. An empirical distribution equal to $U$ corresponds to the
1100: uniform deployment condition of \defref{def:UnifPlacement} being
1101: met. Since the indices are also distributed iid according to $U$, by
1102: the strong law of large numbers, the empirical distribution converges
1103: almost surely to $U$ as $N \longrightarrow \infty$, and thus almost
1104: surely the sensors will be deployed uniformly over the subcells
1105: according to \defref{def:UnifPlacement} as $N \longrightarrow
1106: \infty$.
1107: 
1108: Also, a result from large deviations theory bounds the probability
1109: that the empirical distribution will not be in a
1110: $\delta$--neighborhood of the uniform distribution. This corresponds
1111: to the event where there exists a subcell that has more than
1112: $\frac{N(1+\delta)}{LM}$ or fewer than $\frac{N(1-\delta)}{LM}$
1113: sensors located within it.
1114: % Let $P_{\bf J}^{(N)}$ be the type of ${\bf J}$,
1115: Let $\mathscr{P}^N$ be the set of $N$--dimensional probability
1116: distributions, $U^\delta \triangleq [(1- \delta)/(LM), (1 +
1117: \delta)/(LM)]^N$
1118: %\[
1119: %U^\delta \triangleq \left[\frac{1- \delta}{LM}, \frac{1 +
1120: %\delta}{LM} \right]^N
1121: %\]
1122: be the $\delta$--neighborhood around the uniform probability mass
1123: function $U$, $D(\cdot \| \cdot)$ denote the Kullback--Leibler
1124: distance \cite{CoverJ-1991-EoIT}, and
1125: \[
1126: P^* = \arg \min_{P \in \mathscr{P}^N \setminus U^\delta} D(P \| U)
1127: \]
1128: denote the probability distribution not in $U^\delta$ that is closest
1129: to $U$ in Kullback--Leibler distance.  It should be noted that
1130: $D(P^*\|U) > 0$ for all $\delta > 0$.  Then by Sanov's theorem
1131: \cite[p.~292]{CoverJ-1991-EoIT},
1132: \begin{eqnarray}\label{eqn:sanovBound}
1133: \pP(P_J^N \in \mathscr{P}^N \setminus U^\delta) \leq
1134: (N+1)^{LM}2^{-ND(P^*\|U)} \nonumber \\
1135: = 2^{-N\left(D(P^*\|U)-\frac{LM}{N}\log(N+1)\right)}.
1136: \end{eqnarray}
1137: This inequality bounds the probability that not all subcells have at
1138: least $\frac{N(1-\delta)}{LM}$ sensors within them. This shows that as
1139: long as the number of sensors deployed $N$ grows faster than the
1140: actual number of sensors needed $LM$, then the near uniform deployment
1141: condition will be eventually met. Thus, this determines how many total
1142: sensors $N^* > LM$ need to be deployed in order to satisfy the
1143: asymptotic nearly uniform sampling condition of
1144: \defref{def:NearUnifPlacement} for a given desired $\epsilon$ and for
1145: $\gamma = 1-\delta$.
1146: 
1147: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1148: \Section{\label{sec:prevresults} Discussion of Related One--bit Estimation Problems and Extensions
1149: %Connections to Prior One--bit Estimation Approaches
1150: }
1151: 
1152: \begin{figure*}
1153: \centering
1154: \includegraphics[width=6.0in]{Figs/LuoCompare.eps}
1155: \caption{\label{fig:compareImp} {\bf The $Q_{it}^{Th}(\cdot)$ function
1156: in \eqnref{eqn:ourQFunc} and the $Q(\cdot)$ function of
1157: \cite{Luo-IT2005-UDEBCSN} suggest markedly different hardware
1158: implementations.} The former naturally suggests (a), where the binary
1159: quantized value is produced by a simple comparison to a random
1160: threshold $X$. The latter suggests (b), where an arbitrarily--precise
1161: ADC circuitry probabilistically selects an arbitrary bit of the
1162: observed value. Interestingly, these two implementations produce
1163: statistically equivalent quantized outputs $B$ given identical inputs
1164: $Y$.}
1165: \end{figure*}
1166: 
1167: This section discusses the connections between the methods and results
1168: in \cite{Masry-IT1981-RASFS}, \cite{Luo-IT2005-UDEBCSN}, and the
1169: present work. It is shown that the apparently different randomized
1170: $1$--bit field estimation schemes in these studies are in fact
1171: statistically and MSE performance equivalent. We also address how, in
1172: the scenario of known noise statistics, unbounded noise distributions
1173: and arbitrary threshold distributions can be accommodated. The general
1174: framework of the present work integrates the desirable field sensing
1175: and reconstruction properties and insights of the earlier studies and
1176: provides a unified view of the problem that simultaneously considers
1177: unreliable binary quantization, unknown arbitrary noise distributions,
1178: multiple snapshots of a temporally and spatially varying field, and
1179: communication rate issues. Since the work in both
1180: \cite{Masry-IT1981-RASFS} and \cite{Luo-IT2005-UDEBCSN} deal with the
1181: reconstruction of only a single snapshot ($T = 1$), we will drop the
1182: snapshot indices $t$ in our discussion to aid comparison.
1183: 
1184: \subsection{\label{sec:Masry}One--Bit Randomized--Dithering}
1185: 
1186: %\footnote{The quantization in \cite{Masry-IT1981-RASFS} actually
1187: %uses the sign of the sum of the ``threshold'' and the sample, which
1188: %is operationally equivalent to making a direct comparison to the
1189: %negation of the threshold.}
1190: 
1191: The problem setup of \cite{Masry-IT1981-RASFS} may be viewed as the
1192: reconstruction of a single snapshot ($T = 1$) of a bounded,
1193: one--dimensional field ($d = 1$) from noiseless samples ($Z_i = 0$) at
1194: uniformly spaced (deterministic) sampling locations ($x_i = i/N$).  In
1195: \cite{Masry-IT1981-RASFS} the noiseless observations are binary
1196: quantized using random thresholds $R_i$s that have a known general
1197: distribution which satisfies certain technical conditions described in
1198: \cite[Section~II.A]{Masry-IT1981-RASFS}. These technical conditions
1199: include the uniform distribution (considered in this paper) as a
1200: special case. An important conceptual difference is that in
1201: \cite{Masry-IT1981-RASFS} the sensor quantization noise is viewed as a
1202: randomized dither signal which is intentionally added to the
1203: observations and that the dither cdf is known (it need not be
1204: uniform). The reconstruction explicitly exploits the knowledge of the
1205: dither statistics. Specifically, the noiseless observation $Y_i$, at
1206: sensor $i$, and the corresponding quantized binary sample $B_i$ become
1207: \begin{eqnarray*}
1208: Y_i &=& s(x_i), \\
1209: B_i &=& Q(Y_i) \triangleq \mathrm{sgn}(Y_i + X_i),
1210: \end{eqnarray*}
1211: where $X_i$ is iid dithering noise with a known distribution
1212: $p_X(\cdot)$ which satisfies certain technical assumptions as given
1213: in \cite[Section~II.A]{Masry-IT1981-RASFS}. Note that taking the
1214: sign of the sum of the observation and random dither $X_i$ is
1215: equivalent to comparing with the threshold $-X_i$. Thus the
1216: quantization function $Q(\cdot)$ of \cite{Masry-IT1981-RASFS} is
1217: equivalent\footnote{The sign function maps to $\{-1,+1\}$ whereas a
1218: threshold comparator maps to \{0,1\}. However, the replacement of
1219: the $-1$ symbol with the 0 symbol is unimportant from an estimation
1220: viewpoint.} to a comparator with a random threshold that is
1221: distributed according to $p_X(-x)$. The quantization function
1222: $Q_{it}^{Th}(\cdot)$ in \eqnref{eqn:ourQFunc} can be viewed as a
1223: special case of this where $p_X(-x)$ is the uniform distribution
1224: over $[-c,c]$.  The constructive scheme of \secref{sec:ourscheme}
1225: and the analysis of this work shows that $Q_{it}^{Th}(\cdot)$ can in
1226: fact be used even on noisy field observations with an additive noise
1227: of {\em unknown} distribution.
1228: 
1229: \subsection{\label{sec:LuoScheme}Parameter Estimation with One--Bit Messages}
1230: 
1231: The parameter estimation problem in \cite{Luo-IT2005-UDEBCSN}
1232: corresponds to the special case of a spatially constant field
1233: ($s(x_i) = s$ for all $i$ where the index $t$ is omitted since
1234: $T=1$) which is addressed by Corollary~\ref{cor:constFieldCase}. We
1235: summarize below the key features of the randomized binary quantizer
1236: proposed in \cite{Luo-IT2005-UDEBCSN} and show that the randomized
1237: $1$--bit SQ function $Q(\cdot)$ of \cite{Luo-IT2005-UDEBCSN} is
1238: statistically and MSE performance--wise equivalent to the uniform
1239: random threshold quantizer $Q_{it}^{Th}(\cdot)$ in
1240: \eqnref{eqn:ourQFunc}. However, the $Q(\cdot)$ function of
1241: \cite{Luo-IT2005-UDEBCSN} implicitly requires sensors of arbitrarily
1242: high precision, a property that is undesirable for sensor hardware
1243: implementations.
1244: 
1245: In \cite{Luo-IT2005-UDEBCSN}, each sensor $i$ first shifts and
1246: scales it observation $Y_i$ into interval $[0,1]$ creating the value
1247: $\widetilde{Y}_i \triangleq \left(\frac{Y_i + c}{2c}\right)$. Next,
1248: each sensor $i$ generates an auxiliary random variable $\alpha_i$,
1249: which is iid across sensors and is geometrically distributed over
1250: the set of all positive integers: $\pP(\alpha_i = j) = 2^{-j}$ for
1251: all $j \in \{1,2,3,\ldots,\infty\}$. The final quantized binary
1252: sample $B_i$ reported by sensor $i$ is given by the
1253: $\alpha_i^\text{th}$ bit in the binary expansion of
1254: $\widetilde{Y}_i$:
1255: \begin{eqnarray}
1256: B_i &=& Q(Y_i) \triangleq B(\widetilde{Y}_i,\alpha_i), \nonumber \\
1257: &&\mbox{where } \widetilde{Y}_i = \sum_{j=1}^{\infty}
1258: B(\widetilde{Y}_i,j) 2^{-j}. \label{eqn:LuoQFunc}
1259: \end{eqnarray}
1260: Here, $B(\widetilde{Y}_i,j)$ denotes the $j^\text{th}$ bit of
1261: $\widetilde{Y}_i$. For example, if $\widetilde{Y}_i = 0.375$, then
1262: the first four bits of its binary expansion are given by
1263: $B(\widetilde{Y}_i,1) = 0$, $B(\widetilde{Y}_i,2) = 1$,
1264: $B(\widetilde{Y}_i,3) = 1$, and $B(\widetilde{Y}_i,4) = 0$. If
1265: $\alpha_i = 3$, then sensor $i$ reports $B_i = 1$.  This method for
1266: generating binary sensor messages requires sensors to have the
1267: operational ability to quantize an observed real number (the
1268: normalized values $\widetilde{Y}_{i}$) to an arbitrarily high
1269: bit--resolution. Note that the binary values $B_i$ are iid across
1270: all sensors and that its expected value is given by
1271: \begin{eqnarray} \eE[B_i] &=&
1272: \eE_{\widetilde{Y}_i}[\eE_{B_i}[B_i|\widetilde{Y}_i]] \nonumber \\
1273: &=& \eE_{\widetilde{Y}_i}\left[\sum_{j=1}^{\infty} B(\widetilde{Y}_i,j) 2^{-j}\right] \nonumber \\
1274: &=& \eE_{\widetilde{Y}_i}[\widetilde{Y}_i] \nonumber \\
1275: &=& \eE\left[\frac{Y_i + c}{2c}\right] \label{eqn:LuoMsgCondExp} \\
1276: &=& \frac{\eE[s + Z_i] + c}{2c} = \left(\frac{s + c}{2c}\right).
1277: \label{eqn:LuoMsgExp}
1278: \end{eqnarray}
1279: 
1280: In sharp contrast to the $Q(\cdot)$ function described above, which
1281: requires sensors to have the operational ability to resolve any
1282: arbitrary bit in the binary expansion of their normalized
1283: observations, $Q_{it}^{Th}(\cdot)$ requires only a noisy comparator.
1284: Despite the markedly different operational implementations of
1285: $Q(\cdot)$ and $Q_{it}^{Th}(\cdot)$ (see \eqnref{eqn:LuoQFunc},
1286: \eqnref{eqn:ourQFunc}, and Fig.~\ref{fig:compareImp} which depicts
1287: hardware implementations) they are in fact statistically identical:
1288: the binary quantized values $B_i$ generated by the two schemes have
1289: the same $p_{B_{it}|Y_{it}}(\cdot)$ and $p_{B|s}(\cdot)$ functions
1290: where $p_{B_{it}|Y_{it}}(\cdot)$ is the conditional expectation of
1291: $B_i$ given $Y_i = y_i$ and $p_{B|s}(\cdot)$ is the unconditional
1292: expectation of $B_i$ parameterized by the underlying field value
1293: $s(x_i) = s$. These expectations have been evaluated in
1294: \eqnref{eqn:LuoMsgCondExp}, \eqnref{eqn:LuoMsgExp},
1295: \eqnref{eqn:condExpBinMsg} and \eqnref{eqn:expBinMsg}, and we see
1296: that for both functions
1297: \begin{eqnarray*}
1298: \eE[B_i | Y_i = y_i] = p_{B_{it}|Y_{it}}(y_i) = \left(\frac{y_i +
1299: c}{2c}\right), \text{ and} \\ \eE[B_i] = p_{B|s}(s(x_i)) =
1300: \left(\frac{s(x_i) + c}{2c}\right).
1301: \end{eqnarray*}
1302: This statistical equivalence allows the two quantization functions
1303: $Q(\cdot)$ and $Q_{it}^{Th}(\cdot)$ to be interchanged without
1304: affecting the estimation performance.
1305: 
1306: 
1307: \subsection{Extensions to Unbounded Noise and Arbitrary Thresholds with Known Distributions} In this work, we have made assumptions of
1308: zero--mean, amplitude--bounded, additive noise, which can have an
1309: arbitrary, unknown distribution, and uniformly distributed
1310: quantization thresholds. The results of this work can be extended to
1311: deal with noise that is not amplitude--bounded (i.e.  Gaussian,
1312: Laplacian, etc.)  and for thresholds with arbitrary distributions,
1313: however certain technical conditions must be met and the distributions
1314: for both the noise and the threshold must be known.
1315: %Requiring knowledge of the noise distribution is a sacrifice that must
1316: %be made in order for this extension to work, since in the prequel
1317: %amplitude--boundedness of the noise and uniformly distributed
1318: %thresholds were necessary in order to deal with unknown noise
1319: %distributions.
1320: A possible approach is to combine the noise and threshold random
1321: variables into an overall random dither variable $X_{it} \triangleq
1322: Z_{it} + R_{it}$ and applying the method and results used in
1323: \cite{Masry-IT1981-RASFS} (see \secref{sec:Masry}). The main MSE
1324: result of \thmref{thm:MSEperf} will still hold, however with new
1325: constants multiplying each term in the bound. The method is
1326: essentially the same as in \secref{sec:ourscheme}, however the value
1327: of the field estimate at every point is passed through a non--linear
1328: function, instead of a simple scaling and shifting, given by
1329: \begin{eqnarray*}
1330: g(s) =
1331: \begin{cases}
1332: \mu^{-1}(s) & |s| \leq \mu(a') \\
1333: 0 & \mbox{otherwise}
1334: \end{cases}
1335: \end{eqnarray*}
1336: with $\mu(s) \triangleq 1 - 2P_X(-s)$ where $P_X(\cdot)$ is the cdf of
1337: the dither random variable $X_{it}$. The technical requirement for
1338: this extension is that $P_X(\cdot)$ is absolutely continuous with a
1339: probability density function $p_X(\cdot)$ on $(-\infty,\infty)$ which
1340: is continuous and positive over an open interval $(-a',a')$ containing
1341: $[-a,a]$. This ensures that $P_X(\cdot)$ is strictly monotonically
1342: increasing over the signal dynamic range and that $\mu^{-1}(\cdot)$
1343: exists (see \cite{Masry-IT1981-RASFS}).
1344: 
1345: If the sensor observation noise is unbounded (but the field $s_t(x)$
1346: is still bounded), has zero--mean, and has an {\em unknown} probability
1347: density function (pdf) whose tails decay to zero, it is still possible
1348: to make a weak statement about the achievable MSE as the number of
1349: sensors go to infinity. With unbounded noise, the sensor observations
1350: may exceed any finite dynamic range $[-c,c]$ of the one--bit
1351: sensors. This leads to the appearance of additional error bias terms
1352: (see equation (\ref{eqn:expBinMsg}) and (\ref{eqn:biasResult}) in
1353: Appendix~\ref{app:MSEperfProof}) which depend on the unknown signal
1354: value $s_t(x)$ to be estimated and the dynamic range limit $c$. It can
1355: be shown that these terms go to zero as $c \rightarrow \infty$. Thus
1356: one can assert that for a sufficiently large dynamic range limit $c$
1357: and a corresponding sufficiently large number of sensors $N(c)$, the
1358: MSE can be made smaller than some desired tolerance. The actual
1359: scaling behavior will now also depend on the tail decay rate of the
1360: unknown pdf of the observation noise.
1361: 
1362: 
1363: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1364: \Section{\label{sec:conclusions}Concluding Remarks}
1365: The results of this work show that for the distributed field
1366: reconstruction problem, for every point of continuity of every field
1367: snapshot, it is possible to drive the MSE to zero with increasing
1368: sensor density while ensuring that the per--sensor bitrate and
1369: sensing--related network overhead rate simultaneously go to zero.
1370: This can be achieved with noisy threshold (one--bit) comparators with
1371: the minimal knowledge of signal and noise dynamic ranges provided that
1372: the noise samples are zero--mean, and independent across sensors and
1373: the underlying field, and the sensor placement and sampling schedule
1374: satisfy a certain uniformity property. The rate of decay of MSE with
1375: increasing sensor density is related to the the local and global
1376: smoothness characteristics of the underlying fields and is
1377: order--optimal for the class of spatially constant fields. This work
1378: has further clarified the utility of randomization for signal
1379: acquisition to combat limited sensing precision and unknown noise
1380: statistics in a distributed sensor network context. This work has also
1381: attempted to systematically account for sensor placement and hardware
1382: issues in addition to the typical constraints encountered in related
1383: studies.
1384: \begin{comment}
1385: %Cram\'{e}r--Rao lower--bounds for general spatially--varying fields in
1386: %terms of local and global field smoothness characteristics as well as
1387: %randomized sensor deployment strategies with probabilistic guarantees
1388: %for near--uniform sensor deployment together with an analysis of
1389: %associated sensor over--provisioning costs are fruitful research
1390: %directions.
1391: \end{comment}
1392: 
1393: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1394: \section*{Acknowledgment} The authors would like to thank Nan Ma and
1395: Manqi Zhao from the ECE department of Boston University for helpful
1396: comments and suggestions during different stages of this work. This
1397: material is based upon work supported by the US National Science
1398: Foundation (NSF) under award (CAREER) CCF--0546598, (CAREER)
1399: ECS--0449194, CCF--0430983, and CNS--0435353, and ONR (PECASE) grant
1400: no.~N00014-02-100362. Any opinions, findings, and conclusions or
1401: recommendations expressed in this material are those of the authors
1402: and do not necessarily reflect the views of the NSF and ONR.
1403: 
1404: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1405: %\useRomanappendicesfalse
1406: \appendices
1407: \renewcommand{\theequation}{\thesection.\arabic{equation}}
1408: \setcounter{equation}{0}
1409: 
1410: \renewcommand{\thedefinition}{\Alph{section}.\arabic{definition}}
1411: \renewcommand{\theremark}{\Alph{section}.\arabic{remark}}
1412: \renewcommand{\thetheorem}{\Alph{section}.\arabic{theorem}}
1413: 
1414: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1415: \Section{\label{app:MSEperfProof}Proof of Theorem~\ref{thm:MSEperf}}
1416: 
1417: First, note that the expected value of the binary message $B_{it}$
1418: is given by
1419: \begin{eqnarray}
1420: \eE[B_{it}] &=& \eE[\mathbf{1}(Y_{it} > R_{it})] \nonumber \\
1421: &=& \eE_{Y_{it}}[\eE_{R_{it}}[\mathbf{1}(Y_{it} > R_{it}) | Y_{it} ]] \nonumber \\
1422: &=& \eE_{Y_{it}}[\pP\left(R_{it} < Y_{it} | Y_{it}\right)] \nonumber \\
1423: &\stackrel{(i)}{=}& \eE_{Y_{it}}\left[\frac{Y_{it} + c}{2c}\right] \label{eqn:condExpBinMsg} \\
1424: &=& \frac{\eE[s_t(x_i) + Z_{it}] + c}{2c} \nonumber \\
1425: &\stackrel{(ii)}{=}& \left(\frac{s_t(x_i) + c}{2c}\right) \label{eqn:expBinMsg},
1426: \end{eqnarray}
1427: which is the value of the field $s_t(\cdot)$ at location $x_i$ shifted
1428: and normalized to the interval $[0,1]$. Note that the key steps are
1429: step $(i)$ where we used the fact that $R_{it}$ is uniformly
1430: distributed over $[-c,c]$ and step $(ii)$ where we used the fact that
1431: $Z_{it}$ has zero mean. It should be noted that the final result
1432: (\ref{eqn:expBinMsg}) holds for any $F_\mathbf{Z}(\mathbf{z}) \in
1433: \mathcal{F}$.
1434: 
1435: Using \eqnref{eqn:expBinMsg} we can bound the bias and the variance
1436: of the estimator $\widehat{S}_t(x)$. The bound on the MSE follows
1437: from the bounds on these values since, for any estimator of a
1438: non--random parameter, we have
1439: \begin{equation}\label{eqn:MSEidentity}
1440: \mathrm{MSE}\left(\widehat{S}_t(x)\right) =
1441: \mathrm{bias}^2\left(\widehat{S}_t(x)\right) +
1442: \mathrm{var}\left(\widehat{S}_t(x)\right).
1443: \end{equation}
1444: 
1445: Let $j \in \{1, \ldots, L\}$ denote the index of the supercell that
1446: $x$ falls in. We bound the magnitude of bias of the estimate
1447: $\widehat{S}_t(x)$ in the following way
1448: \begin{eqnarray}
1449: \left|\mathrm{bias}\left(\widehat{S}_t(x)\right)\right| &=&
1450: \left|\eE\left[\widehat{S}_t(x) - s_t(x)\right]\right| \nonumber \\
1451: &=& \Bigg| \eE \Bigg[2c \Bigg[\frac{1}{n} \sum_{i \in I(j,t)} B_{it} \Bigg] \nonumber \\
1452: && - c - s_t(x) \bigg] \bigg| \nonumber \\
1453: &=& \Bigg| 2c \Bigg[\frac{1}{n} \sum_{i \in I(j,t)} \eE\left[B_{it}\right] \Bigg] \nonumber \\
1454: &&- c - s_t(x) \bigg| \nonumber \\
1455: &\stackrel{(i)}{=}& \Bigg| 2c \Bigg[\frac{1}{n} \sum_{i \in I(j,t)}
1456: \Bigg(\frac{s_t(x_i) + c}{2c}\Bigg) \Bigg] \nonumber \\
1457: &&- c - s_t(x) \bigg| \nonumber \\
1458: &=& \Bigg| \frac{1}{n} \sum_{i \in I(j,t)} \left(s_t(x_i) -
1459: s_t(x)\right) \Bigg| \nonumber \\
1460: &\leq& \frac{1}{n} \sum_{i \in
1461: I(j,t)} \left|s_t(x_i) -
1462: s_t(x)\right| \nonumber \\
1463: &\stackrel{(ii)}{\leq}& \frac{1}{n} \sum_{i \in I(j,t)}
1464: \omega_t \left(\|x - x_i\|,x\right) \nonumber \\
1465: % The following equation label has not been referenced anywhere!
1466: % \label{eqn:biasOmegaTerm}
1467: &\stackrel{(iii)}{\leq}& \frac{1}{n} \sum_{i \in I(j,t)}
1468: \omega_t \left(\frac{\sqrt{d}}{\sqrt[d]{L}},x\right) \nonumber \\
1469: &=& \omega_t
1470: \left(\frac{\sqrt{d}}{\sqrt[d]{L}},x\right) \nonumber \\
1471: &\stackrel{(iv)}{\leq}& \widetilde{\omega}_t
1472: \left(\frac{\sqrt{d}}{\sqrt[d]{L}}\right), \label{eqn:biasResult}
1473: \end{eqnarray}
1474: where $(i)$ follows from (\ref{eqn:expBinMsg}), $(ii)$ and $(iv)$
1475: follow from Definitions~\ref{def:localMod} and~\ref{def:globalMod},
1476: and $(iii)$ follows because the local modulus of continuity is a
1477: nondecreasing function of its first argument for each fixed value of
1478: its second argument and since any sensor in the supercell containing
1479: $x$ is within distance $\frac{\sqrt{d}}{\sqrt[d]{L}}$ of $x$ (the
1480: length of the diagonal of a supercell).
1481: 
1482: The variance of the estimate is bounded by
1483: \begin{eqnarray}
1484: \mathrm{var}[\widehat{S}_t(x)] &=& \mathrm{var}\left[2c
1485: \left[\frac{1}{n} \sum_{i \in I(j,t)} B_{it} \right] - c\right] \nonumber \\
1486: &=& \left(\frac{4c^2}{n^{2}}\right) \sum_{i \in I(j,t)}
1487: \mathrm{var}[B_{it}] \label{eqn:varBoundLine2} \\
1488: &\leq& \left(\frac{4c^2}{n^{2}}\right)\left(\frac{n}{4}\right) =
1489: \left(\frac{LMc^2}{N}\right), \label{eqn:varBoundLine3}
1490: \end{eqnarray}
1491: where we used standard properties of variance and the fact that
1492: $\{B_{it}\}$ are independent to obtain \eqnref{eqn:varBoundLine2},
1493: and we used the fact the variance of a Bernoulli$\{0,1\}$ random
1494: variable is bounded by $(1/4)$ and that $n = (N/(LM))$ to obtain
1495: \eqnref{eqn:varBoundLine3}.
1496: 
1497: Combining these bounds for the bias and variance given in
1498: \eqnref{eqn:biasResult} and \eqnref{eqn:varBoundLine3} of the
1499: estimator and using the identity in \eqnref{eqn:MSEidentity}, we get
1500: the desired bound on the MSE for all $x \in G$, $t \in \{1, \ldots,
1501: T\}$, and $F_\mathbf{Z}(\mathbf{z}) \in \mathcal{F}$. \proofover
1502: 
1503: \Section{\label{app:ASConvProof}Proof of Theorem~\ref{thm:ASConv}}
1504: 
1505: First, we note that
1506: \begin{equation}\label{eqn:ASequiv}
1507: \widehat{S}_t(x) \xrightarrow{\mathrm{a.s.}} s_t(x) \equiv
1508: \left|\widehat{S}_t(x) - s_t(x)\right| \xrightarrow{\mathrm{a.s.}}
1509: 0.
1510: \end{equation}
1511: Thus, we proceed with the triangle equality to bound
1512: \begin{eqnarray}\label{eqn:TriangleBoundForAS}
1513: \left|\widehat{S}_t(x) - s_t(x)\right| &\leq& \left|\widehat{S}_t(x)
1514: - \eE\left[\widehat{S}_t(x)\right]\right| \nonumber \\
1515: &+& \left|\eE\left[\widehat{S}_t(x)\right] - s_t(x)\right|.
1516: \end{eqnarray}
1517: In the proof of \thmref{app:MSEperfProof} given in
1518: \appref{app:MSEperfProof} we have shown that the second term of
1519: \eqnref{eqn:TriangleBoundForAS}, which is the absolute value of the
1520: estimator bias, is bounded by \eqnref{eqn:biasResult} which shows
1521: that
1522: \begin{equation}\label{eqn:ASConvSecondTerm}
1523: \left|\eE\left[\widehat{S}_t(x)\right] - s_t(x)\right|
1524: \longrightarrow 0
1525: \end{equation}
1526: as $N$ and $L$ scale as in \eqnref{eqn:LNScaling}.
1527: 
1528: Letting $j$ denote the supercell that $x$ falls in, the first term
1529: of \eqnref{eqn:TriangleBoundForAS} can be rewritten as
1530: \[
1531: \left|\widehat{S}_t(x) - \eE\left[\widehat{S}_t(x)\right]\right| =
1532: \left| 2c \left[\frac{1}{n} \sum_{i \in I(j,t)} B_{it} - \eE[B_{it}]
1533: \right] \right|.
1534: \]
1535: Recall that the cardinality of $I(j,t)$ is $n = (N/(LM))$. Since the
1536: $B_{it}$ random variables are independent across sensors and their
1537: fourth central moments are uniformly bounded by 1 (since they are
1538: binary~$\{0,1\}$ random variables), the strong law of large numbers
1539: \cite[pp.~206--207]{Durret-TLC-PTE} can be applied to obtain
1540: \[
1541: \frac{1}{n} \sum_{i \in I(j,t)} B_{it} - \eE[B_{it}]
1542: \xrightarrow{\mathrm{a.s.}} 0,
1543: \]
1544: as $N \longrightarrow \infty$ (since $n = (N/(LM))$) and thus the
1545: first term of \eqnref{eqn:TriangleBoundForAS}
1546: \begin{equation}\label{eqn:ASConvFirstTerm}
1547: \left|\widehat{S}_t(x) - \eE\left[\widehat{S}_t(x)\right]\right|
1548: \xrightarrow{\mathrm{a.s.}} 0.
1549: \end{equation}
1550: Combining \eqnref{eqn:ASConvSecondTerm} and
1551: \eqnref{eqn:ASConvFirstTerm} into \eqnref{eqn:ASequiv} and
1552: \eqnref{eqn:TriangleBoundForAS} finishes the proof. \proofover
1553: 
1554: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1555: 
1556: \bibliography{../LaTeX/biblio}
1557: 
1558: % Use to balance out final columns
1559: %\addtolength{\textheight}{-3cm}
1560: 
1561: \end{document}
1562: