hep-ex0312022/art.tex
1: \documentstyle[psfig,12pt]{aipproc}
2: \pagestyle{plain}
3: \setlength{\textheight}{9.7in}
4: 
5: \tolerance = 10000
6: 
7: \newcommand{\grad}{$^{\circ}$}
8: \newcommand{\vsp}{\vspace{0.125in}}
9: \newcommand{\hsp}{\hspace{0.15in}}
10: \newcommand{\gcm}{~g/cm$^3$~}
11: \newcommand{\gcms}{~g/cm$^2$~}
12: 
13: \newcommand{\muea}{\mbox{\sl $\mu^{-} A \rightarrow e^{-}$ A }}
14: \newcommand{\apr}{$\simeq$~}
15: 
16: %\pagenumbering{arabic}
17: 
18: \begin{document}
19: 
20: \title{ KALMAN FILTER BASED TRACKER STUDY FOR $\mu e$ - CONVERSION EXPERIMENT }
21: 
22: 
23: \author{Rashid M. Djilkibaev \thanks{Permanent address: Institute for Nuclear
24:  Research, 60-th Oct. pr. 7a, Moscow 117312, Russia}$^{,2}$, Rostislav V. Konoplich$^{2,3}$}
25:  \address{$^{2}$Department of Physics,
26:  New York University,
27:  New York, NY 10003 \\
28:  $^{3}$Manhattan College, Riverdale, New York, NY, 10471}
29: \maketitle
30: 
31: 
32: \begin{abstract}
33: The search for muon to electron conversion 
34: with a sensitivity of the order $10^{-17}$ requires a several order of magnitude
35: increase in muon intensity and a high resolution,
36: $\sigma \simeq$  0.1 MeV/c of the electron's momentum.
37: 
38: We present results of a pattern recognition  and track momentum
39: reconstruction algorithm that relies on a Kalman filter approach. 
40: Background from captured protons, neutrons, 
41: photons and from muon decay in orbit were generated by GEANT. 
42: The effective average 
43: straw tube background rate was 800 kHz.
44: 
45: The pattern recognition proceeds in two stages.
46: In the first, simple considerations using only  
47: straw tube center coordinates, without drift time information, were
48: applied to reduce the background to a manageable level. 
49: Then the drift time information is incorporated and a Deterministic Annealing Filter
50: applied to reach the final level of background suppression and 
51: to provide a starting 
52: point for the track momentum reconstruction using the Kalman filter.
53: This procedure reduces the simulated background by a factor 800
54: with small,(2.7\%), losses in real tracker hits.
55:  
56: The momentum resolution of the tracker is $\sigma $ = 0.12 MeV/c 
57: and the acceptance for muon conversion events with momentum
58: above 103.6 MeV/c is 22\%. These numbers do not differ significantly 
59: from the values obtained without background.
60: 
61: The expected number of events from muon decay in orbit (main background) 
62: in which the
63: decay electron has momentum greater than 100 MeV/c is 0.3, compared
64: to 6.5 $\mu e$ - conversion events above the same threshold 
65: for $R_{\mu e} = 10^{-16}$.
66: 
67: \end{abstract}
68: 
69: 
70: \section{Introduction}
71: 
72: The observation of $\mu e$ - conversion would provide the first
73: direct evidence for lepton flavor violation in charged lepton sector 
74: and require new physics,
75: beyond the Standard Model (see ~\cite{kuno} and references therein). 
76: Lepton flavor is not conserved in neutrino oscillations but the
77: modifications to the Standard Model to include the small neutrino
78: masses do not lead to an appreciable rate for $\mu e$ - conversion 
79: 
80: 
81: \begin{center}
82: $\mu^{-} + N \to e^{-} + N$ .
83: \end{center}
84:  
85: This process violates the lepton flavor numbers, $L_{e}$ and $L_{\mu}$,
86: but conserves the total lepton number. The signature of the process is very
87: clear: a single monochromatic electron in the final state with the energy 
88: close to the muon mass:
89: 
90: \begin{center}
91: $E_{e} = m_{\mu} - B_{\mu} - E_{rec}$ 
92: \end{center}
93: where $m_{\mu}$ is a muon mass, $ B_{\mu}$ is a binding energy of the
94: 1s muonic atom, $E_{rec}$ is a nuclear recoil energy. 
95: 
96: The SINDRUM II collaboration at PSI has carried out a program of experiments to 
97: search for $\mu e$ - conversion in various nuclei. They find that at 
98: 90\% CL the upper limit for the reaction $\mu^{-} + Ti \to e^{-} + Ti$
99: is $6.1\times 10^{-13}$ ~\cite{psi}. In this experiment muons were accumulated
100: at a rate of $10^{7} \mu^{-}/sec$. According to preliminary results 
101: ~\cite{psigold} for the reaction $\mu^{-} + Au \to e^{-} + Au$
102: a single event sensitivity is $3.3\times 10^{-13}$
103: 
104: In ~\cite{rashid} an idea of increasing of muon beam intensity
105: by a few orders of magnitude up to $10^{11} \mu^{-}/sec$ based on the 
106: solenoid-capture scheme was discussed and MELC experiment was proposed
107: ~\cite{abadj} with a goal to reach a sensitivity of the order $10^{-17}$.
108:  
109: A new $\mu e$ - conversion experiment MECO (Muon Electron COnversion) E-940
110: ~\cite{bnl}, exploiting the idea of the solenoid-capture scheme,
111: is under preparation at BNL. MECO aims to search for 
112: $\mu^{-} + Al \to e^{-} + Al$ with a single event sensitivity 2$\times$$10^{-17}$. It will
113: use a new high-intensity pulsed muon beam, which could yield about
114: $10^{11} \mu^{-}/sec$ stopped in a target. 
115: 
116: Also the PRIME (PRISM MuE conversion) working group at KEK expressed an
117: interest ~\cite{prime} to carry out a search for lepton flavor violation
118: in $\mu e$ - conversion in a muonic atom at a sensitivity of 
119: $10^{-18}$ using a proposed high intensity pure muon source of
120: $10^{11} - 10^{12} \mu^{-}/sec$ .  
121: 
122: In this article we develop a pattern recognition and  track reconstruction
123: procedure 
124: based on the Kalman filter technique for a transverse version of tracker for MECO
125: experiment. 
126: 
127: This paper is organized as follows: In section 2 the 
128: transverse tracker is described and the advantages of the tracker discussed.
129: In Section 3 possible backgrounds are discussed and  the 
130: procedure of background simulation briefly explained. 
131: In Section 4 the pattern recognition
132: procedure is developed. 
133: At this stage a deterministic annealing 
134: filter (DAF) is applied to make a final background 
135: suppression and provide a starting 
136: point for track momentum reconstruction by the Kalman 
137: filter technique. Section 5 describes the procedure of momentum 
138: reconstruction based on the Kalman filter. Results of the pattern 
139: recognition and momentum reconstruction are presented in this 
140: section. A brief summary outlook can be found in 
141: Section 6. The appendices provide a more detailed look at Kalman filter and deterministic
142: annealing filter. Also tracker resolution is discussed in Appendix.  
143: 
144: \section{The Transverse Tracker Description}
145: 
146: The goal of the MECO tracker is to detect the electron from $\mu e$ - conversion
147: with large acceptance and measure its momentum 
148: with high resolution ($\sigma  \simeq$  0.1 MeV/c).
149: The tracker is located in a uniform 1T magnetic field. 
150: The minimal tracker length is defined by the following
151: requirements:
152: 
153: Background electrons with an energy around 105 MeV  are produced
154: by cosmic rays  in the wall between the transport and detector
155: solenoids. These electrons cannot have a pitch angle $\theta$ in
156: the tracker greater than $45^{o}$, due to the adiabatic character
157: of charge particle movement. To suppress the  cosmic ray
158: background we require that measured pitch angle $\theta_{min}$ for
159: signal events to be more than $45^{o}$.
160: 
161: To get more redundancy for signal events we require that the measured
162: trajectory should have two full turns. This requirement sets
163: limits on the minimal tracker length, which is
164: expressed as:
165: 
166: \begin{center}
167: $ L_{Tracker}^{min} = 4 \cdot \pi \cdot 1/2.998 \cdot P \cdot cos(\theta_{min})/B \simeq
168:   4.19 \cdot 105 \cdot 0.707$/1.0 $\simeq$ 310 cm
169: \end{center}
170: where P is a muon conversion momentum (MeV/c) and B is a magnetic
171: field (1 Tesla) in the tracker region.
172: 
173: The tracker consists of 18 modules spaced  17 cm apart. A module
174: consists of 6 planes, each turned at 30$^{o}$ and shifted 2.5 cm
175: relative to the previous one (Figure ~\ref{fig:setup}). A plane
176: consists of two trapezoidal chambers of  width 30 cm and lengths
177: 70 to 130 cm in an up and down configuration. 
178: The chamber coordinate systems are defined by a
179: rotation  angle proportional to 30$^{o}$ giving an effective
180: ``stereo'' of crossed directions for 12 different views.
181: 
182: A protective sheath is used to suppress background Compton
183: electrons from a chamber frame. 
184: The protective sheath makes nonsensitive a small
185: region of the anode wire near the frame.
186: 
187: The chamber consists of one layer of straw tubes (60 straws) of 5
188: mm diameter, and length varying from 70 cm to 130 cm. The total
189: number of chambers is 216. The chamber sensitive area starts from
190: 38 cm radius. The straws are assumed to have wall 
191: thickness 15  $\mu$m and are
192: constructed of kapton. The total thickness of each chamber is 9
193: mg/$cm^2$. The total number of the tracker straws is 12960. The
194: tracker length is 302 cm. A signal from the straw anode wire 
195: will be used to get drift time.
196: 
197: The Al target is tapered in the downstream direction, with 5 cm
198: disk spacing and radii from 8.3 cm to 6.53 cm. The target is
199: placed in the graded portion of the DS magnetic field, with the
200: first disk at 1.75 T and the last at 1.3 T. Protons from the muon
201: capture are absorbed in three concentric polyethylene absorbers: a
202: conical tube of dimensions $R_{1}$ = 46 cm, $R_{2}$ = 70 cm, L =
203: 260 cm and a tube (R = 70 cm, L = 200 cm) both have thickness 3
204: mm, and a tube of smaller radius (R = 36 cm, L = 235 cm) of
205: thickness 0.5 mm which are placed  before the tracker.
206: 
207: \begin{figure}[htb!]
208: \centerline{\hbox{\psfig{figure=tracker.eps,height=3.5in,clip=on}}}
209:   \caption{
210:   Schematic drawing of the tracker design.
211:  }
212: \label{fig:setup}
213: \end{figure}
214: 
215: \bigskip
216: The transverse tracker is to be compared with a
217: longitudinal version that MECO is also considering. 
218: The longitudinal tracker ~\cite{bnl} consists 
219: of an octagonal array of eight detector planes placed symmetrically 
220: around the Detector Solenoid axis, plus eight more planes projecting 
221: radially outward from each vertex of the octagon. Each plane consists of
222: resistive straws approximately 300 cm long. A hit position 
223: in the radial and azimuthal direction is determined by the straw
224: position and the drift time on the anode wire. The hit position in 
225: the axial direction is determined by the centroid of the imaged charge 
226: from the anode wire, as collected on cathode pads.    
227: 
228: The transverse tracker would have several advantages in comparison with a
229: longitudinal one since: 
230: 
231: \noindent
232: $\bullet$ Normal non resistive straws are used without a pad system. \\
233: $\bullet$ Gas manifolds, straw end-caps and chamber supports are
234: all outside the conversion electron trajectories. \\
235: $\bullet$ Shorter straws (0.7 - 1.3 m) are technically easier to build
236: and are more robust against instability than longitudinal straws of 3 m.\\
237: $\bullet$ Complications from the small tilt of each plane of the longitudinal
238: tracker with respect to the magnetic field are avoided.\\
239: $\bullet$ Transverse geometry provides a simple signature of an event since
240: charged particles cross a single straw only once.\\
241: $\bullet$ There  is a significant simplification dealing 
242: only with single chamber hits points.\\
243: $\bullet$ The average number of spatially separated hits is a few times greater in  
244: comparison with the number of  spatially separated clusters in the longitudinal 
245: tracker. The hits are distributed uniformly in
246: lobes along the transverse tracker.\\
247: $\bullet$ The transverse tracker presents less material in the case of
248: 15 $\mu$m straw wall thickness. The effective total thickness is
249: 29*(15+2)*3.14 = 1550 $\mu$m versus longitudinal tracker 8*3.5*25*3.14 = 2200 $\mu$m.\\
250: $\bullet$ The pattern recognition  can be performed with good precision
251: even without drift time and amplitude information.\\
252: $\bullet$ The Cu layer covering straw tubes suppresses significantly gas diffusion
253: through a straw wall.\\
254: $\bullet$ Capacitive crosstalk  between channels is small.\\
255: $\bullet$ Low energy Compton electron backgrounds have distinguished signature.\\
256: 
257: 
258: \section{Background simulation}  
259: 
260: Our analysis is based on a full GEANT simulation taking into account an
261: individual straw structure.
262: Multiple scattering and energy loss are taken into account. 
263: Isobutane ($C_{4}H_{10}$) gas is assumed to fill the tubes. 
264: 
265: For the subsequent analysis we take only events satisfying the
266: following criteria: number of hits in the tracker is greater than 15;
267: MC simulated energy release in the calorimeter is greater than 80 MeV;
268: pitch angle is greater than 45 $^{0}$. That leaves about $35\%$ of
269: the original events.
270: 
271: The primary sources of charged particles in the tracker detector
272: during the detection time are protons, neutrons and photons from
273: muon capture by $^{27}Al$ nuclei and electrons from muon decay in
274: orbit.
275: The average and peak tracker rates from
276: different backgrounds are presented in Table ~~\ref{table:tab1}.
277: The main source of background to muon conversion is  muon decay in orbit (DIO).
278: 
279: \begin{center}
280: \begin{table}[htb!]
281: \caption { The Average and Peak (in parenthesis) Tracker  Rates }
282: \begin{tabular}{|l|c|c|c|c|c|}
283: \hline
284: Processes&proton&neutron &$\gamma$ &DIO & DIO\\
285:  & & & &$<$ 55 MeV &$>$ 55MeV\\
286: \hline
287: Particles/process & 0.1 & 1.2 & 1.8 & 0.9945 & $5.5 \cdot 10^{-3}$ \\
288: Particles/nsec & $120 (170)\times 0.1$ &$120 (170)\times 1.2$ &$120 (170) \times
289: 1.8$ &80 (115) & 80 (115)\\
290: \hline
291: Prob. Particle to Hit Tracker&$ 1.08 \cdot 10^{-2}$&$0.92 \cdot 10^{-3}$
292: &$0.9 \cdot 10^{-3}$ & $4.0 \cdot 10^{-4}$ & $1.4 \cdot 10^{-2}$\\
293: \hline
294: Events in Tracker/30ns&3.9 (5.5) &4.0 (5.6) &5.9 (8.2) &1.0 (1.4) & 0.2 (0.3)\\
295: \hline
296: Straws on in Tracker/event &14.0 &5.2 &31.0 & 15.7 & 4.3\\
297: \hline
298: Straw Rate (kHz)&140 (200) &53 (75) &470 (650) &40 (56) & 2.2 (3.3)\\
299: \hline
300: \end{tabular}
301: \label{table:tab1}
302: \end{table}
303: \end{center}
304: 
305: 
306: 
307: To study the tracker performance in the presence of the background
308: the number of muon conversion events ($10^5$) with
309: initial momentum  of conversion electrons produced
310: in the target (105 MeV/c) was simulated and saved to a data file.
311: The expected number of
312: DIO events during the experiment time ($10^7$ sec) in
313: this region is $5.2 \cdot 10^4 $ events above 100 MeV/c. 
314: 
315: To study the main background 
316: the number of simulated  DIO events 
317: in this energy region  was chosen to be ten times greater.
318: Five different random backgrounds (protons, neutrons, photons and
319: DIO (see Table ~\ref{table:tab1}) were generated and saved to data
320: files. 
321: 
322: The  numbers in the table are the result of a GEANT simulations
323: at the anticipated beam intensity,
324: with no artificial increase as a  safety factor.
325: No suppression was added for rejection of
326: heavy ionization particles.
327: 
328: Background was taken from the data files and added to
329: conversion events and DIO events above 100 MeV/c. For example, we
330: simulate the number of events N for the proton background
331: according to a Poisson distribution with average 3.9 
332: taken from background Table ~\ref{table:tab1}. Then we randomly
333: pick-up N accidental proton events from the corresponding data
334: file and add these events to the muon conversion  or DIO event. 
335: In the
336: same way the above procedure is repeated for all background types.
337: 
338: 
339: 
340: In Figure ~\ref{fig:nrb} distributions in the number of real and background 
341: tracker hits are shown. 
342: The average number of real hits is 29.  
343: The real hits distribution starts from 15 hits, corresponding to 
344: the cut-off in the minimum number of selected hits. 
345: The maximum number of real tracker hits can reach 60.
346: The average number of background hits is 300 which corresponds to an average
347: straw rate about 800 kHz.
348: The distribution in the number of background hits is a broad one and
349: the maximum number of hits is about 900.   
350: 
351: 
352: 
353: \begin{figure}[htb!]
354: \centerline{\hbox{\psfig{figure=Nreal.eps,height=2.0in,clip=on}
355:                   \psfig{figure=Nback_t.eps,height=2.0in,clip=on}}}
356:  \caption{
357: Distributions in the number of real and background tracker hits. 
358:  }
359: \label{fig:nrb}
360: \end{figure}
361: 
362: 
363: \section{Pattern recognition}
364: 
365: 
366: 
367: There are twelve views in the tracker,
368: each separated from the previous by 30 degrees. In a typical
369: event, a two-dimensional projection of the helical trajectory, a
370: sine curve, is observed in three or four of the views. There are
371: approximately 10 hits in each view, and 29 hits in the
372: event. The hits are grouped in lobes (see Figure ~\ref{fig:sin1}) with a typical
373: gap between lobes about 60 cm (in these gaps an electron travels in vacuum).
374: Therefore in the sensitive area of the tracker one gets track segments 
375: but not a complete track and the problem of combining information
376: from different lobes arises. Since in a given view at least 4 hits 
377: are required to reconstruct four parameters describing a 
378: two-dimensional projection of helix often it is not possible to get helix
379: parameters for a track segment in a single lobe and than to apply
380: track element merging strategy. Quite opposite one has to use
381: hits from different lobes for pattern recognition and then for
382: momentum reconstruction. We will show that an appropriate mathematical 
383: approach allows to resolve the problem..      
384: 
385: The sinusoidal projection of helix is described by four parameters:
386: $x_{0}^{\prime }$ , $z_{0}^{\prime }$, $R_{L}$, $R_{T}$,
387: 
388: \begin{equation}
389: x_{i}^{\prime }=x_{0}^{\prime }+R_{T}\cos (\frac{z_{i}-z_{0}^{\prime }}{R_{L}%
390: })
391:  \label{eq0}
392: \end{equation}
393: 
394: where importantly,  $R_{L}$ and $R_{T}$, related to the longitudinal and transversal
395: momenta, and z coordinate are common to all projections. Coordinate $x^{\prime}$ is 
396: defined in a system for each given view.
397: 
398: A simple and powerful reconstruction strategy
399: starts from a single chamber view. To determine the parameters in this view a minimum of four
400: hits are required.
401: The  total momentum and  momentum  pitch angle are determined.
402: 
403: \begin{figure}[htb!]
404: \centerline{\hbox{\psfig{figure=sin1.eps,height=2.5in,clip=on}}}
405:   \caption{
406:   Lobes in the tracker.
407:  }
408: \label{fig:sin1}
409: \end{figure}
410: 
411: To find the five parameters describing the full 3-dimensional helix,
412:  one additional hit outside the plane is required:
413: 
414: \begin{equation}
415: x_{i}^{\prime }=x_{0}^{\prime }+R_{T}\cos (\frac{z_{i}-z_{0}^{\prime }}{R_{L}%
416: }); ~~~~~~~
417: y_{i}^{\prime }=y_{0}^{\prime }+R_{T}\sin (\frac{z_{i}-z_{0}^{\prime }}{R_{L}%
418: })
419:  \label{eq01}
420: \end{equation}
421: 
422: 
423: The mathematical procedure to find the parameters is the following one:\\
424: 
425: \noindent
426: $\bullet$ four hits from a single view are taken to give the system of equations
427: for helix parameters in the coordinate system related with the view.\\
428: $\bullet$ the system of four equations is reduced to a unique equation for $R_{L}$. \\
429: 
430: \begin{equation}
431: \Delta x_{24}S_{43}S_{41}S_{31}+\Delta x_{14}S_{43}S_{42}S_{23}+\Delta
432: x_{34}S_{41}S_{42}S_{12}=0  \label{eq2}
433: \end{equation}
434: 
435: where $\Delta x_{ij}=x_{i}^{\prime }-x_{j}^{\prime }$ , $S_{ij}=\sin
436: ((z_{i}-z_{j})/2R_{L}).$
437: \\
438: \noindent
439: $\bullet$ the equation is solved for $R_{L}$ numerically by the Newton-Raphson method.\\
440: $\bullet$ the parameters are expressed in terms of $R_{L}$. \\
441: 
442: \begin{equation}
443: \tan (z_{0}^{\prime }/R_{L})=\frac{\Delta x_{14}S_{43}\sin
444: ((z_{4}+z_{3})/2R_{L})-\Delta x_{34}S_{41}\sin ((z_{4}+z_{1})/2R_{L})}{%
445: \Delta x_{14}S_{43}\cos ((z_{4}+z_{3})/2R_{L})-\Delta x_{34}S_{41}\cos
446: ((z_{4}+z_{1})/2R_{L})}  \label{eq3}
447: \end{equation}
448: 
449: \[
450: R_{T}=\frac{\Delta x_{12}}{c_{1}-c_{2}}
451: \]
452: 
453: \[
454: x_{0}^{\prime }=\frac{x_{2}c_{1}-x_{1}c_{2}}{c_{1}-c_{2}}
455: \]
456: 
457: where $c=\cos ((z_{i}-z_{0}^{\prime })/R_{L}).$
458: \\
459: \noindent
460: 
461: $\bullet$ a fifth hit from a different view is included to
462: obtain the remaining parameters of helix. \\
463: 
464: 
465: \begin{equation}
466: x_{0}=\frac{1}{\sin (\alpha ^{\prime \prime }-\alpha ^{\prime })}\left\{
467: x_{0}^{\prime }\sin \alpha ^{\prime \prime }-[x_{5}^{\prime \prime
468: }-R_{T}\cos (\frac{z_{5}-z_{0}}{R_{L}}-\alpha ^{\prime \prime })]\sin
469: (\alpha ^{\prime })\right\}   \label{eq7}
470: \end{equation}
471: 
472: \begin{equation}
473: y_{0}=\frac{-1}{\sin (\alpha ^{\prime \prime }-\alpha ^{\prime })}\left\{
474: x_{0}^{\prime }\cos \alpha ^{\prime \prime }-[x_{5}^{\prime \prime
475: }-R_{T}\cos (\frac{z_{5}-z_{0}}{R_{L}}-\alpha ^{\prime \prime })]\cos
476: (\alpha ^{\prime })\right\} .  \label{eq8}
477: \end{equation}
478: 
479: where $\alpha ^{\prime }$ ($\alpha ^{\prime \prime }$) is the angle of
480: rotation from the global system to a given chamber system,
481: 
482: \begin{equation}
483: z_{0}=z_{0}^{\prime }-\alpha ^{\prime }R_{L}.  \label{eq6}
484: \end{equation}
485: 
486: 
487: It is important to note that the consideration of several lobes 
488: (see Figure ~\ref{fig:sin1}) allows calculation of the
489: longitudinal radius $R_{L}$ with high precision $\Delta P/P
490: \approx 10^{-4}$ due to the large distance between the lobes in
491: comparison with their size.
492: 
493: 
494: A two stage procedure was developed to provide the pattern
495: recognition in the transverse tracker:\\
496: 
497: $\bullet$ {\bf pattern recognition without drift time.} At this step only
498: information on centers of straw hits is used. The parameters of
499: the reconstructed average helix are considered as a starting point
500: to find an approximate helix
501: which fits the straw hit centers.\\
502: 
503: $\bullet$ {\bf pattern recognition with drift time.} A deterministic annealing 
504: filter DAF  ~\cite{daf} ~is applied to make a final background 
505: suppression and provide a starting 
506: point for track momentum reconstruction by the Kalman 
507: filter technique ~\cite{rkalm}, ~\cite{kalman}. 
508: 
509: 
510: \subsection*{Pattern recognition  without drift time information}
511: 
512: In the first level  of the analysis procedure (the straw hit center
513: approximation) only centers of straw hits were used as chamber hit
514: coordinates $x^{\prime}$, z.
515: It was assumed that there is only one useful muon conversion 
516: or DIO track. 
517: 
518: 
519: It is important to note that the straw hits allow us to
520: reconstruct the helix parameters with high precision even without
521: drift time information. Indeed, the uncertainty in the helix
522: radius $\Delta R_{T}$ is approximately  $\sigma \sim D/\sqrt12
523: \approx$ 0.14 cm for the diameter of a straw tube $D = $5 mm. The
524: average helix radius for muon conversion events is $R_T \sim $ 25
525: cm . We can therefore expect the momentum resolution $\Delta P/P 
526: \sim 0.5 - 1\%$
527: for a straw diameter of 5 mm.
528: It is worth noting that  using straw tube center positions
529: without drift time information we can reconstruct the total
530: momentum with the standard deviation $\sigma $ = 0.45 MeV/c for 
531: a straw diameter of 5 mm.
532: 
533: \begin{figure}[htb!]
534: \centerline{\hbox{\psfig{figure=pin_100k.eps,height=3.0in,clip=on}}}
535:  \caption{
536:  Momentum distribution of electrons entering the tracker.
537:  }
538: \label{fig:Pin}
539: \end{figure}
540: 
541: 
542: \bigskip
543: 
544:  The initial momentum distribution of electrons entering the
545: tracker and satisfying the criteria mentioned above is shown in
546: Figure ~\ref{fig:Pin}. This distribution is the a narrow one 
547: and  can be fitted by a Gaussian with a
548: standard deviation $\sigma $ = 0.21 MeV/c  and average momentum
549: P$_{in}$ = 104.4 MeV/c in the range 104.2 MeV/c - 105 MeV/c. 
550: The width of
551: the initial distribution of electrons is small in comparison with
552: the total width of the momentum distribution reconstructed in the
553: straw hit center approximation.
554: 
555: In Figure  ~\ref{fig:sel_0} an initial map of real and background tracker 
556: hits in all tracker's views is shown as an example for a first simulated  event.
557: We will call it the sample event in the following.
558: The number of real tracker hits is 29 and the number of background 
559: hits is 260 in this event. The real hits are depicted as cyan spots 
560: and the background ones are shown as red spots. Note that in the 
561: figure all 12 views were joined in one for all 18 tracker modules. 
562: 
563: \begin{figure}[htb!]
564: \centerline{\hbox{\psfig{figure=sel_0.eps,height=2.5in,clip=on}}}
565:  \caption{
566:  Plot of real + background tracker hits before the selection procedure.
567:  }
568: \label{fig:sel_0}
569: \end{figure}
570: 
571: One can see from the initial distribution of tracker hits that 
572: some of background hits are concentrated along straight lines
573: ( Figure  ~\ref{fig:sel_0}) providing a typical signature of Compton 
574: electrons from photon background.
575: 
576:  The procedure of the pattern recognition and
577: fitting without drift time in the presence of background 
578: can be described by the following steps:
579: 
580: 
581: \bigskip
582: 1) {\bf Rejection of hits produced by Compton electrons.} \\
583: 
584: 
585: $\bullet$ 
586: A grouping containing 6 modules is taken for a given view. If in
587: this grouping there are $N_{1}>2$ hits with the same tube number
588: (same straw (x$^{\prime}$) coordinate) the corresponding hit numbers 
589: are recorded. 
590: If, in a second  grouping displaced by one module from the first, 
591: there are $N_{2}>2$ hits with the
592: same tube numbers as at the initial position of the grouping then 
593: hits from the first and the second grouping are marked as Compton electron hits. The 
594: procedure is repeated in forward and backward directions. At the
595: end of this step all marked hits are temporary removed from
596: the following analysis. This selection typically reduces twofold the number of 
597: background hits. 
598: Also about 5$\% $ of
599: the real hits are rejected by the procedure 
600: but later they can be restored. 
601: 
602: 
603: Figure  ~\ref{fig:sel_1} demonstrates the remaining tracker hits after application
604: of the rejection procedure to the sample event. The suppression of background hits 
605: is evident.    
606: 
607: \begin{figure}[htb!]
608: \centerline{\hbox{\psfig{figure=sel_1.eps,height=2.5in,clip=on}}}
609:  \caption{
610:  Plot of real + background tracker hits after the first selection procedure.
611:  }
612: \label{fig:sel_1}
613: \end{figure}
614: 
615: \bigskip
616: 
617: 2) {\bf Selection based on calorimeter and tracker hits.} \\
618: 
619: To reconstruct the helix parameters four hits are needed.
620: To reduce the number of four hit combinations 
621: the calorimeter hit is used in each combination. 
622: The  calorimeter coordinate resolution is $\sigma = 1.5$ cm which is
623: ten times greater than tracker hit resolution in the straw hit center
624: approximation. 
625: In spite of the poor coordinate calorimeter resolution 
626: it is possible to get a significant background suppression. 
627: 
628: This procedure is described by the following steps:
629: 
630: $\bullet$ For a given view all possible three hit
631: combinations are chosen and four hits are formed by adding 
632: a calorimeter hit to the three hit combinations.
633: The mathematical approach described above
634: is applied to four hits.
635: The algorithm allows us to calculate $R$ , $\cos \theta $ ,
636: $z_{0}^{\prime }$ , $x_{0}^{\prime }$ in the chamber system of
637: coordinates. Only the combinations that survive a cut-off in the
638: particle
639: momentum $p_{\min }=94MeV/c<p<p_{\max }=114MeV/c$ and pitch angle 
640: $\theta _{\min }=41.4^{\circ }<\theta <\theta _{\max }=66.4^{\circ }$ are
641: retained for subsequent analysis.
642: 
643: \bigskip
644: $\bullet$ A fifth hit from a different view than
645: the four hit combination is added. 
646: This allows us to calculate all 5 helix
647: parameters in the global system of coordinates. A cut-off in $x_{0}$ and 
648: $y_{0}$ is applied at this step to five hits.
649: The coordinates $x_{0}, y_{0}$ are required to be in an acceptable range
650:  $-40 < x_{0},y_{0} < 40 $ cm,
651: where the range is determined by the GEANT simulation  
652: 
653: 
654: \bigskip
655: $\bullet$ We look for hits correlated in 3 dimensions to choose
656: good five hit combinations. For each selected five hit
657: combination using the found helix parameters we reconstruct the
658: helix, calculate all crossings of the tracker chambers for the
659: reconstructed trajectory and define the total number of crossings
660: N. Then we calculate how many times M at least one tracker hit
661: matches the crossing in a road $\pm $ 3 cm. In an
662: ideal case M should be equal to N.
663: 
664: \bigskip
665: $\bullet$ The probability is evaluated to get M hits in the  road. 
666: The probability that M  of N crossings  are in the road
667: is estimated by a trial function
668: 
669: \[
670: \mathit{Prob
671: }=\frac{N!}{M!(N-M)!}\epsilon^{M}(1-\epsilon)^{N-M}
672: \]
673: 
674: where ~~$\epsilon$ is 
675: the probability of that there is a hit 
676: in the road within $\pm$3 cm.
677: 
678: If the total probability \textit{Prob} is greater than the threshold
679: probability then the
680: given helix is considered as a good candidate.
681: The threshold probability and ~$\epsilon$ were found empirically
682: to be 0.001 and 0.95, respectively.
683: 
684: As an output of the previous steps we get a collection of valid
685: five hit combinations (helices). 
686: An individual hit is kept if it is in any valid hit
687: combination, providing a list of good tracker hits. It is worth
688: emphasizing that, due to the strong spatial correlations between the
689: helix hits in comparison with the un-correlated background, the
690: number of false hits is reduced drastically on applying the
691: road requirement. On average the number of background hits is 
692: reduced due to this step by a factor 20. 
693:  
694: In Figure  ~\ref{fig:sel_2} 
695: the surviving hits are shown. 
696: The significant reduction of background hits can be seen by comparing 
697:  Figure  ~\ref{fig:sel_1} and  Figure  ~\ref{fig:sel_2}.
698: 
699: 
700: \begin{figure}[htb!]
701: \centerline{\hbox{\psfig{figure=sel_2.eps,height=2.5in,clip=on}}}
702:  \caption{
703:  Plot of real + background tracker hits after the second selection procedure.
704:  }
705: \label{fig:sel_2}
706: \end{figure}
707: 
708: 
709: 
710: Note that at this step no real hits are removed for the sample event. 
711: 
712: \bigskip
713: 
714: 
715: 3) {\bf Selection based on tracker hits.}\\
716: 
717: In this selection procedure the tracker hits found in step 2 are used.
718: This procedure repeats the previous one with the following changes:\\
719: 
720: $\bullet$ Hits are chosen only from the tracker hits which allows to
721: improve the performance of the selection procedure since tracker hit 
722: coordinates are defined significantly better than calorimeter hit 
723: coordinates. 
724: 
725: 
726: A plot of the number of four hit and five hit
727: combinations in the presence of the background (in average about 120 
728: and 630 respectively) is shown in Figure ~\ref{fig:comb} (a) and (b).
729: 
730: \begin{figure}[htb!]
731: \centerline{\hbox{\psfig{figure=3plus1.eps,height=2.0in,clip=on}
732:                   \psfig{figure=3plus2.eps,height=2.0in,clip=on}}}
733:  \caption{
734: Number of four hit and five hit combinations.
735:  }
736: \label{fig:comb}
737: \end{figure}
738: 
739: As seen in Figure ~\ref{fig:comb} these distributions are
740: broad ones reaching a few thousand combinations.
741: 
742: 
743: \bigskip
744: $\bullet$ For each five hit combination selected above we evaluate
745: the position of a hit in the
746: calorimeter $x_{eval}$ , z$_{eval}$ on the basis of the defined helix parameters.
747: The hit combination is accepted
748: only if the evaluated hit matches the calorimeter hit within a road : $%
749: \left| x_{calo}-x_{eval}\right| < 7 cm$ . 
750: 
751: 
752: $\bullet$ Parameters at this step are more
753: restrictive: the road width is taken to be 0.75 cm and the
754: probability to be within the road $\epsilon$ = 0.97. 
755: 
756:  Figure  ~\ref{fig:sel_3} shows tracker hits 
757: remaining after this step for the  sample event. 
758: The number of background hits is 
759: reduced typically by a factor 4 - 5 due to this step. 
760: 
761: 
762: \begin{figure}[htb!]
763: \centerline{\hbox{\psfig{figure=sel_3.eps,height=2.5in,clip=on}}}
764:  \caption{
765:  Plot of real + background tracker hits after the third selection procedure.
766:  }
767: \label{fig:sel_3}
768: \end{figure}
769: 
770: 
771: 
772: \bigskip
773: 4) {\bf Selection including restored tracker hits.} \\
774: 
775: $\bullet$ Some real tracker hits are lost
776: at previous steps of the procedure especially at Compton electron hits
777: rejection. To restore the lost hits an average helix for a given event is
778: reconstructed and tracker hits that match the average helix in a road
779: $\pm$1 straw are added to the list of valid tracker hits obtained above.
780: 
781: 
782: $\bullet$ Step 3 of the procedure is repeated for the extended list of tracker 
783: hits. On average the list of valid
784: hits is extended by 1 real and 2 background hits. 
785: 
786: 
787: $\bullet$ The union of all hits in valid five hit combinations 
788: in the extended list is
789: used to provide an input for an average helix.
790: A global fit is applied to reconstruct the helix parameters on the basis of
791: the list of the selected tracker hits and the parameters of the average helix are
792: considered as a starting point for the fit.  \\ 
793: 
794: In this section the pattern recognition procedure without drift time
795: based on the straw hit center approximation was developed.
796: In this approximation no ambiguity due to mirror hits arises 
797: and the total momentum is reconstructed  
798: with the standard deviation $\sigma $ = 0.45 MeV/c. 
799: The  selection procedure based on 3D space correlations between real
800: tracker hits inside the road significantly reduces background.
801: The overall background rejection factor is about 130
802: for the pattern recognition procedure without drift time.
803: 
804: 
805: 
806: \subsection*{Pattern recognition with drift time }
807: 
808: The second stage in the pattern recognition procedure uses the
809: fitted helix and drift time for hits selected at the first stage.
810: The reconstruction of helix parameters can be improved by taking
811: into account that in addition to a straw coordinate for
812: each hit a chamber gives the radius r calculated from the measured drift
813: time $t_{i}^{meas}$. The errors ($\sigma $) in radius measurements
814: were taken to be 0.2 mm. This radius r carries an ambiguity as to
815: whether the track passed left or right of the wire. The search of
816: two possible up and down hit positions lying on the circle of
817: radius r is based on the fitted helix obtained previously in the straw
818: hit center approximation. 
819: We can call these up and down points as true and mirror ones.
820: Up and down points are extracted (see
821: Figure ~\ref{fig:TTRS1}) from the intersections of a normal to the
822: helix through the straw center and the circle of the drift radius
823: r. In this case coordinates of up and down points are given by
824: 
825: \begin{equation}
826: x_{\prime} = x_{c} \mp r sin \beta ;
827: ~~~~~z = z_{c} \pm r cos \beta ;\\
828: ~~~~~tan\beta = -(R_{T}/R_{L})sin((z_c-z_{0}^{\prime})/R_{L})
829: \label{equpdown}
830: \end{equation}
831: 
832: \begin{figure}[htb!]
833: \centerline{\hbox{\psfig{figure=ttrs1.eps,height=2.2in,clip=on}}}
834:  \caption{
835:  Reconstruction of up and down points by the helix obtained in the
836:  straw hit center approximation.
837:  }
838: \label{fig:TTRS1}
839: \end{figure}
840: 
841: 
842: The uncertainties $\Delta s$ (see Figure ~\ref{fig:TTRS2}) 
843: in the determination of up and down point
844: positions are
845: small and can be evaluated in the following way.
846: The direction of 2D helix in the chamber coordinate system is given by
847: $ tan\beta $ (see Figure ~\ref{fig:TTRS2} and Eq.(\ref{equpdown})).
848: 
849: \begin{figure}[htb!]
850: \centerline{\hbox{\psfig{figure=ttrs2.eps,height=2.0in,clip=on}}}
851:  \caption{
852: Uncertainties in up and down point positions.
853:  }
854: \label{fig:TTRS2}
855: \end{figure}
856: 
857: Since the uncertainty in $R_{T}$ is the dominant one the uncertainty in the
858: direction of the helix is of the order 
859: $\Delta \beta /\beta \sim \Delta
860: R_{T}/R_{T} \sim \Delta
861: R/R \sim \Delta P/P$  where $\Delta P/P<0.01, \beta \sim 1$.
862: 
863: The uncertainty in the determination of up or down point
864: position \\
865: $\Delta s \sim \Delta \beta $ r  $ <
866: \beta (\Delta P/P)r_{tube} \sim 1 \times 10^{-2} \times 0.25$ cm $ \sim 25 \mu$m
867: which is much less than a measurement precision 200 $\mu $m.
868: 
869: To reject background hits remaining after pre-selection  stage and
870: to resolve the up - down ambiguity
871: the deterministic annealing filter (DAF) and the Kalman filter (KF) 
872: are applied.
873: 
874: 
875: In principle at this stage the Kalman filter (KF) approach \cite{kalman}
876: could be applied. However an application of the Kalman filter 
877: (see Appendix A) requires that the problem of assignment of hits to a track
878: has been entirely resolved by the preceding selection procedure. 
879: If this is not the case the filter has to run on every possible 
880: assignment choosing the best one according to the chi-square 
881: criterion. For the number of tracker hits greater than 15 this 
882: combinatorial search is computationally expensive and practically
883: unfeasible. Therefore as the last step of pattern recognition 
884: we will use the deterministic annealing filter (DAF) ~\cite{daf}. 
885: 
886: 
887: DAF is a Kalman filter with re-weighted observations (see Appendix B).
888: For the DAF procedure we introduce artificial layers placed at the chamber straw centers 
889: in order to have competing true and mirror points in one  layer. 
890: To overcome the problem of insufficient information in the initial
891: phase of the filter, an iterative procedure is applied. After a first
892: pass of filter and after smoothing, the track position can be predicted 
893: in every layer of the tracker. Based on these predictions, the assignment
894: probabilities for all competing hits can be calculated in every layer.
895: If the probability falls below a certain threshold, the hit is excluded 
896: from the following consideration. The assignment probabilities of
897: the remaining hits are normalized to one and used as the weights
898: in the next iterations of the filter.
899: 
900: In our case the operation of DAF is described by the following steps:\\
901: 
902: $\bullet$ every true and mirror  point is projected on a 
903: layer corresponding to the center of straw in the direction
904: defined by the fitted helix for a given event.\\
905: 
906: $\bullet$ initial probabilities of competing points
907: in the layer are assumed to be equal.\\
908: 
909: $\bullet$ annealing schedule is chosen according to the following
910: formula\\
911:   $V_{n}=V (\frac{50}{f^{n}}+1)$ for a variance of observations,
912: where V = $\sigma^{2}$ and $\sigma$ = 200 $\mu$m. 
913: The annealing factor f is chosen to be either 1.4 or 2.\\
914: 
915: $\bullet$ standard Kalman filter runs on all layers taking 
916: observations as weighted mean according to assignment probabilities.\\
917: 
918: $\bullet$ the filter runs in the opposite direction, using the
919: same weighted mean as the forward filter. By taking a weighted mean of
920: the predictions of both filters at every layer, a smoothed state vector
921: and its covariance matrix are obtained.\\ 
922: 
923: $\bullet$ based on these predictions and the  covariance matrix, the
924: assignment probabilities of the hits are calculated. If combined 
925: hit probability for true and mirror point  falls below a 
926: certain threshold ($10^{-7}$), the hit is rejected. The assignment probabilities of
927: the remaining points in the layer are normalized 
928: to one and used as the weights
929: in the next iterations of the filter.\\ 
930: 
931: $\bullet$ iterations in n stop if one of the following
932: conditions is satisfied:\\
933: 1)for the reconstructed track $\chi^2 > \chi^2_{max}$ 
934: where $\chi^2_{max}$ = 1000 or $\chi^2 < 0$.\\
935: 2)for the reconstructed initial momentum $P_{in} < 94 MeV/c$ or
936: $P_{in} > 114 MeV/c$\\
937: 3)variation of $\chi^2$ is small in comparison with the 
938: previous iteration $|\chi^2_{n+1}-\chi^2_{n}|/\chi^2_{n}<0.01$.\\   
939: 
940: $\bullet$ when iterations stop (on average after 7 iterations) 
941: The DAF procedure is repeated for the different annealing factor f.
942: From results corresponding to two annealing factors we choose 
943: that one corresponding to the minimum $\chi^2$.\\
944: 
945: Figure  ~\ref{fig:sel_5} shows the distribution of tracker hits for the sample event
946: (recall that initially we had 260 background hits for this event).  
947: There are no surviving background hits and missing real hits
948: in this case. Reconstructed lobes are clearly seen in the 
949: Figure. We conclude that DAF is effective in rejection
950: of background hits remaining after pre-selection procedure and 
951: also it provides a good starting point for the track reconstruction.      
952: 
953: 
954: \begin{figure}[htb!]
955: \centerline{\hbox{\psfig{figure=sel_5.eps,height=2.5in,clip=on}}}
956:  \caption{
957:  Plot of real + background tracker hits after the DAF selection procedure.
958:  }
959: \label{fig:sel_5}
960: \end{figure}
961: 
962: 
963: 
964: Figure  ~\ref{fig:Nback}(a) represents the number of real
965: hits lost by the pattern recognition procedure. Some of the real
966: tracker hits (0.8 hits $\sim 2.7\%$) are lost due to the selection
967: procedure.
968: 
969: Figure  ~\ref{fig:Nback}(b) represents the number of background
970: hits remaining after the selection procedure. As one
971: can see from Figure ~\ref{fig:Nback}(b) the number of 
972: background hits remaining is 0.38 hits in comparison with the
973: primary 300 hits. So a total background suppression factor is
974: 300/0.38 $\approx$ 800.
975: 
976: 
977: \begin{figure}[htb!]
978: \centerline{\hbox{\psfig{figure=Nmiss.eps,height=2.0in,clip=on}
979:                    \psfig{figure=Nback.eps,height=2.0in,clip=on}}}
980:   \caption{
981:   Distributions in the number of missing real hits and
982:   remaining background hits.
983:  }
984: \label{fig:Nback}
985: \end{figure}
986: 
987: \section{Reconstruction based on Kalman filter }
988: 
989: Background studies presented in this subsection are based on the
990: application of the Kalman filter to hits selected by previous pattern 
991: recognition procedure.
992: 
993: At this stage the reconstruction is based on the hits selected by the
994: pattern recognition procedure described above.
995: 
996: In principle we could use the results obtained by DAF for the track
997: reconstruction but our analysis shows that an application of
998: a combinatorial Kalman filter to hits selected by DAF provides better
999: precision. 
1000: 
1001: In the last decade the Kalman filter (KF) approach \cite{kalman}
1002: has been extensively exploited for track fitting in high energy
1003: physics. This approach possesses the following
1004: features for effective track fitting: \\
1005: \noindent
1006: $\bullet$ multiple scattering and energy losses are included in a natural way;\\
1007: $\bullet$ a 3D trajectory is restored that approximates closely the real one; \\
1008: $\bullet$ complex tracker geometries are handled in a simple way;\\
1009: $\bullet$ N$\times $N matrix inversion, where N is the total number of measurements,
1010: is avoided; \\
1011: $\bullet$ control for error propagation is provided; \\
1012: $\bullet$ trajectory is reconstructed progressively from one measurement to the next,
1013: improving the precision  with each step; \\
1014: $\bullet$ initial and final momenta of a particle crossing the
1015: tracker are reconstructed. \\
1016: 
1017: The KF is very useful because it simultaneously finds and fits 
1018: the track;
1019: it is much more economical than the
1020: conventional least-squares global fit. The KF is a ``progressive"
1021: step by step method whose predictions are rather poor at the
1022: beginning of the track at the first stage of filtering. Since a
1023: state vector ${\bf{x}_k}=(x,y,t_x,t_y,1/p_{L})$ at point k (see
1024: definitions in Appendix A) has five parameters we need
1025: approximately $\simeq$ 6-7 straw hits to get good KF prediction
1026: precision. The prediction step, in which an estimate is made for
1027: the next measurement from the current knowledge of the state
1028: vector, is very useful to discard noise signal and hits from other
1029: tracks. Assuming the validity of the helix track model for each
1030: step, the KF propagates the track in 3D space, from one 2D surface
1031: to the next.
1032: 
1033: Below we will use the standard notations:
1034: 
1035: $\bf{x}_{k+1}^{k}$ is a prediction, i.e. the estimation of the
1036: ``future" state vector at position ``k+1" using all the ``past"
1037: measurements up to and including ``k".
1038: 
1039: $\bf{x}_{k}^{k}$ is a filtered state vector, i.e. the
1040: estimation of the state vector at position ``k" based upon all
1041: ``past" and ``present" measurements up to and including ``k".
1042: 
1043: The same notations will be held for the covariance matrix
1044: $\bf{C}$, noise matrix $\bf{Q}$ and so on.
1045: 
1046: The Kalman filter algorithm can be divided into three major steps. \\
1047: 
1048: INITIALIZATION
1049: 
1050: The KF can start from arbitrary parameters and an infinite
1051: covariance matrix but for track finding applications it is
1052: significantly better to fix somehow the initial state if possible.
1053: 
1054: The initialization of forward and backward KF algorithms is quite
1055: simple. It starts from the artificial point and initial
1056: parameters calculated from the previous stage of reconstruction
1057: procedure. The initial covariance matrix is empirically found to be diagonal
1058: with matrix elements being much greater than the corresponding
1059: uncertainties: $\bf{C}_{0}^{0}$ = (0.3,0.3,0.03,0.03,0.000003).\\
1060: 
1061: FILTERING
1062: 
1063: Once the KF is initialized it makes standard consequent steps. The
1064: current state at the k-th step is defined by the state vector
1065: $\bf{x}_k$, the state covariance matrix $\bf{C}_{k}^{k}$
1066: and the current straw hit.
1067: To take the (k+1)-th step it is necessary to:\\
1068: 
1069: \noindent
1070: $\bullet$ update $\bf{x}_{k}^{k}, \bf{C}_{k}^{k}$ to take into account the ionization losses \\
1071: $\bullet$ define the next hit object (up or down point) \\
1072: $\bullet$ propagate the parameters and the covariance matrix
1073: to: $\bf{x}_{k}^{k} \rightarrow \bf{x}_{k+1}^k$, $\bf{C}_{k}^{k} \rightarrow \bf{C}_{k+1}^k$\\
1074: $\bullet$ update $\bf{C}_{k+1}^k$ to take into account
1075: multiple scattering $\bf{C}_{k+1}^k \rightarrow \bf{C}_{k+1}^k + \bf{Q}_k$\\
1076: $\bullet$ calculate the Kalman matrix $\bf{K}_{k+1}$ \\
1077: $\bullet$ update the covariance matrix $\bf{C}_{k+1}^{k+1}$ \\
1078: $\bullet$ calculate residuals $\bf{r}_{k+1}^{k+1} $ and their covariance matrices $\bf{R}_{k+1}^{k+1} $ \\
1079: $\bullet$ calculate the incremental $\chi^2 =
1080: (\bf{r}_{k+1}^{k+1})^T (\bf{R}_{k+1}^{k+1})^{-1} \bf{r}_{k+1}^{k+1}$ \\
1081: $\bullet$ store all information defining the new state\\
1082: 
1083: SMOOTHING
1084: 
1085: In the standard Kalman filter algorithm the smoothing is a well
1086: defined procedure. Smoothing allows one to obtain the best
1087: estimate of the track parameters at any trajectory point using all
1088: hit information accumulated during the KF propagation.
1089: 
1090: 
1091: Figure ~\ref{fig:forw} (a) displays the results of the KF
1092: forward filtering for the total momentum Ptot
1093: reconstruction at each tracker hit position. Figure
1094: ~\ref{fig:forw} (b) displays how the KF smoother, based on all
1095: hit information accumulated during the KF filtering, improves the
1096: total momentum Ptot reconstruction at each tracker hit
1097: position.
1098: 
1099: 
1100: \begin{figure}[htb!]
1101: \centerline{\hbox{%
1102: \psfig{figure=forw_p_f.eps,height=2.0in,clip=on}
1103:  \psfig{figure=forw_p_s.eps,height=2.0in,clip=on}}}
1104: \caption{
1105:  The total momentum reconstructed by the
1106:  forward Kalman filtering a) and by the smoothing b)
1107: for each hit of a selected event. The y-axis is Ptot, the
1108: x-axis is the z-ordered hit number.
1109:  }
1110: \label{fig:forw}
1111: \end{figure}
1112: 
1113: 
1114: Figure ~\ref{fig:back} (a) displays the results of the KF
1115: backward filtering for the total momentum Ptot 
1116: reconstruction at each tracker hit position. Figure
1117: ~\ref{fig:back} (b) displays how the KF smoother, based on all
1118: hit information accumulated during the KF filtering, improves the
1119: total momentum Ptot reconstruction at each tracker hit
1120: position.
1121: 
1122: \begin{figure}[htb!]
1123: \centerline{\hbox{%
1124: \psfig{figure=back_p_f.eps,height=2.0in,clip=on}
1125:  \psfig{figure=back_p_s.eps,height=2.0in,clip=on}}}
1126: \caption{ The total momentum reconstructed by the
1127:  backward Kalman filtering a) and by the smoothing b)
1128: for each hit of a selected event. The y-axis is Ptot, the
1129: x-axis is the z-ordered hit number.
1130:   }
1131: \label{fig:back}
1132: \end{figure}
1133: 
1134: 
1135: The Kalman filter approach is effective in the resolving 
1136: of up - down ambiguity. By
1137: applying the Kalman filter at each step up and down points are
1138: considered and the point providing the best $\chi^{2}$ for the
1139: trajectory is selected as the true point. This procedure is
1140: approximately linear in the number of tracker hits in comparison with
1141: the a combinatorial search, which is not feasible.
1142: 
1143: At this reconstruction stage due to left-right ambiguity for the
1144: straw drift chamber we have a set of true and mirror  points for
1145: each straw hit. As discussed above the reconstruction procedure
1146: for each tracker hit defines true and mirror  points and one of
1147: them is close to the real point with high precision $\simeq$
1148: 25 $\mu$m. 
1149: So we can formulate our goal as to find a true point
1150: combination for N straw hits in the presence of N mirror points.
1151: 
1152: The procedure to find the best approximation to the true point
1153: combination for N straw hits in the presence of N mirror  points
1154: for one track from a muon-electron conversion event
1155: is based on the following steps:\\
1156: 
1157: $\bullet$ chose the first eight straw hits and built $2^8$ (256)
1158: possible hit combinations corresponding to up and down points.
1159: The KF forward and backward procedures described above are applied
1160: to these combinations. Only those combinations which satisfy a
1161: rather loose $\chi^2$ cut, $\chi^2 <$ 30,
1162: are retained (typically about 10 out of the initial 256 combinations);\\
1163: 
1164: $\bullet$ make a loop for all retained combinations with fixed up
1165: and down points for the first eight straw hits. For the 9th and
1166: higher straw hit, the up and down point choices are take into
1167: account and the point  with minimal incremental $\chi^2$ for
1168: this point is selected for the further KF propagation step. If
1169: both incremental $\chi^2$ satisfy the cut $\chi^2 < $ 10 the
1170: second point is stored in the stack to make an iterative loop.
1171: A single combination of all possible steps defines a candidate
1172: track. At this stage on average 45 candidate tracks are stored
1173: in the stack per event; \\
1174: 
1175: 
1176: $\bullet$ make a loop for all combinations from the stack. For
1177: each new hit added to the hits restored from the stack again the
1178: two up and down point choices are taken into account and the
1179: point with the minimal incremental $\chi^2$ for this point
1180: is selected for the further KF propagation step; \\
1181: 
1182: 
1183: $\bullet$ select up and down track combinations with the minimal $\chi^2$ for the track; \\
1184: 
1185: $\bullet$ select a track satisfying the cut $\chi^2 < $ 70; \\
1186: 
1187: $\bullet$ select a track with the difference between the forward
1188: (Pin\_f) and backward  (Pin\_b)
1189:  reconstructed input momentum satisfying the cut $\vert Pin\_f - Pin\_b \vert <$ 0.7 MeV/c.  \\
1190: 
1191: 
1192: 
1193: 
1194:    The Kalman filter reconstructs a trajectory of a particle
1195: in three dimensions. The trajectory is bent
1196: each time it crosses a tracker plane due to multiple 
1197: scattering. Therefore, the reconstructed track is a set of
1198: helices that intersect at the  planes.
1199: This is the track followed by the particle. 
1200:    
1201:    Figure ~\ref{fig:ev2}  displays a 2D projection of 3D trajectory
1202: reconstructed by the Kalman filter for the sample event.
1203: As above in this figure all 12 views were joined in one 
1204: for all 18 tracker modules. The 2D trajectory is shown 
1205: only for sensitive area of the tracker and for each view 
1206: the trajectory is in a different color. 
1207: For the sample event real hits are in four 
1208: tracker's views and the reconstructed
1209: lobes for these views are clearly seen in the figure.
1210: 
1211:    Due to a scale in this figure the 2D trajectory looks
1212: as an ideal sine curve and tracker hits look like spots of
1213: different size. In order to see a detailed behavior of the
1214: trajectory and hit positions a dynamical zoom is applied 
1215: to a rectangular region indicated in  Figure ~\ref{fig:ev2} 
1216: in the x-range 49-50 cm and
1217: z-range 1029-1029.5 cm. 
1218:    
1219: Figure ~\ref{fig:ev21} demonstrates the magnified region of the tracker.
1220: Blue line in the figure represents the reconstructed trajectory.
1221: Two circles represent two hits in  chamber. 
1222: These circles look like ellipses due to different axis scales. 
1223: 
1224: 
1225: 
1226: \begin{figure}[htb!]
1227: \centerline{\hbox{\psfig{figure=ev2.eps,height=3.0in,clip=on}}}
1228:  \caption{
1229:  Plot of real + background tracker hits after the DAF selection procedure.
1230:  }
1231: \label{fig:ev2}
1232: \end{figure}
1233: 
1234: \begin{figure}[htb!]
1235: \centerline{\hbox{\psfig{figure=ev2_1.eps,height=2.5in,clip=on}}}
1236:  \caption{
1237:  Zoom enlargement of the region in Figure ~\ref{fig:ev2}.
1238:  }
1239: \label{fig:ev21}
1240: \end{figure}
1241: 
1242: 
1243: Radii of the circles are directly
1244: proportional to the drift times. 
1245: For illustrative 
1246: purposes two nearest to the straw center points corresponding to the real
1247: trajectory   obtained in Monte Carlo
1248: simulation are shown in the Figure in the form of diamonds
1249: ~ (note that in the pattern recognition procedure only 
1250: radii were used but not these points).
1251: The measurement uncertainty was taken into account assuming
1252: that circle radii are distributed normally about the simulated 
1253: radii with $\sigma = 200 \mu$m. For this reason the position 
1254: of one of the  nearest points is not on a circle in Figure ~\ref{fig:ev21}.
1255:   
1256: In Figure ~\ref{fig:ev21} due to the corresponding scale the trajectory 
1257: looks like a straight line. It is tangent to one of two circles
1258: obtained on the basis of drift time.  
1259: In the region under consideration the deviation of the 
1260: trajectory from the  nearest point is less than 0.2 mm. 
1261: The change in the direction of the trajectory due to 
1262: multiple scattering can not be seen in the Figure because of 
1263: the smallness of the average angle of the scattering.        
1264: 
1265: Figure ~\ref{fig:fig_3D3}
1266: shows transverse xy-projection of the 
1267: trajectory for the sample event. 
1268: In this projection the trajectory looks approximately 
1269: as a circle.
1270: 
1271: \begin{figure}[htb!]
1272: \centerline{\hbox{\psfig{figure=fig_3D3.eps,height=3.0in,clip=on}}}
1273:  \caption{
1274:  The transverse projection of the 3D trajectory reconstructed for the sample event.
1275:  }
1276: \label{fig:fig_3D3}
1277: \end{figure}
1278: 
1279: However if a specific region of the tracker is magnified 
1280: by the dynamical zoom one can see in Figure ~\ref{fig:fig_3D4} 
1281: that the shape of the
1282: circle is distorted due to multiple scattering and energy
1283: loss. More than two turns of trajectory are clearly 
1284: seen in the Figure.
1285: 
1286: 
1287: \begin{figure}[htb!]
1288: \centerline{\hbox{\psfig{figure=fig_3D4.eps,height=2.0in,clip=on}}}
1289:  \caption{
1290:  Zoom enlargement of the region in Figure ~\ref{fig:fig_3D3}.
1291:  }
1292: \label{fig:fig_3D4}
1293: \end{figure}
1294: 
1295: 
1296:  Note that by using the KF filter a momentum of a particle can be
1297:  reconstructed at any point of the tracker. For our purposes the most
1298:  important is the momentum of a particle entering the tracker,
1299: which in the following we will call  the input momentum.
1300: 
1301: Figure ~\ref{fig:fig_3D1}
1302: demonstrate 3D trajectory  reconstructed for the 
1303: sample event. 
1304: The trajectory looks in this scale as a helix, but we remind that it 
1305: consists of many helix parts.
1306: Also in the Figure tracker's hits 
1307: generated by Monte Carlo simulation program are shown. 
1308: 
1309: 
1310: \begin{figure}[htb!]
1311: \centerline{\hbox{\psfig{figure=fig_3D1.eps,height=4.5in,clip=on}}}
1312:  \caption{
1313:  3D trajectory reconstructed for the sample event.
1314:  }
1315: \label{fig:fig_3D1}
1316: \end{figure}
1317: 
1318: 
1319:  The distribution in the difference between the initial momentum (Pin\_f) 
1320: reconstructed by
1321:  the Kalman filter and the generated initial momentum (Pin) is shown in
1322: Figure ~\ref{fig:pin_difb} in linear (a) and logarithmic (b) scale.
1323: According to this distribution the intrinsic tracker resolution is
1324: $\sigma$ = 0.12 MeV/c if one fits the distribution by a Gaussian in the range
1325: -0.3 - 0.7 MeV/c.
1326: 
1327: 
1328: \begin{figure}[htb!]
1329: \centerline{\hbox{%
1330: \psfig{figure=dif_back_lin.eps,height=3.0in,clip=on}
1331:  \psfig{figure=dif_back_log.eps,height=3.0in,clip=on}}}
1332: \caption{
1333:  Distribution in the difference between the input reconstructed momentum based on
1334:  the Kalman filter and the simulated input momentum with background.
1335:  }
1336: \label{fig:pin_difb}
1337: \end{figure}
1338: 
1339: 
1340: \begin{figure}[htb!]
1341: \centerline{\hbox{%
1342: \psfig{figure=pin_back_lin.eps,height=3.0in,clip=on}
1343:  \psfig{figure=pin_back_log.eps,height=3.0in,clip=on}}}
1344: \caption{
1345:  Distribution in the input momentum reconstructed by the
1346:  Kalman filter  with background.
1347:  }
1348: \label{fig:pin_kalb}
1349: \end{figure}
1350: 
1351: 
1352: Figure ~\ref{fig:pin_kalb} shows a distribution in the input
1353: momentum (Pin\_f) reconstructed by the Kalman filter 
1354: in linear (a) and logarithmic (b) scale with background.  
1355: This distribution is characterized by the standard deviation $\sigma$ = 0.25 MeV/c of the
1356: reconstructed input momentum  for a Gaussian fit
1357: in the range 104-106 MeV/c. 
1358: 
1359: 
1360: Note that the trajectory
1361: reconstructed by the Kalman filter consists of
1362: many helix parts.  
1363: The reconstructed input momentum resolution by the  
1364: Kalman filter is $\sigma$ = 0.25 MeV/c. This resolution significantly
1365: better than the resolution $\sigma = 0.35$ MeV/c obtained 
1366: by a single helix fit.  
1367: 
1368: 
1369: The overall reconstruction acceptance is 22.1 $\%$ for muon conversion
1370: events with the momentum above a threshold momentum of 103.6
1371: MeV/c.
1372: 
1373: 
1374: Comparing these results with the results of the reconstruction
1375: without background we get the difference in tracker
1376: resolution 1.5 \% and the difference 2.7 \% in overall acceptance
1377: (see Appendix C).
1378: Therefore the tracker resolution and overall acceptance are not
1379: affected significantly at the considered background level 
1380: (explained in Section 3).
1381: 
1382: 
1383: 
1384: 
1385: A summary of the critical selection criteria used in the momentum
1386: reconstruction is shown in Table ~\ref{table:tab5}. The
1387: efficiencies are for the selection criteria applied in consecutive
1388: order. 
1389: 
1390: 
1391: An overall acceptance for muon conversion events with momentum
1392: above threshold momentum $P_{th}$ is 22.1\% .
1393: We define a threshold
1394: momentum,
1395: above which events are considered as the useful ones by
1396:  $P_{th} = P_{max} - \Delta $, where $P_{max}$ = 104.3 MeV/c
1397: the  most probable reconstructed momentum. If $\Delta $ = 0.7 MeV/c
1398: is chosen then $P_{th}$ = 103.6 MeV/c.
1399: 
1400: \begin{center}
1401: \begin{table}[htb!]
1402: \caption {A summary of the reconstruction selection criteria }
1403: \begin{tabular}{|l|c|}
1404: \hline
1405:  Selection criterion & Efficiency\\
1406: \hline
1407: Calorimeter energy above 80 MeV & 0.53\\
1408: Required pitch angle at the tracker & 0.86\\
1409: At least 15 hits in the tracker & 0.87\\
1410: Position match in the calorimeter & 0.95\\
1411: Requirements on fit quality & 0.79\\
1412: Detected energy above 103.6 MeV & 0.74\\
1413: \hline
1414: Overall acceptance & 0.22\\
1415: \hline
1416: \end{tabular}
1417: \label{table:tab5}
1418: \end{table}
1419: \end{center}
1420: 
1421: The main factors entering into the experimental sensitivity are
1422: the running time, the proton intensity, the probability per proton
1423: that a $\mu$ is produced, transported and stopped in the stopping
1424: target, the fraction of stopped muons that are captured (as opposed to decay),
1425: the trigger efficiency and the tracker reconstruction acceptance.
1426: We do not include in this table loss of events due to accidental
1427: cosmic ray vetoes, dead-time losses and losses due to straw
1428: chamber inefficiencies, all of which are expected to be small.
1429: According to our analysis taking into account the straw efficiency
1430: 97\% the overall acceptance is reduced from 22.1\% to 21.7\% . 
1431: 
1432: 
1433: 
1434: Table ~\ref{table:tab6} shows expected MECO sensitivity for a one
1435: year ($10^7$ s) run.
1436: 
1437: \begin{center}
1438: \begin{table}[htb!]
1439: \caption {A summary of the expected MECO sensitivity.}
1440: \begin{tabular}{|l|r|}
1441: \hline
1442: Running time (s)& $10^7$\\
1443: Proton flux (Hz)& $4 \cdot 10^{13}$\\
1444: \hline
1445: Probability of $\mu$/p transported and stopped in target & 0.0025\\
1446: $\mu$ capture probability & 0.6 \\
1447: Fraction of $\mu$ which are captured in time window & 0.49 \\
1448: Trigger efficiency and the selection criteria & 0.22 \\
1449: \hline
1450: Detected events for $R_{\mu e} = 10^{-16}$ & 6.5\\
1451: \hline
1452: \end{tabular}
1453: \label{table:tab6}
1454: \end{table}
1455: \end{center}
1456: 
1457: Muon DIO events are the most important background for the experiment.
1458: The main background from muon DIO events in the presence of background tracker hits
1459: was simulated and reconstructed.
1460: Based on the simulated DIO events
1461: in the momentum range
1462: above 100 MeV/c the track pattern recognition and momentum reconstruction
1463: were performed in the presence of background tracker hits
1464: (protons, neutrons, photons, DIO) by applying the  selection criteria
1465: discussed above.
1466: 
1467: 
1468: \begin{figure}[htb!]
1469: \centerline{\hbox{\psfig{figure=dio_gt_100.eps,height=3.5in,clip=on}}}
1470:   \caption{
1471:   Distributions of DIO reconstructed momentum in the presence of the background 
1472:  above 100 MeV/c .
1473:  }
1474: \label{fig:dio}
1475: \end{figure}
1476: 
1477: 
1478: The number of primary DIO events  simulated in the momentum range above 100 MeV/c
1479: was  10 times more
1480: than expected and 3 background events were found (see Figure ~\ref{fig:dio}).
1481: So the background is expected to be 0.3 events.
1482: 
1483: It is important to note that input reconstructed momenta for 
1484: these three background events are very close to  
1485: simulated input momenta. The input simulated 
1486: momenta 103.48, 103.6 and 103.74 MeV/c for these events have to be
1487: compared with input reconstructed momenta 103.6, 103.76 and 103.8 MeV/c, 
1488: respectively.
1489: 
1490: At the present level of pattern recognition and momentum
1491: reconstruction studies we can expect the background from DIO
1492: events in the range above 100 MeV/c $\sim $0.3 events compared 
1493: to 6.5 signal events  for $R_{\mu e} = 10^{-16}$.
1494: 
1495: 
1496: \section{Conclusion}
1497: 
1498: A study of the impact of background on the performance of the 
1499: transverse tracker proposed for the MECO experiment is presented.
1500: Background from capture protons, neutrons and photons, and from
1501: muon decay in orbit was generated using GEANT3. 
1502: The effective average straw tube rate from these sources was 800 kHz
1503: at the proposed muon beam intensity of $2\times 10^{11} \mu^{-}/sec$. 
1504:    
1505: A pattern recognition
1506: procedure based on a Kalman filter technique was developed to 
1507: suppress background and assign hits to tracks.
1508: In the first stage of this procedure, straw hit center 
1509: coordinates, without drift time information, were used to reduce 
1510: the background by a factor $\sim$ 130. In the second stage, the full 
1511: drift time information
1512: and a deterministic annealing filter were used to obtain an 
1513: additional six-fold suppression. The total suppression of 800 reduces
1514: the number of background hits on average from an initial value of
1515: 300 to approximately 0.4 per event. About 0.8 hits of the 29 real hits
1516: typically recorded, or 2.7 $\%$, are lost in the process.
1517: 
1518: It was found that in the presence
1519: of background the
1520: resolution of the tracker is $\sigma = 0.12$ MeV/c and the overall 
1521: setup acceptance for muon conversion
1522: events with momentum above the threshold momentum 103.6 MeV/c is 
1523: about 22 $\%$. 
1524: At the considered background level the tracker resolution and the 
1525: overall acceptance are not affected significantly by presence of 
1526: the background: the tracker resolution is changed by 1.5 \% and the 
1527: overall acceptance by 2.7 \%.
1528: 
1529: Additional constraints on the background, not considered here, may be
1530: imposed if the drift time measurement is supplemented by a 
1531: measurement of the pulse amplitude at the anode wire. A straightforward, 
1532: crude measurement of the amplitude is sufficient to reduce significantly
1533: the background from heavily ionizing particles, i.e., the capture
1534: protons that comprise 30 \% of the background hits in the above study. 
1535: We estimate too that a significant background suppression, of 10-20,
1536: would be achieved if resistive anode wires were used in place of 
1537: conducting wires to obtain a measurement of the hit position along
1538: the wire. This would improve the resolution as well. 
1539: 
1540: At the present level of pattern recognition and momentum
1541: reconstruction studies the background from DIO events above 100 MeV/c
1542: is about 0.3 events. This is to be compared 
1543: to 6.5 signal events  for $R_{\mu e} = 10^{-16}$.
1544: 
1545: The study carried out shows that the developed procedures of pattern 
1546: recognition and momentum reconstruction in the case of the transverse tracker 
1547: provide a required precision for lepton number violation search at a sensitivity
1548: level about $10^{-17}$.
1549: 
1550: We wish to thank A. Mincer, P.Nemethy, J.Sculli and one of us (R.K.) thanks
1551: W.Willis for fruitful discussions and helpful remarks.
1552: 
1553: \begin{references}
1554: \bibitem{kuno} J.D.Vergados, Phys.Rep. {\bf 133}, 1, (1986); 
1555: Y.Kuno and Y.Okada, Rev.Mod.Phys.{\bf 73}, 151 ,(2001).
1556: \bibitem{psi} P.Wintz, Proc. of the 1st Int.Symp. on Lepton and Baryon
1557: Number Violation, 1998, ed. by H.V.Klapdor-Kleingrothaus and 
1558: I.V.Krivosheina (Inst. of Physics Publishing, Bristol and Philadelphia), 534.
1559: \bibitem{psigold} A. van der Schaaf, J.Phys.G: Nucl.Part.Phys. {\bf 29}, 1503, (2003).   
1560: \bibitem{rashid} R.M.Djilkibaev and V.M.Lobashev, Sov.J.Nucl.Phys. {\bf 49}, 384, (1989).
1561: \bibitem{abadj} V.Abadjev {\it et al}, MELC proposal, INR preprint 786/92, (1992). 
1562: \bibitem{bnl} M.Bachman {\it et al}, a research proposal to BNL AGS, (1997);
1563: NSF RSVP proposal (MECO), (1999).
1564: \bibitem{prime} S.Machida {\it et al}, a letter of intent to the J-PARC 50-GeV Proton
1565: Synchrotron Experiment, The PRIME Working Group, (2003).
1566: \bibitem{daf} R. Fr\"uhwirth and A. Stradlie, Comp.Phys.Comm. {\bf 120}, 197, (1999).
1567: \bibitem{rkalm} R.E.Kalman, Transactions of the ASME: J.Basic Engineering, {\bf D82}, 
1568: 35, (1960).
1569: \bibitem{kalman} P.Billoir, Nucl.Instr.Meth. {\bf 225}, 352, (1984); R.Fr\"uhwirth, 
1570: {\it ibid}. {\bf A262}, 444, (1987); P.Billoir and S.Qian, {\it ibid}. {\bf A294}, 
1571: 219, (1990); E.J.Wolin and L.L.Ho, {\it ibid}. {\bf A329}, 493, (1993).    
1572: \bibitem{mank} R.Mankel, Hera B note (1997).
1573: \end{references}
1574: 
1575: \newpage
1576: 
1577: 
1578: 
1579: \section*{APPENDIX\ A. KALMAN FILTER AND ITS\ APPLICATION\ TO\ TRACK\ FITTING}
1580: 
1581: The Kalman filter is an algorithm that processes measurements to deduce an
1582: optimum estimate of the past, present, or future state of \ a dynamic system
1583: by using a time sequence of measurements of the system behavior, plus a
1584: statistical model that characterizes the system and measurements errors,
1585: plus initial condition information.
1586: 
1587: The Kalman filter addresses the general problem of trying to
1588: estimate at different points ($1\leq k\leq n$) the state
1589: $\bf{x}_{k}$ of a discrete process that is governed by the
1590: linear stochastic difference equation
1591: 
1592: \begin{equation}
1593: \bf{x}_{k}=\bf{F}_{k-1}\bf{x}_{k-1}+\bf{w}_{k-1}
1594: \label{1ap}
1595: \end{equation}
1596: 
1597: with a measurement $\bf{m}_{k}$ that is
1598: 
1599: \begin{equation}
1600: \bf{m}_{k}=\bf{H}\bf{x}_{k}+\bf{\varepsilon} _{k}.
1601: \label{2ap}
1602: \end{equation}
1603: 
1604: 
1605: 
1606: 
1607: The system equation (\ref{1ap}) is not deterministic since the
1608: track experiences stochastic processes such as multiple
1609: scattering, bremsstrahlung, etc. These processes are taken into
1610: account by the process noise $\bf{w}_{k}$ .
1611: $\bf{\varepsilon} _{k}$ represents the measurement noise.
1612: $\bf{w}_{k}$ and $\bf{\varepsilon} _{k}$ are assumed to be
1613: independent of each other with zero expectation values:
1614: 
1615: $E\{\bf{w}_{k}\}=0,\qquad
1616: cov\{\bf{w}_{k}\}=\bf{Q}_{k}$ , \qquad $1\leq k\leq n,$
1617: 
1618: $E\{\bf{\varepsilon} _{k}\}=0,\qquad cov\{\bf{\varepsilon} _{k}\}=\bf{V}_{k}$ , \qquad $%
1619: 1\leq k\leq n,$
1620: 
1621: where $\bf{Q}_{k}$ and $\bf{V}_{k}$  are process noise and
1622: measurement noise covariances, respectively.
1623: 
1624: 
1625: Eq.(\ref{1ap}) in the absence of the last term is the standard
1626: equation of motion with a propagator $\bf{F}_{k-1}$ (transport
1627: matrix). Note that at the moment $\bf{F}_{k-1}$ is assumed to
1628: be constant.
1629: 
1630: Regarding a track in space as a dynamic system the filtering
1631: technique is applied to the track fitting. For example, in the
1632: case of a particle moving in magnetic field this can be done
1633: naturally by
1634: identifying the state vector $\bf{x}_{k}$ of the dynamic system with a vector $%
1635: \bf{x}_{k}=(x,y,\tan \theta _{x},\tan \theta _{y},1/p_{L})$ of
1636: 5 parameters uniquely describing the track at each point of the
1637: trajectory. The $\bf{F}$ matrix propagates the state vector on
1638: one plane to the state vector on the next plane combining position
1639: information with directional information. The transport matrix
1640: implicitly contains information about a gap between planes.
1641: 
1642: In general the set of parameters  $\bf{x}_{k}$ is not measured
1643: directly; only a function of $\bf{x}_{k}$ ,
1644: $\bf{H}\bf{x}_{k}$ is observed. For example, in the case
1645: of the transverse tracker one does not measure $\bf{x}_{k}$
1646: but $x\prime =x\cos \alpha +y\sin \alpha $ in the chamber
1647: coordinate system which corresponds to
1648: 
1649: \begin{equation}
1650: \bf{H}=(\cos\alpha ,\sin \alpha ,0,0,0)  \label{2ap1}
1651: \end{equation}
1652: 
1653: 
1654: 
1655: There are three types of operations to be performed in the analysis of a
1656: track.
1657: 
1658: \begin{itemize}
1659: \item \textbf{Prediction} is the estimation of the ``future" state
1660: vector at
1661: position ``k" using all the ``past" measurements up to and including ``k-1". $%
1662: \bf{x}_{k}^{k-1}$ is a prediction (a priori state estimation).
1663: 
1664: \item \textbf{Filtering }is the estimation of the state vector at
1665: position ``k" based upon all  ``past" and ``present" measurements
1666: up to and including ``k". $\bf{x}_{k}^{k}$ is a filtered state
1667: vector (a posteriori state estimation).
1668: 
1669: \item \textbf{Smoothing }is the estimation of the ``past" state
1670: vector at
1671: position ``k" based on all ``n" measurements taken up to the present time. $%
1672: \bf{x}_{k}^{n}$ is a smoothed state vector.
1673: \end{itemize}
1674: 
1675: The first step to estimate $\bf{x}_{k}$ is \textbf{the
1676: prediction} (time update):
1677: 
1678: \begin{equation}
1679: \bf{x}_{k}^{k-1}=\bf{F}_{k-1}\bf{x}_{k-1}^{k-1}
1680: \label{3ap}
1681: \end{equation}
1682: 
1683: \begin{equation}
1684: \bf{C}_{k}^{k-1}=\bf{F}_{k-1}\bf{C}_{k-1}^{k-1}\bf{F}_{k-1}^{T}+\bf{Q}_{k-1}
1685: \label{4ap}
1686: \end{equation}
1687: 
1688: where Eq.(\ref{3ap}) projects the state ahead and  Eq.(\ref{4ap}) projects
1689: the error covariance ahead.
1690: 
1691: \textbf{The filtered estimate} (measurement update)
1692: $\bf{x}_{k}^{k}$ is calculated as a weighted mean of the
1693: prediction and the observation:
1694: 
1695: \begin{equation}
1696: \bf{K}_{k}=\bf{C}_{k}^{k-1}\bf{H}_{k}^{T}[\bf{H}_{k}\bf{C}_{k}^
1697: {k-1}\bf{H}_{k}^{T}+\bf{V}_{k}]^{-1}
1698: \label{5ap}
1699: \end{equation}
1700: 
1701: \begin{equation}
1702: \bf{x}_{k}^{k}=\bf{x}_{k}^{k-1}+\bf{K}_{k}[\bf{m}_{k}-\bf{H}_{k}\bf{x}_{k}^{k-1}]
1703: \label{6ap}
1704: \end{equation}
1705: 
1706: \begin{equation}
1707: \bf{C}_{k}^{k}=[\bf{I}-\bf{K}_{k}\bf{H}_{k}]\bf{C}_{k}^{k-1}.
1708: \label{7ap}
1709: \end{equation}
1710: 
1711: Eq.(\ref{5ap}) computes the Kalman gain matrix defining the
1712: correction to the predicted state due to the current observation.
1713: Eq.(\ref{6ap}) updates the prediction with the measurement and
1714: Eq.(\ref{7ap}) updates the error covariance. The error covariance
1715: may be also expressed in a computationally superior form
1716: 
1717: \begin{equation}
1718: \bf{C}_{k}^{k}=[\bf{I}-\bf{K}_{k}\bf{H}_{k}]
1719: \bf{C}_{k}^{k-1}[\bf{I}-\bf{K}_{k}\bf{H}_{k}]^{T}+\bf{K}_{k}\bf{V}_{k}\bf{K}_{k}^{T}.
1720: \end{equation}
1721: 
1722: The filtering is a recursive operation. The prediction step and
1723: the filtering step are repeated for the next plane proceeding
1724: progressively from plane ``1" to plane ``n". The state vector at
1725: the last filtered point contains always the full information from
1726: all points.
1727: 
1728: At each step one can calculate the filtered residuals
1729: $\bf{r}_{k}^{k}$ , the covariance matrix of the filtered
1730: residuals $\bf{R}_{k}^{k}$\ and the filtered $\chi ^{2}$:
1731: 
1732: \[
1733: \bf{r}_{k}^{k}=\bf{m}_{k}-\bf{H}_{k}\bf{x}_{k}^{k}
1734: \]
1735: 
1736: \[
1737: \bf{R}_{k}^{k}=\bf{V}_{k}-\bf{H}_{k}\bf{C}_{k}^{k}\bf{H}_{k}^{T}
1738: \]
1739: 
1740: \[
1741: \chi
1742: _{k}^{2}=\bf{r}_{k}^{kT}(\bf{R}_{k}^{k})^{-1}\bf{r}_{k}^{k}
1743: \]
1744: 
1745: where $\chi _{k}^{2}$ is $\chi ^{2}$ - distributed with
1746: dim($\bf{m}_{k}$) degrees
1747: of freedom. The total $\chi ^{2}$ of the track is given by the sum of the $%
1748: \chi _{k}^{2}$ contributions for each plane.
1749: 
1750: The system of equations defining the Kalman filter represents an
1751: asymptotically stable system, and therefore, the estimate of the state
1752: vector $\bf{x}_{k}^{k}$ becomes independent on the starting point $\bf{x}_{0}^{0}$ , $%
1753: \bf{C}_{0}^{0}$ as k is increased.
1754: 
1755: When the last plane (nth) is taken into account the Kalman filter
1756: performs the final step which is a smoothing. The filter runs
1757: backward in time updating all filtered state vectors on the basis
1758: of information from all n planes. The equations describing
1759: \textbf{the smoothing} are given by
1760: 
1761: \[
1762: \bf{A}_{k}=\bf{C}_{k}^{k}\bf{F}_{k}^{T}(\bf{C}_{k+1}^{k})^{-1}
1763: \]
1764: 
1765: \[
1766: \bf{x}_{k}^{n}=\bf{x}_{k}^{k}+\bf{A}_{k}(\bf{x}_{k+1}^{n}-\bf{x}_{k+1}^{k})
1767: \]
1768: 
1769: \[
1770: \bf{C}_{k}^{n}=\bf{C}_{k}^{k}+\bf{A}_{k}(\bf{C}_{k+1}^{n}-\bf{C}_{k+1}^{k})\bf{A}_{k}^{T}
1771: \]
1772: 
1773: \[
1774: \bf{r}_{k}^{n}=\bf{m}_{k}-\bf{H}_{k}\bf{x}_{k}^{n}
1775: \]
1776: 
1777: \[
1778: \bf{R}_{k}^{n}=\bf{V}_{k}-\bf{H}_{k}\bf{C}_{k}^{n}\bf{H}_{k}^{T}
1779: \]
1780: 
1781:    Until now it was assumed that the problem of estimation of a discrete-time
1782: process is described by a linear stochastic differential equation.
1783: However for example in the presence of a magnetic field the track
1784: propagator $\bf{F}$ is non-linear.
1785:    Let's assume that the process of a particle propagation is governed by the
1786: non-linear stochastic differential equation
1787: 
1788: \begin{equation}
1789: x_{k}=f(x_{k-1})+w_{k-1}  \label{8ap}
1790: \end{equation}
1791: 
1792: with a measurement m in the form Eq.(\ref{2ap}). f is a non-linear
1793: function. The Kalman filter can be applied to this system by
1794: linearizing the system for example about the estimated trajectory.
1795: If deviations between the estimated trajectory and the actual
1796: trajectory remain sufficiently small the linear approximation is
1797: valid.
1798:    The non-linear equation (\ref{8ap}) can be written down in the linearized form
1799: as
1800: 
1801: \begin{equation}
1802: \bf{x}_{k}=\bf{f}(\bf{x}_{k-1}^{k-1})+\bf{F}
1803: \cdot(\bf{x}_{k-1}-\bf{x}_{k-1}^{k-1})+\bf{w}_{k-1}\label{10ap}
1804: \end{equation}
1805: 
1806: where as before $\bf{x}_{k}$,$\bf{m}_{k}$ are the actual
1807: state and measurement vectors, $\bf{x}_{k}^{k}$ is a filtered
1808: estimate of the state at step k. $\bf{F}$ is Jacobian matrix
1809: 
1810: \begin{equation}
1811: \bf{F}_{ij}=\partial
1812: \bf{f}_{i}(\bf{x}_{k-1}^{k-1})/\partial x_{j} \label{11ap}
1813: \end{equation}
1814: 
1815: Therefore the complete set of extended Kalman filter equations is
1816: given by Eqs.(\ref{4ap})-(\ref{7ap}),(\ref{10ap}) by using F in
1817: the form (\ref{11ap}).
1818: 
1819: In order to apply the extended Kalman filter to a track fitting
1820: for a particle moving in uniform magnetic field (the magnetic
1821: field is in z direction) one has to choose the state vector
1822: parameters, define the initial state vector and calculate the
1823: transport matrix $\bf{F}$, the projection matrix $\bf{H}$, and
1824: the noise matrix $\bf{Q}$.
1825:    As it was mentioned above in this case the state vector can be chosen
1826: in the form ${\bf{x}_{k}}=(x,y,t_{x},t_{y},1/p_{L})$ where x, y
1827: are the track coordinates in the tracker system,
1828: $t_{x}=p_{x}/p_{L}$, $t_{y}=p_{y}/p_{L}$ define the track
1829: direction.
1830:    The projection matrix H is given by Eq.(\ref{2ap1}).
1831:    Due to multiple scattering the absolute value of electron momentum
1832: remains unaffected, while the direction is changed. This
1833: deflection can be described using two orthogonal scattering
1834: angles, which are also orthogonal to the particle momentum ~\cite{mank}.
1835:  In terms of these variables the noise
1836: matrix is given by
1837: 
1838: %\begin{equation}
1839: $$
1840: \bf{Q}_{k}=<\Theta ^{2}>(t_{x}^{2}+t_{y}^{2}+1) \left.\left(
1841: \begin{array}{ccccc}
1842: 0 & 0 & 0 & 0 & 0 \\
1843: 0 & 0 & 0 & 0 & 0 \\
1844: 0 & 0 & t_{x}^{2}+1 & t_{x}t_{y} & t_{x}/p_{L} \\
1845: 0 & 0 & t_{x}t_{y} & t_{y}^{2}+1 & t_{y}/p_{L} \\
1846: 0 & 0 & t_{x}/p_{L} & t_{y}/p_{L} & \frac{(t_{x}^{2}+t_{y}^{2})}{
1847: p_{L}^{2}(t_{x}^{2}+t_{y}^{2}+1)}
1848: \end{array}
1849: \right.\right)$$
1850: %\end{equation}
1851: 
1852:    For the variance of the multiple scattering angle the well-known
1853: expression is used
1854: 
1855: \begin{equation}
1856: <\Theta^{2}>=(13.6MeV/p)^{2}[1+0.038\ln(t/X_{R})]t/X_{R}
1857: \end{equation}
1858: 
1859: where $X_{R}$ is a radiation length, t is a distance traveled by
1860: the particle inside a scatterer.
1861:    Energy losses are taken into account by
1862: 
1863: \begin{equation}
1864: p\prime = p - <dE/dx>t.
1865: \end{equation}
1866: 
1867: \qquad
1868: 
1869: 
1870: \section*{APPENDIX\ B. DETERMINISTIC ANNEALING FILTER}
1871: 
1872: Track reconstruction in modern high energy physics 
1873: experiments faces a significant
1874: amount of noise hits in a detector. The track fit thus is 
1875: confronted with several competing hits in detector's layers.
1876: The Kalman filter (see Appendix A) is 
1877: now widely used
1878: for the reconstruction of the track parameters in high energy
1879: physics. However the 
1880: application of the Kalman filter requires that the problem 
1881: of assignment of the detector hits to track candidates has been
1882: entirely resolved by the preceding selection procedure. 
1883: If this is not the case, the filter has to run on every 
1884: possible assignment to select the best one by chi-square
1885: criterion. Obviously this approach is computationally expensive
1886: and practically unfeasible for a considerable amount of noise
1887: hits. For this reason the Deterministic Annealing Filter (DAF)
1888: was developed ~\cite{daf}. In DAF there is an additional 
1889: validation feature eliminating hits which are not compatible
1890: with the predicted track position.
1891: 
1892: The deterministic annealing filter itself is a Kalman filter
1893: with re-weighted observations. The propagation part of DAF is
1894: identical to the standard Kalman filter. 
1895: 
1896: \textbf{The filtered estimate} (measurement update)
1897: $\bf{x}_{k}^{k}$ at layer k is calculated as a weighted mean of the
1898: prediction $\bf{x}_{k}^{k-1}$ and the observations 
1899: {$\bf m_{k}^{i}, i=1,2,...n_{k}$}:
1900: 
1901: \begin{equation}
1902: \bf x_{k}^{k}=\bf x_{k}^{k-1}+\bf K_{k}\sum _{i=1}^{{n}_{k}} 
1903: p_{k}^{i}[\bf{m}_{k}^{i}-\bf{H}_{k}\bf{x}_{k}^{k-1}]
1904: \label{1bp}
1905: \end{equation}
1906: 
1907: where ${\bf p_{k}^{i}}$ is the assignment probability of observation $\bf m_{k}^{i}$.
1908: $\bf{K}_{k}$ is the Kalman gain matrix which is given by  
1909: 
1910: \begin{equation}
1911: \bf{K}_{k}=[[\bf{C}_{k}^{k-1}]^{-1}+p_{k}\bf{H}_{k}^{T}\bf{V}_{k}^{-1}\bf{H}_{k}]^{-1}
1912: \bf{H}_{k}^{T}\bf{V}_{k}^{-1}
1913: \label{2bp}
1914: \end{equation}
1915: 
1916: where ${\bf p_{k}}$ is the sum over all weights ${\bf p_{k}^{i}}$, $\bf{H}_{k}$ is the 
1917: measurement matrix, $\bf{V}_{k}$ is the variance of the observations.
1918: 
1919: The covariance matrix $\bf{C}_{k}^{k}$ of the updated estimate
1920: $\bf{x}_{k}^{k}$ is written as 
1921: 
1922: \begin{equation}
1923: \bf{C}_{k}^{k}=[[\bf{C}_{k}^{k-1}]^{-1}+p_{k}\bf{H}_{k}^{T}\bf{V}_{k}^{-1}\bf{H}_{k}]^{-1}.
1924: \label{3bp}
1925: \end{equation}
1926:   
1927: After completion of the forward filter a backward filter runs
1928: in opposite direction, using the same weights as the forward
1929: filter. By taking a weighted mean of the filtered states of 
1930: both filters at every layer a prediction for the state vector 
1931: $\bf x_{k}^{n*}$ along with its covariance matrix $\bf {C}_{k}^{n*}$ 
1932: is obtained, using all hits except the ones at layer k. (The asterisk
1933: indicates that the information from layer k is not used in 
1934: this prediction.) Initially all assignment probabilities for the
1935: hits in each layer 
1936: are set
1937: to be equal but based on the estimated state vector 
1938: $\bf x_{k}^{n*}$ and its covariance 
1939: matrix, the assignment probabilities of all competing hits are then 
1940: recalculated in the following way:
1941: 
1942: \begin{equation}
1943: {\bf {p}_{k}^{i}} \sim \varphi(\bf{m}_{k}^{i};
1944: \bf{H}_{k}\bf x_{k}^{n*}, \bf{V}_{k}+\bf{H}_{k}\bf{C}_{k}^{n*}\bf{H}_{k}^{T})
1945: \label{4bp}
1946: \end{equation}
1947: 
1948: where $\varphi(\bf x ; {\bf \mu},\bf V$) is a multivariate Gaussian 
1949: probability density with mean vector ${\bf \mu}$ and covariance
1950: matrix $\bf V$. 
1951: 
1952: If the probability falls below a certain threshold, the hit is 
1953: considered as the false one and is excluded from the list of the
1954: hits assigned to the track. 
1955: 
1956: However at this step we cannot be sure in calculated probabilities
1957: especially in the initial phase due to insufficient information for
1958: the filter. This problem is overcome by adopting a simulated annealing     
1959: iterative procedure. This is an additional feature of DAF.
1960: 
1961: The simulated annealing optimization algorithm is based on an
1962: analogy between the behavior of a material heated past its
1963: melting point that is slowly cooled (annealed) to form a single
1964: crystal. If the cooling proceeds slowly enough, the crystalline 
1965: state reached at zero temperature will have all the atoms fixed 
1966: in a perfect lattice structure, corresponding to the lowest 
1967: possible energy of the system (global minimum).
1968: 
1969: In the same way in track fitting the simulated annealing allows
1970: to avoid a local minimum and find the global one corresponding to
1971: the minimum chi-square for the track.
1972: 
1973: DAF annealing algorithm can be described in the following way.
1974: The annealing schedule is chosen for example in the form
1975: ${\bf V_{N}}={\bf V}(A/f^{N}+1)$ where the annealing factor 
1976: $f>1$ and factor $A>>1$. This provides that the initial 
1977: variance  is well above the nominal value 
1978: ${\bf V}$ of the observation error but the final one tends 
1979: to ${\bf V}$. After each iteration the assignment 
1980: probabilities exceeding the threshold are normalized to 1
1981: and used again as weights in the next iteration, and so on.
1982: The iterations generally are stopped if the relative change in 
1983: chi-square is less than correspondent control parameter
1984: (typically of the order 0.01). 
1985: 
1986: Since we deal with the stochastic process the best result
1987: can be reached repeating the DAF procedure for a few different
1988: annealing factors f and then choosing the result 
1989: corresponding to the minimum chi-square. 
1990: 
1991: 
1992:    
1993: \newpage
1994: 
1995: 
1996: \section*{APPENDIX\ C. Tracker Resolution}
1997: 
1998: This appendix demonstrates the results of application of the pattern
1999: recognition and reconstruction procedure for the conversion events 
2000: without background.
2001: 
2002: 
2003:  The distribution in the difference between the input reconstructed momentum (Pin\_f) based on
2004:  the Kalman filter and the simulated input momentum (Pin) is shown in
2005: Figure ~\ref{fig:pin_dif} in linear (a) and logarithmic (b) scale.
2006: According to this distribution the intrinsic tracker resolution is
2007: $\sigma$ = 0.12 MeV if one fits the distribution by a Gaussian in the range
2008: -0.3 - 0.7 MeV.
2009: 
2010: 
2011: \begin{figure}[htb!]
2012: \centerline{\hbox{%
2013: \psfig{figure=dif_lin.eps,height=3.0in,clip=on}
2014:  \psfig{figure=dif_log.eps,height=3.0in,clip=on}}}
2015: \caption{
2016:  Distribution in the difference between the input reconstructed momentum based on
2017:  the Kalman filter and the simulated input momentum without background.
2018:  }
2019: \label{fig:pin_dif}
2020: \end{figure}
2021: 
2022: 
2023: \begin{figure}[htb!]
2024: \centerline{\hbox{%
2025: \psfig{figure=pin_real_lin.eps,height=3.0in,clip=on}
2026:  \psfig{figure=pin_real_log.eps,height=3.0in,clip=on}}}
2027: \caption{
2028:  Distribution in the input momentum reconstructed by the
2029:  Kalman filter  without background.
2030:  }
2031: \label{fig:pin_kal}
2032: \end{figure}
2033: 
2034: 
2035: Figure ~\ref{fig:pin_kal} shows a distribution in the input
2036: momentum (Pin\_f) reconstructed by the Kalman filter 
2037: in linear (a) and logarithmic (b) scale without background.  
2038: This distribution is characterized by the standard deviation $\sigma$ = 0.25 MeV of the
2039: reconstructed input momentum  for a Gaussian fit
2040: in the range 104-106 MeV. 
2041: The overall reconstruction acceptance is 22.7 $\%$ for muon conversion
2042: events with the momentum above a threshold momentum of 103.6
2043: MeV/c.
2044: 
2045: Comparing these results with the results of the reconstruction
2046: in the presence of the background we get the difference in tracker
2047: resolution 1.5 \% and the difference 2.7 \% in overall acceptance.
2048: Therefore the tracker resolution and overall acceptance are not
2049: affected significantly at the considered background level.
2050: 
2051: \end{document}
2052: