1: \documentstyle[psfig,12pt]{aipproc}
2: \pagestyle{plain}
3: \setlength{\textheight}{9.7in}
4:
5: \tolerance = 10000
6:
7: \newcommand{\grad}{$^{\circ}$}
8: \newcommand{\vsp}{\vspace{0.125in}}
9: \newcommand{\hsp}{\hspace{0.15in}}
10: \newcommand{\gcm}{~g/cm$^3$~}
11: \newcommand{\gcms}{~g/cm$^2$~}
12:
13: \newcommand{\muea}{\mbox{\sl $\mu^{-} A \rightarrow e^{-}$ A }}
14: \newcommand{\apr}{$\simeq$~}
15:
16: %\pagenumbering{arabic}
17:
18: \begin{document}
19:
20: \title{ KALMAN FILTER BASED TRACKER STUDY FOR $\mu e$ - CONVERSION EXPERIMENT }
21:
22:
23: \author{Rashid M. Djilkibaev \thanks{Permanent address: Institute for Nuclear
24: Research, 60-th Oct. pr. 7a, Moscow 117312, Russia}$^{,2}$, Rostislav V. Konoplich$^{2,3}$}
25: \address{$^{2}$Department of Physics,
26: New York University,
27: New York, NY 10003 \\
28: $^{3}$Manhattan College, Riverdale, New York, NY, 10471}
29: \maketitle
30:
31:
32: \begin{abstract}
33: The search for muon to electron conversion
34: with a sensitivity of the order $10^{-17}$ requires a several order of magnitude
35: increase in muon intensity and a high resolution,
36: $\sigma \simeq$ 0.1 MeV/c of the electron's momentum.
37:
38: We present results of a pattern recognition and track momentum
39: reconstruction algorithm that relies on a Kalman filter approach.
40: Background from captured protons, neutrons,
41: photons and from muon decay in orbit were generated by GEANT.
42: The effective average
43: straw tube background rate was 800 kHz.
44:
45: The pattern recognition proceeds in two stages.
46: In the first, simple considerations using only
47: straw tube center coordinates, without drift time information, were
48: applied to reduce the background to a manageable level.
49: Then the drift time information is incorporated and a Deterministic Annealing Filter
50: applied to reach the final level of background suppression and
51: to provide a starting
52: point for the track momentum reconstruction using the Kalman filter.
53: This procedure reduces the simulated background by a factor 800
54: with small,(2.7\%), losses in real tracker hits.
55:
56: The momentum resolution of the tracker is $\sigma $ = 0.12 MeV/c
57: and the acceptance for muon conversion events with momentum
58: above 103.6 MeV/c is 22\%. These numbers do not differ significantly
59: from the values obtained without background.
60:
61: The expected number of events from muon decay in orbit (main background)
62: in which the
63: decay electron has momentum greater than 100 MeV/c is 0.3, compared
64: to 6.5 $\mu e$ - conversion events above the same threshold
65: for $R_{\mu e} = 10^{-16}$.
66:
67: \end{abstract}
68:
69:
70: \section{Introduction}
71:
72: The observation of $\mu e$ - conversion would provide the first
73: direct evidence for lepton flavor violation in charged lepton sector
74: and require new physics,
75: beyond the Standard Model (see ~\cite{kuno} and references therein).
76: Lepton flavor is not conserved in neutrino oscillations but the
77: modifications to the Standard Model to include the small neutrino
78: masses do not lead to an appreciable rate for $\mu e$ - conversion
79:
80:
81: \begin{center}
82: $\mu^{-} + N \to e^{-} + N$ .
83: \end{center}
84:
85: This process violates the lepton flavor numbers, $L_{e}$ and $L_{\mu}$,
86: but conserves the total lepton number. The signature of the process is very
87: clear: a single monochromatic electron in the final state with the energy
88: close to the muon mass:
89:
90: \begin{center}
91: $E_{e} = m_{\mu} - B_{\mu} - E_{rec}$
92: \end{center}
93: where $m_{\mu}$ is a muon mass, $ B_{\mu}$ is a binding energy of the
94: 1s muonic atom, $E_{rec}$ is a nuclear recoil energy.
95:
96: The SINDRUM II collaboration at PSI has carried out a program of experiments to
97: search for $\mu e$ - conversion in various nuclei. They find that at
98: 90\% CL the upper limit for the reaction $\mu^{-} + Ti \to e^{-} + Ti$
99: is $6.1\times 10^{-13}$ ~\cite{psi}. In this experiment muons were accumulated
100: at a rate of $10^{7} \mu^{-}/sec$. According to preliminary results
101: ~\cite{psigold} for the reaction $\mu^{-} + Au \to e^{-} + Au$
102: a single event sensitivity is $3.3\times 10^{-13}$
103:
104: In ~\cite{rashid} an idea of increasing of muon beam intensity
105: by a few orders of magnitude up to $10^{11} \mu^{-}/sec$ based on the
106: solenoid-capture scheme was discussed and MELC experiment was proposed
107: ~\cite{abadj} with a goal to reach a sensitivity of the order $10^{-17}$.
108:
109: A new $\mu e$ - conversion experiment MECO (Muon Electron COnversion) E-940
110: ~\cite{bnl}, exploiting the idea of the solenoid-capture scheme,
111: is under preparation at BNL. MECO aims to search for
112: $\mu^{-} + Al \to e^{-} + Al$ with a single event sensitivity 2$\times$$10^{-17}$. It will
113: use a new high-intensity pulsed muon beam, which could yield about
114: $10^{11} \mu^{-}/sec$ stopped in a target.
115:
116: Also the PRIME (PRISM MuE conversion) working group at KEK expressed an
117: interest ~\cite{prime} to carry out a search for lepton flavor violation
118: in $\mu e$ - conversion in a muonic atom at a sensitivity of
119: $10^{-18}$ using a proposed high intensity pure muon source of
120: $10^{11} - 10^{12} \mu^{-}/sec$ .
121:
122: In this article we develop a pattern recognition and track reconstruction
123: procedure
124: based on the Kalman filter technique for a transverse version of tracker for MECO
125: experiment.
126:
127: This paper is organized as follows: In section 2 the
128: transverse tracker is described and the advantages of the tracker discussed.
129: In Section 3 possible backgrounds are discussed and the
130: procedure of background simulation briefly explained.
131: In Section 4 the pattern recognition
132: procedure is developed.
133: At this stage a deterministic annealing
134: filter (DAF) is applied to make a final background
135: suppression and provide a starting
136: point for track momentum reconstruction by the Kalman
137: filter technique. Section 5 describes the procedure of momentum
138: reconstruction based on the Kalman filter. Results of the pattern
139: recognition and momentum reconstruction are presented in this
140: section. A brief summary outlook can be found in
141: Section 6. The appendices provide a more detailed look at Kalman filter and deterministic
142: annealing filter. Also tracker resolution is discussed in Appendix.
143:
144: \section{The Transverse Tracker Description}
145:
146: The goal of the MECO tracker is to detect the electron from $\mu e$ - conversion
147: with large acceptance and measure its momentum
148: with high resolution ($\sigma \simeq$ 0.1 MeV/c).
149: The tracker is located in a uniform 1T magnetic field.
150: The minimal tracker length is defined by the following
151: requirements:
152:
153: Background electrons with an energy around 105 MeV are produced
154: by cosmic rays in the wall between the transport and detector
155: solenoids. These electrons cannot have a pitch angle $\theta$ in
156: the tracker greater than $45^{o}$, due to the adiabatic character
157: of charge particle movement. To suppress the cosmic ray
158: background we require that measured pitch angle $\theta_{min}$ for
159: signal events to be more than $45^{o}$.
160:
161: To get more redundancy for signal events we require that the measured
162: trajectory should have two full turns. This requirement sets
163: limits on the minimal tracker length, which is
164: expressed as:
165:
166: \begin{center}
167: $ L_{Tracker}^{min} = 4 \cdot \pi \cdot 1/2.998 \cdot P \cdot cos(\theta_{min})/B \simeq
168: 4.19 \cdot 105 \cdot 0.707$/1.0 $\simeq$ 310 cm
169: \end{center}
170: where P is a muon conversion momentum (MeV/c) and B is a magnetic
171: field (1 Tesla) in the tracker region.
172:
173: The tracker consists of 18 modules spaced 17 cm apart. A module
174: consists of 6 planes, each turned at 30$^{o}$ and shifted 2.5 cm
175: relative to the previous one (Figure ~\ref{fig:setup}). A plane
176: consists of two trapezoidal chambers of width 30 cm and lengths
177: 70 to 130 cm in an up and down configuration.
178: The chamber coordinate systems are defined by a
179: rotation angle proportional to 30$^{o}$ giving an effective
180: ``stereo'' of crossed directions for 12 different views.
181:
182: A protective sheath is used to suppress background Compton
183: electrons from a chamber frame.
184: The protective sheath makes nonsensitive a small
185: region of the anode wire near the frame.
186:
187: The chamber consists of one layer of straw tubes (60 straws) of 5
188: mm diameter, and length varying from 70 cm to 130 cm. The total
189: number of chambers is 216. The chamber sensitive area starts from
190: 38 cm radius. The straws are assumed to have wall
191: thickness 15 $\mu$m and are
192: constructed of kapton. The total thickness of each chamber is 9
193: mg/$cm^2$. The total number of the tracker straws is 12960. The
194: tracker length is 302 cm. A signal from the straw anode wire
195: will be used to get drift time.
196:
197: The Al target is tapered in the downstream direction, with 5 cm
198: disk spacing and radii from 8.3 cm to 6.53 cm. The target is
199: placed in the graded portion of the DS magnetic field, with the
200: first disk at 1.75 T and the last at 1.3 T. Protons from the muon
201: capture are absorbed in three concentric polyethylene absorbers: a
202: conical tube of dimensions $R_{1}$ = 46 cm, $R_{2}$ = 70 cm, L =
203: 260 cm and a tube (R = 70 cm, L = 200 cm) both have thickness 3
204: mm, and a tube of smaller radius (R = 36 cm, L = 235 cm) of
205: thickness 0.5 mm which are placed before the tracker.
206:
207: \begin{figure}[htb!]
208: \centerline{\hbox{\psfig{figure=tracker.eps,height=3.5in,clip=on}}}
209: \caption{
210: Schematic drawing of the tracker design.
211: }
212: \label{fig:setup}
213: \end{figure}
214:
215: \bigskip
216: The transverse tracker is to be compared with a
217: longitudinal version that MECO is also considering.
218: The longitudinal tracker ~\cite{bnl} consists
219: of an octagonal array of eight detector planes placed symmetrically
220: around the Detector Solenoid axis, plus eight more planes projecting
221: radially outward from each vertex of the octagon. Each plane consists of
222: resistive straws approximately 300 cm long. A hit position
223: in the radial and azimuthal direction is determined by the straw
224: position and the drift time on the anode wire. The hit position in
225: the axial direction is determined by the centroid of the imaged charge
226: from the anode wire, as collected on cathode pads.
227:
228: The transverse tracker would have several advantages in comparison with a
229: longitudinal one since:
230:
231: \noindent
232: $\bullet$ Normal non resistive straws are used without a pad system. \\
233: $\bullet$ Gas manifolds, straw end-caps and chamber supports are
234: all outside the conversion electron trajectories. \\
235: $\bullet$ Shorter straws (0.7 - 1.3 m) are technically easier to build
236: and are more robust against instability than longitudinal straws of 3 m.\\
237: $\bullet$ Complications from the small tilt of each plane of the longitudinal
238: tracker with respect to the magnetic field are avoided.\\
239: $\bullet$ Transverse geometry provides a simple signature of an event since
240: charged particles cross a single straw only once.\\
241: $\bullet$ There is a significant simplification dealing
242: only with single chamber hits points.\\
243: $\bullet$ The average number of spatially separated hits is a few times greater in
244: comparison with the number of spatially separated clusters in the longitudinal
245: tracker. The hits are distributed uniformly in
246: lobes along the transverse tracker.\\
247: $\bullet$ The transverse tracker presents less material in the case of
248: 15 $\mu$m straw wall thickness. The effective total thickness is
249: 29*(15+2)*3.14 = 1550 $\mu$m versus longitudinal tracker 8*3.5*25*3.14 = 2200 $\mu$m.\\
250: $\bullet$ The pattern recognition can be performed with good precision
251: even without drift time and amplitude information.\\
252: $\bullet$ The Cu layer covering straw tubes suppresses significantly gas diffusion
253: through a straw wall.\\
254: $\bullet$ Capacitive crosstalk between channels is small.\\
255: $\bullet$ Low energy Compton electron backgrounds have distinguished signature.\\
256:
257:
258: \section{Background simulation}
259:
260: Our analysis is based on a full GEANT simulation taking into account an
261: individual straw structure.
262: Multiple scattering and energy loss are taken into account.
263: Isobutane ($C_{4}H_{10}$) gas is assumed to fill the tubes.
264:
265: For the subsequent analysis we take only events satisfying the
266: following criteria: number of hits in the tracker is greater than 15;
267: MC simulated energy release in the calorimeter is greater than 80 MeV;
268: pitch angle is greater than 45 $^{0}$. That leaves about $35\%$ of
269: the original events.
270:
271: The primary sources of charged particles in the tracker detector
272: during the detection time are protons, neutrons and photons from
273: muon capture by $^{27}Al$ nuclei and electrons from muon decay in
274: orbit.
275: The average and peak tracker rates from
276: different backgrounds are presented in Table ~~\ref{table:tab1}.
277: The main source of background to muon conversion is muon decay in orbit (DIO).
278:
279: \begin{center}
280: \begin{table}[htb!]
281: \caption { The Average and Peak (in parenthesis) Tracker Rates }
282: \begin{tabular}{|l|c|c|c|c|c|}
283: \hline
284: Processes&proton&neutron &$\gamma$ &DIO & DIO\\
285: & & & &$<$ 55 MeV &$>$ 55MeV\\
286: \hline
287: Particles/process & 0.1 & 1.2 & 1.8 & 0.9945 & $5.5 \cdot 10^{-3}$ \\
288: Particles/nsec & $120 (170)\times 0.1$ &$120 (170)\times 1.2$ &$120 (170) \times
289: 1.8$ &80 (115) & 80 (115)\\
290: \hline
291: Prob. Particle to Hit Tracker&$ 1.08 \cdot 10^{-2}$&$0.92 \cdot 10^{-3}$
292: &$0.9 \cdot 10^{-3}$ & $4.0 \cdot 10^{-4}$ & $1.4 \cdot 10^{-2}$\\
293: \hline
294: Events in Tracker/30ns&3.9 (5.5) &4.0 (5.6) &5.9 (8.2) &1.0 (1.4) & 0.2 (0.3)\\
295: \hline
296: Straws on in Tracker/event &14.0 &5.2 &31.0 & 15.7 & 4.3\\
297: \hline
298: Straw Rate (kHz)&140 (200) &53 (75) &470 (650) &40 (56) & 2.2 (3.3)\\
299: \hline
300: \end{tabular}
301: \label{table:tab1}
302: \end{table}
303: \end{center}
304:
305:
306:
307: To study the tracker performance in the presence of the background
308: the number of muon conversion events ($10^5$) with
309: initial momentum of conversion electrons produced
310: in the target (105 MeV/c) was simulated and saved to a data file.
311: The expected number of
312: DIO events during the experiment time ($10^7$ sec) in
313: this region is $5.2 \cdot 10^4 $ events above 100 MeV/c.
314:
315: To study the main background
316: the number of simulated DIO events
317: in this energy region was chosen to be ten times greater.
318: Five different random backgrounds (protons, neutrons, photons and
319: DIO (see Table ~\ref{table:tab1}) were generated and saved to data
320: files.
321:
322: The numbers in the table are the result of a GEANT simulations
323: at the anticipated beam intensity,
324: with no artificial increase as a safety factor.
325: No suppression was added for rejection of
326: heavy ionization particles.
327:
328: Background was taken from the data files and added to
329: conversion events and DIO events above 100 MeV/c. For example, we
330: simulate the number of events N for the proton background
331: according to a Poisson distribution with average 3.9
332: taken from background Table ~\ref{table:tab1}. Then we randomly
333: pick-up N accidental proton events from the corresponding data
334: file and add these events to the muon conversion or DIO event.
335: In the
336: same way the above procedure is repeated for all background types.
337:
338:
339:
340: In Figure ~\ref{fig:nrb} distributions in the number of real and background
341: tracker hits are shown.
342: The average number of real hits is 29.
343: The real hits distribution starts from 15 hits, corresponding to
344: the cut-off in the minimum number of selected hits.
345: The maximum number of real tracker hits can reach 60.
346: The average number of background hits is 300 which corresponds to an average
347: straw rate about 800 kHz.
348: The distribution in the number of background hits is a broad one and
349: the maximum number of hits is about 900.
350:
351:
352:
353: \begin{figure}[htb!]
354: \centerline{\hbox{\psfig{figure=Nreal.eps,height=2.0in,clip=on}
355: \psfig{figure=Nback_t.eps,height=2.0in,clip=on}}}
356: \caption{
357: Distributions in the number of real and background tracker hits.
358: }
359: \label{fig:nrb}
360: \end{figure}
361:
362:
363: \section{Pattern recognition}
364:
365:
366:
367: There are twelve views in the tracker,
368: each separated from the previous by 30 degrees. In a typical
369: event, a two-dimensional projection of the helical trajectory, a
370: sine curve, is observed in three or four of the views. There are
371: approximately 10 hits in each view, and 29 hits in the
372: event. The hits are grouped in lobes (see Figure ~\ref{fig:sin1}) with a typical
373: gap between lobes about 60 cm (in these gaps an electron travels in vacuum).
374: Therefore in the sensitive area of the tracker one gets track segments
375: but not a complete track and the problem of combining information
376: from different lobes arises. Since in a given view at least 4 hits
377: are required to reconstruct four parameters describing a
378: two-dimensional projection of helix often it is not possible to get helix
379: parameters for a track segment in a single lobe and than to apply
380: track element merging strategy. Quite opposite one has to use
381: hits from different lobes for pattern recognition and then for
382: momentum reconstruction. We will show that an appropriate mathematical
383: approach allows to resolve the problem..
384:
385: The sinusoidal projection of helix is described by four parameters:
386: $x_{0}^{\prime }$ , $z_{0}^{\prime }$, $R_{L}$, $R_{T}$,
387:
388: \begin{equation}
389: x_{i}^{\prime }=x_{0}^{\prime }+R_{T}\cos (\frac{z_{i}-z_{0}^{\prime }}{R_{L}%
390: })
391: \label{eq0}
392: \end{equation}
393:
394: where importantly, $R_{L}$ and $R_{T}$, related to the longitudinal and transversal
395: momenta, and z coordinate are common to all projections. Coordinate $x^{\prime}$ is
396: defined in a system for each given view.
397:
398: A simple and powerful reconstruction strategy
399: starts from a single chamber view. To determine the parameters in this view a minimum of four
400: hits are required.
401: The total momentum and momentum pitch angle are determined.
402:
403: \begin{figure}[htb!]
404: \centerline{\hbox{\psfig{figure=sin1.eps,height=2.5in,clip=on}}}
405: \caption{
406: Lobes in the tracker.
407: }
408: \label{fig:sin1}
409: \end{figure}
410:
411: To find the five parameters describing the full 3-dimensional helix,
412: one additional hit outside the plane is required:
413:
414: \begin{equation}
415: x_{i}^{\prime }=x_{0}^{\prime }+R_{T}\cos (\frac{z_{i}-z_{0}^{\prime }}{R_{L}%
416: }); ~~~~~~~
417: y_{i}^{\prime }=y_{0}^{\prime }+R_{T}\sin (\frac{z_{i}-z_{0}^{\prime }}{R_{L}%
418: })
419: \label{eq01}
420: \end{equation}
421:
422:
423: The mathematical procedure to find the parameters is the following one:\\
424:
425: \noindent
426: $\bullet$ four hits from a single view are taken to give the system of equations
427: for helix parameters in the coordinate system related with the view.\\
428: $\bullet$ the system of four equations is reduced to a unique equation for $R_{L}$. \\
429:
430: \begin{equation}
431: \Delta x_{24}S_{43}S_{41}S_{31}+\Delta x_{14}S_{43}S_{42}S_{23}+\Delta
432: x_{34}S_{41}S_{42}S_{12}=0 \label{eq2}
433: \end{equation}
434:
435: where $\Delta x_{ij}=x_{i}^{\prime }-x_{j}^{\prime }$ , $S_{ij}=\sin
436: ((z_{i}-z_{j})/2R_{L}).$
437: \\
438: \noindent
439: $\bullet$ the equation is solved for $R_{L}$ numerically by the Newton-Raphson method.\\
440: $\bullet$ the parameters are expressed in terms of $R_{L}$. \\
441:
442: \begin{equation}
443: \tan (z_{0}^{\prime }/R_{L})=\frac{\Delta x_{14}S_{43}\sin
444: ((z_{4}+z_{3})/2R_{L})-\Delta x_{34}S_{41}\sin ((z_{4}+z_{1})/2R_{L})}{%
445: \Delta x_{14}S_{43}\cos ((z_{4}+z_{3})/2R_{L})-\Delta x_{34}S_{41}\cos
446: ((z_{4}+z_{1})/2R_{L})} \label{eq3}
447: \end{equation}
448:
449: \[
450: R_{T}=\frac{\Delta x_{12}}{c_{1}-c_{2}}
451: \]
452:
453: \[
454: x_{0}^{\prime }=\frac{x_{2}c_{1}-x_{1}c_{2}}{c_{1}-c_{2}}
455: \]
456:
457: where $c=\cos ((z_{i}-z_{0}^{\prime })/R_{L}).$
458: \\
459: \noindent
460:
461: $\bullet$ a fifth hit from a different view is included to
462: obtain the remaining parameters of helix. \\
463:
464:
465: \begin{equation}
466: x_{0}=\frac{1}{\sin (\alpha ^{\prime \prime }-\alpha ^{\prime })}\left\{
467: x_{0}^{\prime }\sin \alpha ^{\prime \prime }-[x_{5}^{\prime \prime
468: }-R_{T}\cos (\frac{z_{5}-z_{0}}{R_{L}}-\alpha ^{\prime \prime })]\sin
469: (\alpha ^{\prime })\right\} \label{eq7}
470: \end{equation}
471:
472: \begin{equation}
473: y_{0}=\frac{-1}{\sin (\alpha ^{\prime \prime }-\alpha ^{\prime })}\left\{
474: x_{0}^{\prime }\cos \alpha ^{\prime \prime }-[x_{5}^{\prime \prime
475: }-R_{T}\cos (\frac{z_{5}-z_{0}}{R_{L}}-\alpha ^{\prime \prime })]\cos
476: (\alpha ^{\prime })\right\} . \label{eq8}
477: \end{equation}
478:
479: where $\alpha ^{\prime }$ ($\alpha ^{\prime \prime }$) is the angle of
480: rotation from the global system to a given chamber system,
481:
482: \begin{equation}
483: z_{0}=z_{0}^{\prime }-\alpha ^{\prime }R_{L}. \label{eq6}
484: \end{equation}
485:
486:
487: It is important to note that the consideration of several lobes
488: (see Figure ~\ref{fig:sin1}) allows calculation of the
489: longitudinal radius $R_{L}$ with high precision $\Delta P/P
490: \approx 10^{-4}$ due to the large distance between the lobes in
491: comparison with their size.
492:
493:
494: A two stage procedure was developed to provide the pattern
495: recognition in the transverse tracker:\\
496:
497: $\bullet$ {\bf pattern recognition without drift time.} At this step only
498: information on centers of straw hits is used. The parameters of
499: the reconstructed average helix are considered as a starting point
500: to find an approximate helix
501: which fits the straw hit centers.\\
502:
503: $\bullet$ {\bf pattern recognition with drift time.} A deterministic annealing
504: filter DAF ~\cite{daf} ~is applied to make a final background
505: suppression and provide a starting
506: point for track momentum reconstruction by the Kalman
507: filter technique ~\cite{rkalm}, ~\cite{kalman}.
508:
509:
510: \subsection*{Pattern recognition without drift time information}
511:
512: In the first level of the analysis procedure (the straw hit center
513: approximation) only centers of straw hits were used as chamber hit
514: coordinates $x^{\prime}$, z.
515: It was assumed that there is only one useful muon conversion
516: or DIO track.
517:
518:
519: It is important to note that the straw hits allow us to
520: reconstruct the helix parameters with high precision even without
521: drift time information. Indeed, the uncertainty in the helix
522: radius $\Delta R_{T}$ is approximately $\sigma \sim D/\sqrt12
523: \approx$ 0.14 cm for the diameter of a straw tube $D = $5 mm. The
524: average helix radius for muon conversion events is $R_T \sim $ 25
525: cm . We can therefore expect the momentum resolution $\Delta P/P
526: \sim 0.5 - 1\%$
527: for a straw diameter of 5 mm.
528: It is worth noting that using straw tube center positions
529: without drift time information we can reconstruct the total
530: momentum with the standard deviation $\sigma $ = 0.45 MeV/c for
531: a straw diameter of 5 mm.
532:
533: \begin{figure}[htb!]
534: \centerline{\hbox{\psfig{figure=pin_100k.eps,height=3.0in,clip=on}}}
535: \caption{
536: Momentum distribution of electrons entering the tracker.
537: }
538: \label{fig:Pin}
539: \end{figure}
540:
541:
542: \bigskip
543:
544: The initial momentum distribution of electrons entering the
545: tracker and satisfying the criteria mentioned above is shown in
546: Figure ~\ref{fig:Pin}. This distribution is the a narrow one
547: and can be fitted by a Gaussian with a
548: standard deviation $\sigma $ = 0.21 MeV/c and average momentum
549: P$_{in}$ = 104.4 MeV/c in the range 104.2 MeV/c - 105 MeV/c.
550: The width of
551: the initial distribution of electrons is small in comparison with
552: the total width of the momentum distribution reconstructed in the
553: straw hit center approximation.
554:
555: In Figure ~\ref{fig:sel_0} an initial map of real and background tracker
556: hits in all tracker's views is shown as an example for a first simulated event.
557: We will call it the sample event in the following.
558: The number of real tracker hits is 29 and the number of background
559: hits is 260 in this event. The real hits are depicted as cyan spots
560: and the background ones are shown as red spots. Note that in the
561: figure all 12 views were joined in one for all 18 tracker modules.
562:
563: \begin{figure}[htb!]
564: \centerline{\hbox{\psfig{figure=sel_0.eps,height=2.5in,clip=on}}}
565: \caption{
566: Plot of real + background tracker hits before the selection procedure.
567: }
568: \label{fig:sel_0}
569: \end{figure}
570:
571: One can see from the initial distribution of tracker hits that
572: some of background hits are concentrated along straight lines
573: ( Figure ~\ref{fig:sel_0}) providing a typical signature of Compton
574: electrons from photon background.
575:
576: The procedure of the pattern recognition and
577: fitting without drift time in the presence of background
578: can be described by the following steps:
579:
580:
581: \bigskip
582: 1) {\bf Rejection of hits produced by Compton electrons.} \\
583:
584:
585: $\bullet$
586: A grouping containing 6 modules is taken for a given view. If in
587: this grouping there are $N_{1}>2$ hits with the same tube number
588: (same straw (x$^{\prime}$) coordinate) the corresponding hit numbers
589: are recorded.
590: If, in a second grouping displaced by one module from the first,
591: there are $N_{2}>2$ hits with the
592: same tube numbers as at the initial position of the grouping then
593: hits from the first and the second grouping are marked as Compton electron hits. The
594: procedure is repeated in forward and backward directions. At the
595: end of this step all marked hits are temporary removed from
596: the following analysis. This selection typically reduces twofold the number of
597: background hits.
598: Also about 5$\% $ of
599: the real hits are rejected by the procedure
600: but later they can be restored.
601:
602:
603: Figure ~\ref{fig:sel_1} demonstrates the remaining tracker hits after application
604: of the rejection procedure to the sample event. The suppression of background hits
605: is evident.
606:
607: \begin{figure}[htb!]
608: \centerline{\hbox{\psfig{figure=sel_1.eps,height=2.5in,clip=on}}}
609: \caption{
610: Plot of real + background tracker hits after the first selection procedure.
611: }
612: \label{fig:sel_1}
613: \end{figure}
614:
615: \bigskip
616:
617: 2) {\bf Selection based on calorimeter and tracker hits.} \\
618:
619: To reconstruct the helix parameters four hits are needed.
620: To reduce the number of four hit combinations
621: the calorimeter hit is used in each combination.
622: The calorimeter coordinate resolution is $\sigma = 1.5$ cm which is
623: ten times greater than tracker hit resolution in the straw hit center
624: approximation.
625: In spite of the poor coordinate calorimeter resolution
626: it is possible to get a significant background suppression.
627:
628: This procedure is described by the following steps:
629:
630: $\bullet$ For a given view all possible three hit
631: combinations are chosen and four hits are formed by adding
632: a calorimeter hit to the three hit combinations.
633: The mathematical approach described above
634: is applied to four hits.
635: The algorithm allows us to calculate $R$ , $\cos \theta $ ,
636: $z_{0}^{\prime }$ , $x_{0}^{\prime }$ in the chamber system of
637: coordinates. Only the combinations that survive a cut-off in the
638: particle
639: momentum $p_{\min }=94MeV/c<p<p_{\max }=114MeV/c$ and pitch angle
640: $\theta _{\min }=41.4^{\circ }<\theta <\theta _{\max }=66.4^{\circ }$ are
641: retained for subsequent analysis.
642:
643: \bigskip
644: $\bullet$ A fifth hit from a different view than
645: the four hit combination is added.
646: This allows us to calculate all 5 helix
647: parameters in the global system of coordinates. A cut-off in $x_{0}$ and
648: $y_{0}$ is applied at this step to five hits.
649: The coordinates $x_{0}, y_{0}$ are required to be in an acceptable range
650: $-40 < x_{0},y_{0} < 40 $ cm,
651: where the range is determined by the GEANT simulation
652:
653:
654: \bigskip
655: $\bullet$ We look for hits correlated in 3 dimensions to choose
656: good five hit combinations. For each selected five hit
657: combination using the found helix parameters we reconstruct the
658: helix, calculate all crossings of the tracker chambers for the
659: reconstructed trajectory and define the total number of crossings
660: N. Then we calculate how many times M at least one tracker hit
661: matches the crossing in a road $\pm $ 3 cm. In an
662: ideal case M should be equal to N.
663:
664: \bigskip
665: $\bullet$ The probability is evaluated to get M hits in the road.
666: The probability that M of N crossings are in the road
667: is estimated by a trial function
668:
669: \[
670: \mathit{Prob
671: }=\frac{N!}{M!(N-M)!}\epsilon^{M}(1-\epsilon)^{N-M}
672: \]
673:
674: where ~~$\epsilon$ is
675: the probability of that there is a hit
676: in the road within $\pm$3 cm.
677:
678: If the total probability \textit{Prob} is greater than the threshold
679: probability then the
680: given helix is considered as a good candidate.
681: The threshold probability and ~$\epsilon$ were found empirically
682: to be 0.001 and 0.95, respectively.
683:
684: As an output of the previous steps we get a collection of valid
685: five hit combinations (helices).
686: An individual hit is kept if it is in any valid hit
687: combination, providing a list of good tracker hits. It is worth
688: emphasizing that, due to the strong spatial correlations between the
689: helix hits in comparison with the un-correlated background, the
690: number of false hits is reduced drastically on applying the
691: road requirement. On average the number of background hits is
692: reduced due to this step by a factor 20.
693:
694: In Figure ~\ref{fig:sel_2}
695: the surviving hits are shown.
696: The significant reduction of background hits can be seen by comparing
697: Figure ~\ref{fig:sel_1} and Figure ~\ref{fig:sel_2}.
698:
699:
700: \begin{figure}[htb!]
701: \centerline{\hbox{\psfig{figure=sel_2.eps,height=2.5in,clip=on}}}
702: \caption{
703: Plot of real + background tracker hits after the second selection procedure.
704: }
705: \label{fig:sel_2}
706: \end{figure}
707:
708:
709:
710: Note that at this step no real hits are removed for the sample event.
711:
712: \bigskip
713:
714:
715: 3) {\bf Selection based on tracker hits.}\\
716:
717: In this selection procedure the tracker hits found in step 2 are used.
718: This procedure repeats the previous one with the following changes:\\
719:
720: $\bullet$ Hits are chosen only from the tracker hits which allows to
721: improve the performance of the selection procedure since tracker hit
722: coordinates are defined significantly better than calorimeter hit
723: coordinates.
724:
725:
726: A plot of the number of four hit and five hit
727: combinations in the presence of the background (in average about 120
728: and 630 respectively) is shown in Figure ~\ref{fig:comb} (a) and (b).
729:
730: \begin{figure}[htb!]
731: \centerline{\hbox{\psfig{figure=3plus1.eps,height=2.0in,clip=on}
732: \psfig{figure=3plus2.eps,height=2.0in,clip=on}}}
733: \caption{
734: Number of four hit and five hit combinations.
735: }
736: \label{fig:comb}
737: \end{figure}
738:
739: As seen in Figure ~\ref{fig:comb} these distributions are
740: broad ones reaching a few thousand combinations.
741:
742:
743: \bigskip
744: $\bullet$ For each five hit combination selected above we evaluate
745: the position of a hit in the
746: calorimeter $x_{eval}$ , z$_{eval}$ on the basis of the defined helix parameters.
747: The hit combination is accepted
748: only if the evaluated hit matches the calorimeter hit within a road : $%
749: \left| x_{calo}-x_{eval}\right| < 7 cm$ .
750:
751:
752: $\bullet$ Parameters at this step are more
753: restrictive: the road width is taken to be 0.75 cm and the
754: probability to be within the road $\epsilon$ = 0.97.
755:
756: Figure ~\ref{fig:sel_3} shows tracker hits
757: remaining after this step for the sample event.
758: The number of background hits is
759: reduced typically by a factor 4 - 5 due to this step.
760:
761:
762: \begin{figure}[htb!]
763: \centerline{\hbox{\psfig{figure=sel_3.eps,height=2.5in,clip=on}}}
764: \caption{
765: Plot of real + background tracker hits after the third selection procedure.
766: }
767: \label{fig:sel_3}
768: \end{figure}
769:
770:
771:
772: \bigskip
773: 4) {\bf Selection including restored tracker hits.} \\
774:
775: $\bullet$ Some real tracker hits are lost
776: at previous steps of the procedure especially at Compton electron hits
777: rejection. To restore the lost hits an average helix for a given event is
778: reconstructed and tracker hits that match the average helix in a road
779: $\pm$1 straw are added to the list of valid tracker hits obtained above.
780:
781:
782: $\bullet$ Step 3 of the procedure is repeated for the extended list of tracker
783: hits. On average the list of valid
784: hits is extended by 1 real and 2 background hits.
785:
786:
787: $\bullet$ The union of all hits in valid five hit combinations
788: in the extended list is
789: used to provide an input for an average helix.
790: A global fit is applied to reconstruct the helix parameters on the basis of
791: the list of the selected tracker hits and the parameters of the average helix are
792: considered as a starting point for the fit. \\
793:
794: In this section the pattern recognition procedure without drift time
795: based on the straw hit center approximation was developed.
796: In this approximation no ambiguity due to mirror hits arises
797: and the total momentum is reconstructed
798: with the standard deviation $\sigma $ = 0.45 MeV/c.
799: The selection procedure based on 3D space correlations between real
800: tracker hits inside the road significantly reduces background.
801: The overall background rejection factor is about 130
802: for the pattern recognition procedure without drift time.
803:
804:
805:
806: \subsection*{Pattern recognition with drift time }
807:
808: The second stage in the pattern recognition procedure uses the
809: fitted helix and drift time for hits selected at the first stage.
810: The reconstruction of helix parameters can be improved by taking
811: into account that in addition to a straw coordinate for
812: each hit a chamber gives the radius r calculated from the measured drift
813: time $t_{i}^{meas}$. The errors ($\sigma $) in radius measurements
814: were taken to be 0.2 mm. This radius r carries an ambiguity as to
815: whether the track passed left or right of the wire. The search of
816: two possible up and down hit positions lying on the circle of
817: radius r is based on the fitted helix obtained previously in the straw
818: hit center approximation.
819: We can call these up and down points as true and mirror ones.
820: Up and down points are extracted (see
821: Figure ~\ref{fig:TTRS1}) from the intersections of a normal to the
822: helix through the straw center and the circle of the drift radius
823: r. In this case coordinates of up and down points are given by
824:
825: \begin{equation}
826: x_{\prime} = x_{c} \mp r sin \beta ;
827: ~~~~~z = z_{c} \pm r cos \beta ;\\
828: ~~~~~tan\beta = -(R_{T}/R_{L})sin((z_c-z_{0}^{\prime})/R_{L})
829: \label{equpdown}
830: \end{equation}
831:
832: \begin{figure}[htb!]
833: \centerline{\hbox{\psfig{figure=ttrs1.eps,height=2.2in,clip=on}}}
834: \caption{
835: Reconstruction of up and down points by the helix obtained in the
836: straw hit center approximation.
837: }
838: \label{fig:TTRS1}
839: \end{figure}
840:
841:
842: The uncertainties $\Delta s$ (see Figure ~\ref{fig:TTRS2})
843: in the determination of up and down point
844: positions are
845: small and can be evaluated in the following way.
846: The direction of 2D helix in the chamber coordinate system is given by
847: $ tan\beta $ (see Figure ~\ref{fig:TTRS2} and Eq.(\ref{equpdown})).
848:
849: \begin{figure}[htb!]
850: \centerline{\hbox{\psfig{figure=ttrs2.eps,height=2.0in,clip=on}}}
851: \caption{
852: Uncertainties in up and down point positions.
853: }
854: \label{fig:TTRS2}
855: \end{figure}
856:
857: Since the uncertainty in $R_{T}$ is the dominant one the uncertainty in the
858: direction of the helix is of the order
859: $\Delta \beta /\beta \sim \Delta
860: R_{T}/R_{T} \sim \Delta
861: R/R \sim \Delta P/P$ where $\Delta P/P<0.01, \beta \sim 1$.
862:
863: The uncertainty in the determination of up or down point
864: position \\
865: $\Delta s \sim \Delta \beta $ r $ <
866: \beta (\Delta P/P)r_{tube} \sim 1 \times 10^{-2} \times 0.25$ cm $ \sim 25 \mu$m
867: which is much less than a measurement precision 200 $\mu $m.
868:
869: To reject background hits remaining after pre-selection stage and
870: to resolve the up - down ambiguity
871: the deterministic annealing filter (DAF) and the Kalman filter (KF)
872: are applied.
873:
874:
875: In principle at this stage the Kalman filter (KF) approach \cite{kalman}
876: could be applied. However an application of the Kalman filter
877: (see Appendix A) requires that the problem of assignment of hits to a track
878: has been entirely resolved by the preceding selection procedure.
879: If this is not the case the filter has to run on every possible
880: assignment choosing the best one according to the chi-square
881: criterion. For the number of tracker hits greater than 15 this
882: combinatorial search is computationally expensive and practically
883: unfeasible. Therefore as the last step of pattern recognition
884: we will use the deterministic annealing filter (DAF) ~\cite{daf}.
885:
886:
887: DAF is a Kalman filter with re-weighted observations (see Appendix B).
888: For the DAF procedure we introduce artificial layers placed at the chamber straw centers
889: in order to have competing true and mirror points in one layer.
890: To overcome the problem of insufficient information in the initial
891: phase of the filter, an iterative procedure is applied. After a first
892: pass of filter and after smoothing, the track position can be predicted
893: in every layer of the tracker. Based on these predictions, the assignment
894: probabilities for all competing hits can be calculated in every layer.
895: If the probability falls below a certain threshold, the hit is excluded
896: from the following consideration. The assignment probabilities of
897: the remaining hits are normalized to one and used as the weights
898: in the next iterations of the filter.
899:
900: In our case the operation of DAF is described by the following steps:\\
901:
902: $\bullet$ every true and mirror point is projected on a
903: layer corresponding to the center of straw in the direction
904: defined by the fitted helix for a given event.\\
905:
906: $\bullet$ initial probabilities of competing points
907: in the layer are assumed to be equal.\\
908:
909: $\bullet$ annealing schedule is chosen according to the following
910: formula\\
911: $V_{n}=V (\frac{50}{f^{n}}+1)$ for a variance of observations,
912: where V = $\sigma^{2}$ and $\sigma$ = 200 $\mu$m.
913: The annealing factor f is chosen to be either 1.4 or 2.\\
914:
915: $\bullet$ standard Kalman filter runs on all layers taking
916: observations as weighted mean according to assignment probabilities.\\
917:
918: $\bullet$ the filter runs in the opposite direction, using the
919: same weighted mean as the forward filter. By taking a weighted mean of
920: the predictions of both filters at every layer, a smoothed state vector
921: and its covariance matrix are obtained.\\
922:
923: $\bullet$ based on these predictions and the covariance matrix, the
924: assignment probabilities of the hits are calculated. If combined
925: hit probability for true and mirror point falls below a
926: certain threshold ($10^{-7}$), the hit is rejected. The assignment probabilities of
927: the remaining points in the layer are normalized
928: to one and used as the weights
929: in the next iterations of the filter.\\
930:
931: $\bullet$ iterations in n stop if one of the following
932: conditions is satisfied:\\
933: 1)for the reconstructed track $\chi^2 > \chi^2_{max}$
934: where $\chi^2_{max}$ = 1000 or $\chi^2 < 0$.\\
935: 2)for the reconstructed initial momentum $P_{in} < 94 MeV/c$ or
936: $P_{in} > 114 MeV/c$\\
937: 3)variation of $\chi^2$ is small in comparison with the
938: previous iteration $|\chi^2_{n+1}-\chi^2_{n}|/\chi^2_{n}<0.01$.\\
939:
940: $\bullet$ when iterations stop (on average after 7 iterations)
941: The DAF procedure is repeated for the different annealing factor f.
942: From results corresponding to two annealing factors we choose
943: that one corresponding to the minimum $\chi^2$.\\
944:
945: Figure ~\ref{fig:sel_5} shows the distribution of tracker hits for the sample event
946: (recall that initially we had 260 background hits for this event).
947: There are no surviving background hits and missing real hits
948: in this case. Reconstructed lobes are clearly seen in the
949: Figure. We conclude that DAF is effective in rejection
950: of background hits remaining after pre-selection procedure and
951: also it provides a good starting point for the track reconstruction.
952:
953:
954: \begin{figure}[htb!]
955: \centerline{\hbox{\psfig{figure=sel_5.eps,height=2.5in,clip=on}}}
956: \caption{
957: Plot of real + background tracker hits after the DAF selection procedure.
958: }
959: \label{fig:sel_5}
960: \end{figure}
961:
962:
963:
964: Figure ~\ref{fig:Nback}(a) represents the number of real
965: hits lost by the pattern recognition procedure. Some of the real
966: tracker hits (0.8 hits $\sim 2.7\%$) are lost due to the selection
967: procedure.
968:
969: Figure ~\ref{fig:Nback}(b) represents the number of background
970: hits remaining after the selection procedure. As one
971: can see from Figure ~\ref{fig:Nback}(b) the number of
972: background hits remaining is 0.38 hits in comparison with the
973: primary 300 hits. So a total background suppression factor is
974: 300/0.38 $\approx$ 800.
975:
976:
977: \begin{figure}[htb!]
978: \centerline{\hbox{\psfig{figure=Nmiss.eps,height=2.0in,clip=on}
979: \psfig{figure=Nback.eps,height=2.0in,clip=on}}}
980: \caption{
981: Distributions in the number of missing real hits and
982: remaining background hits.
983: }
984: \label{fig:Nback}
985: \end{figure}
986:
987: \section{Reconstruction based on Kalman filter }
988:
989: Background studies presented in this subsection are based on the
990: application of the Kalman filter to hits selected by previous pattern
991: recognition procedure.
992:
993: At this stage the reconstruction is based on the hits selected by the
994: pattern recognition procedure described above.
995:
996: In principle we could use the results obtained by DAF for the track
997: reconstruction but our analysis shows that an application of
998: a combinatorial Kalman filter to hits selected by DAF provides better
999: precision.
1000:
1001: In the last decade the Kalman filter (KF) approach \cite{kalman}
1002: has been extensively exploited for track fitting in high energy
1003: physics. This approach possesses the following
1004: features for effective track fitting: \\
1005: \noindent
1006: $\bullet$ multiple scattering and energy losses are included in a natural way;\\
1007: $\bullet$ a 3D trajectory is restored that approximates closely the real one; \\
1008: $\bullet$ complex tracker geometries are handled in a simple way;\\
1009: $\bullet$ N$\times $N matrix inversion, where N is the total number of measurements,
1010: is avoided; \\
1011: $\bullet$ control for error propagation is provided; \\
1012: $\bullet$ trajectory is reconstructed progressively from one measurement to the next,
1013: improving the precision with each step; \\
1014: $\bullet$ initial and final momenta of a particle crossing the
1015: tracker are reconstructed. \\
1016:
1017: The KF is very useful because it simultaneously finds and fits
1018: the track;
1019: it is much more economical than the
1020: conventional least-squares global fit. The KF is a ``progressive"
1021: step by step method whose predictions are rather poor at the
1022: beginning of the track at the first stage of filtering. Since a
1023: state vector ${\bf{x}_k}=(x,y,t_x,t_y,1/p_{L})$ at point k (see
1024: definitions in Appendix A) has five parameters we need
1025: approximately $\simeq$ 6-7 straw hits to get good KF prediction
1026: precision. The prediction step, in which an estimate is made for
1027: the next measurement from the current knowledge of the state
1028: vector, is very useful to discard noise signal and hits from other
1029: tracks. Assuming the validity of the helix track model for each
1030: step, the KF propagates the track in 3D space, from one 2D surface
1031: to the next.
1032:
1033: Below we will use the standard notations:
1034:
1035: $\bf{x}_{k+1}^{k}$ is a prediction, i.e. the estimation of the
1036: ``future" state vector at position ``k+1" using all the ``past"
1037: measurements up to and including ``k".
1038:
1039: $\bf{x}_{k}^{k}$ is a filtered state vector, i.e. the
1040: estimation of the state vector at position ``k" based upon all
1041: ``past" and ``present" measurements up to and including ``k".
1042:
1043: The same notations will be held for the covariance matrix
1044: $\bf{C}$, noise matrix $\bf{Q}$ and so on.
1045:
1046: The Kalman filter algorithm can be divided into three major steps. \\
1047:
1048: INITIALIZATION
1049:
1050: The KF can start from arbitrary parameters and an infinite
1051: covariance matrix but for track finding applications it is
1052: significantly better to fix somehow the initial state if possible.
1053:
1054: The initialization of forward and backward KF algorithms is quite
1055: simple. It starts from the artificial point and initial
1056: parameters calculated from the previous stage of reconstruction
1057: procedure. The initial covariance matrix is empirically found to be diagonal
1058: with matrix elements being much greater than the corresponding
1059: uncertainties: $\bf{C}_{0}^{0}$ = (0.3,0.3,0.03,0.03,0.000003).\\
1060:
1061: FILTERING
1062:
1063: Once the KF is initialized it makes standard consequent steps. The
1064: current state at the k-th step is defined by the state vector
1065: $\bf{x}_k$, the state covariance matrix $\bf{C}_{k}^{k}$
1066: and the current straw hit.
1067: To take the (k+1)-th step it is necessary to:\\
1068:
1069: \noindent
1070: $\bullet$ update $\bf{x}_{k}^{k}, \bf{C}_{k}^{k}$ to take into account the ionization losses \\
1071: $\bullet$ define the next hit object (up or down point) \\
1072: $\bullet$ propagate the parameters and the covariance matrix
1073: to: $\bf{x}_{k}^{k} \rightarrow \bf{x}_{k+1}^k$, $\bf{C}_{k}^{k} \rightarrow \bf{C}_{k+1}^k$\\
1074: $\bullet$ update $\bf{C}_{k+1}^k$ to take into account
1075: multiple scattering $\bf{C}_{k+1}^k \rightarrow \bf{C}_{k+1}^k + \bf{Q}_k$\\
1076: $\bullet$ calculate the Kalman matrix $\bf{K}_{k+1}$ \\
1077: $\bullet$ update the covariance matrix $\bf{C}_{k+1}^{k+1}$ \\
1078: $\bullet$ calculate residuals $\bf{r}_{k+1}^{k+1} $ and their covariance matrices $\bf{R}_{k+1}^{k+1} $ \\
1079: $\bullet$ calculate the incremental $\chi^2 =
1080: (\bf{r}_{k+1}^{k+1})^T (\bf{R}_{k+1}^{k+1})^{-1} \bf{r}_{k+1}^{k+1}$ \\
1081: $\bullet$ store all information defining the new state\\
1082:
1083: SMOOTHING
1084:
1085: In the standard Kalman filter algorithm the smoothing is a well
1086: defined procedure. Smoothing allows one to obtain the best
1087: estimate of the track parameters at any trajectory point using all
1088: hit information accumulated during the KF propagation.
1089:
1090:
1091: Figure ~\ref{fig:forw} (a) displays the results of the KF
1092: forward filtering for the total momentum Ptot
1093: reconstruction at each tracker hit position. Figure
1094: ~\ref{fig:forw} (b) displays how the KF smoother, based on all
1095: hit information accumulated during the KF filtering, improves the
1096: total momentum Ptot reconstruction at each tracker hit
1097: position.
1098:
1099:
1100: \begin{figure}[htb!]
1101: \centerline{\hbox{%
1102: \psfig{figure=forw_p_f.eps,height=2.0in,clip=on}
1103: \psfig{figure=forw_p_s.eps,height=2.0in,clip=on}}}
1104: \caption{
1105: The total momentum reconstructed by the
1106: forward Kalman filtering a) and by the smoothing b)
1107: for each hit of a selected event. The y-axis is Ptot, the
1108: x-axis is the z-ordered hit number.
1109: }
1110: \label{fig:forw}
1111: \end{figure}
1112:
1113:
1114: Figure ~\ref{fig:back} (a) displays the results of the KF
1115: backward filtering for the total momentum Ptot
1116: reconstruction at each tracker hit position. Figure
1117: ~\ref{fig:back} (b) displays how the KF smoother, based on all
1118: hit information accumulated during the KF filtering, improves the
1119: total momentum Ptot reconstruction at each tracker hit
1120: position.
1121:
1122: \begin{figure}[htb!]
1123: \centerline{\hbox{%
1124: \psfig{figure=back_p_f.eps,height=2.0in,clip=on}
1125: \psfig{figure=back_p_s.eps,height=2.0in,clip=on}}}
1126: \caption{ The total momentum reconstructed by the
1127: backward Kalman filtering a) and by the smoothing b)
1128: for each hit of a selected event. The y-axis is Ptot, the
1129: x-axis is the z-ordered hit number.
1130: }
1131: \label{fig:back}
1132: \end{figure}
1133:
1134:
1135: The Kalman filter approach is effective in the resolving
1136: of up - down ambiguity. By
1137: applying the Kalman filter at each step up and down points are
1138: considered and the point providing the best $\chi^{2}$ for the
1139: trajectory is selected as the true point. This procedure is
1140: approximately linear in the number of tracker hits in comparison with
1141: the a combinatorial search, which is not feasible.
1142:
1143: At this reconstruction stage due to left-right ambiguity for the
1144: straw drift chamber we have a set of true and mirror points for
1145: each straw hit. As discussed above the reconstruction procedure
1146: for each tracker hit defines true and mirror points and one of
1147: them is close to the real point with high precision $\simeq$
1148: 25 $\mu$m.
1149: So we can formulate our goal as to find a true point
1150: combination for N straw hits in the presence of N mirror points.
1151:
1152: The procedure to find the best approximation to the true point
1153: combination for N straw hits in the presence of N mirror points
1154: for one track from a muon-electron conversion event
1155: is based on the following steps:\\
1156:
1157: $\bullet$ chose the first eight straw hits and built $2^8$ (256)
1158: possible hit combinations corresponding to up and down points.
1159: The KF forward and backward procedures described above are applied
1160: to these combinations. Only those combinations which satisfy a
1161: rather loose $\chi^2$ cut, $\chi^2 <$ 30,
1162: are retained (typically about 10 out of the initial 256 combinations);\\
1163:
1164: $\bullet$ make a loop for all retained combinations with fixed up
1165: and down points for the first eight straw hits. For the 9th and
1166: higher straw hit, the up and down point choices are take into
1167: account and the point with minimal incremental $\chi^2$ for
1168: this point is selected for the further KF propagation step. If
1169: both incremental $\chi^2$ satisfy the cut $\chi^2 < $ 10 the
1170: second point is stored in the stack to make an iterative loop.
1171: A single combination of all possible steps defines a candidate
1172: track. At this stage on average 45 candidate tracks are stored
1173: in the stack per event; \\
1174:
1175:
1176: $\bullet$ make a loop for all combinations from the stack. For
1177: each new hit added to the hits restored from the stack again the
1178: two up and down point choices are taken into account and the
1179: point with the minimal incremental $\chi^2$ for this point
1180: is selected for the further KF propagation step; \\
1181:
1182:
1183: $\bullet$ select up and down track combinations with the minimal $\chi^2$ for the track; \\
1184:
1185: $\bullet$ select a track satisfying the cut $\chi^2 < $ 70; \\
1186:
1187: $\bullet$ select a track with the difference between the forward
1188: (Pin\_f) and backward (Pin\_b)
1189: reconstructed input momentum satisfying the cut $\vert Pin\_f - Pin\_b \vert <$ 0.7 MeV/c. \\
1190:
1191:
1192:
1193:
1194: The Kalman filter reconstructs a trajectory of a particle
1195: in three dimensions. The trajectory is bent
1196: each time it crosses a tracker plane due to multiple
1197: scattering. Therefore, the reconstructed track is a set of
1198: helices that intersect at the planes.
1199: This is the track followed by the particle.
1200:
1201: Figure ~\ref{fig:ev2} displays a 2D projection of 3D trajectory
1202: reconstructed by the Kalman filter for the sample event.
1203: As above in this figure all 12 views were joined in one
1204: for all 18 tracker modules. The 2D trajectory is shown
1205: only for sensitive area of the tracker and for each view
1206: the trajectory is in a different color.
1207: For the sample event real hits are in four
1208: tracker's views and the reconstructed
1209: lobes for these views are clearly seen in the figure.
1210:
1211: Due to a scale in this figure the 2D trajectory looks
1212: as an ideal sine curve and tracker hits look like spots of
1213: different size. In order to see a detailed behavior of the
1214: trajectory and hit positions a dynamical zoom is applied
1215: to a rectangular region indicated in Figure ~\ref{fig:ev2}
1216: in the x-range 49-50 cm and
1217: z-range 1029-1029.5 cm.
1218:
1219: Figure ~\ref{fig:ev21} demonstrates the magnified region of the tracker.
1220: Blue line in the figure represents the reconstructed trajectory.
1221: Two circles represent two hits in chamber.
1222: These circles look like ellipses due to different axis scales.
1223:
1224:
1225:
1226: \begin{figure}[htb!]
1227: \centerline{\hbox{\psfig{figure=ev2.eps,height=3.0in,clip=on}}}
1228: \caption{
1229: Plot of real + background tracker hits after the DAF selection procedure.
1230: }
1231: \label{fig:ev2}
1232: \end{figure}
1233:
1234: \begin{figure}[htb!]
1235: \centerline{\hbox{\psfig{figure=ev2_1.eps,height=2.5in,clip=on}}}
1236: \caption{
1237: Zoom enlargement of the region in Figure ~\ref{fig:ev2}.
1238: }
1239: \label{fig:ev21}
1240: \end{figure}
1241:
1242:
1243: Radii of the circles are directly
1244: proportional to the drift times.
1245: For illustrative
1246: purposes two nearest to the straw center points corresponding to the real
1247: trajectory obtained in Monte Carlo
1248: simulation are shown in the Figure in the form of diamonds
1249: ~ (note that in the pattern recognition procedure only
1250: radii were used but not these points).
1251: The measurement uncertainty was taken into account assuming
1252: that circle radii are distributed normally about the simulated
1253: radii with $\sigma = 200 \mu$m. For this reason the position
1254: of one of the nearest points is not on a circle in Figure ~\ref{fig:ev21}.
1255:
1256: In Figure ~\ref{fig:ev21} due to the corresponding scale the trajectory
1257: looks like a straight line. It is tangent to one of two circles
1258: obtained on the basis of drift time.
1259: In the region under consideration the deviation of the
1260: trajectory from the nearest point is less than 0.2 mm.
1261: The change in the direction of the trajectory due to
1262: multiple scattering can not be seen in the Figure because of
1263: the smallness of the average angle of the scattering.
1264:
1265: Figure ~\ref{fig:fig_3D3}
1266: shows transverse xy-projection of the
1267: trajectory for the sample event.
1268: In this projection the trajectory looks approximately
1269: as a circle.
1270:
1271: \begin{figure}[htb!]
1272: \centerline{\hbox{\psfig{figure=fig_3D3.eps,height=3.0in,clip=on}}}
1273: \caption{
1274: The transverse projection of the 3D trajectory reconstructed for the sample event.
1275: }
1276: \label{fig:fig_3D3}
1277: \end{figure}
1278:
1279: However if a specific region of the tracker is magnified
1280: by the dynamical zoom one can see in Figure ~\ref{fig:fig_3D4}
1281: that the shape of the
1282: circle is distorted due to multiple scattering and energy
1283: loss. More than two turns of trajectory are clearly
1284: seen in the Figure.
1285:
1286:
1287: \begin{figure}[htb!]
1288: \centerline{\hbox{\psfig{figure=fig_3D4.eps,height=2.0in,clip=on}}}
1289: \caption{
1290: Zoom enlargement of the region in Figure ~\ref{fig:fig_3D3}.
1291: }
1292: \label{fig:fig_3D4}
1293: \end{figure}
1294:
1295:
1296: Note that by using the KF filter a momentum of a particle can be
1297: reconstructed at any point of the tracker. For our purposes the most
1298: important is the momentum of a particle entering the tracker,
1299: which in the following we will call the input momentum.
1300:
1301: Figure ~\ref{fig:fig_3D1}
1302: demonstrate 3D trajectory reconstructed for the
1303: sample event.
1304: The trajectory looks in this scale as a helix, but we remind that it
1305: consists of many helix parts.
1306: Also in the Figure tracker's hits
1307: generated by Monte Carlo simulation program are shown.
1308:
1309:
1310: \begin{figure}[htb!]
1311: \centerline{\hbox{\psfig{figure=fig_3D1.eps,height=4.5in,clip=on}}}
1312: \caption{
1313: 3D trajectory reconstructed for the sample event.
1314: }
1315: \label{fig:fig_3D1}
1316: \end{figure}
1317:
1318:
1319: The distribution in the difference between the initial momentum (Pin\_f)
1320: reconstructed by
1321: the Kalman filter and the generated initial momentum (Pin) is shown in
1322: Figure ~\ref{fig:pin_difb} in linear (a) and logarithmic (b) scale.
1323: According to this distribution the intrinsic tracker resolution is
1324: $\sigma$ = 0.12 MeV/c if one fits the distribution by a Gaussian in the range
1325: -0.3 - 0.7 MeV/c.
1326:
1327:
1328: \begin{figure}[htb!]
1329: \centerline{\hbox{%
1330: \psfig{figure=dif_back_lin.eps,height=3.0in,clip=on}
1331: \psfig{figure=dif_back_log.eps,height=3.0in,clip=on}}}
1332: \caption{
1333: Distribution in the difference between the input reconstructed momentum based on
1334: the Kalman filter and the simulated input momentum with background.
1335: }
1336: \label{fig:pin_difb}
1337: \end{figure}
1338:
1339:
1340: \begin{figure}[htb!]
1341: \centerline{\hbox{%
1342: \psfig{figure=pin_back_lin.eps,height=3.0in,clip=on}
1343: \psfig{figure=pin_back_log.eps,height=3.0in,clip=on}}}
1344: \caption{
1345: Distribution in the input momentum reconstructed by the
1346: Kalman filter with background.
1347: }
1348: \label{fig:pin_kalb}
1349: \end{figure}
1350:
1351:
1352: Figure ~\ref{fig:pin_kalb} shows a distribution in the input
1353: momentum (Pin\_f) reconstructed by the Kalman filter
1354: in linear (a) and logarithmic (b) scale with background.
1355: This distribution is characterized by the standard deviation $\sigma$ = 0.25 MeV/c of the
1356: reconstructed input momentum for a Gaussian fit
1357: in the range 104-106 MeV/c.
1358:
1359:
1360: Note that the trajectory
1361: reconstructed by the Kalman filter consists of
1362: many helix parts.
1363: The reconstructed input momentum resolution by the
1364: Kalman filter is $\sigma$ = 0.25 MeV/c. This resolution significantly
1365: better than the resolution $\sigma = 0.35$ MeV/c obtained
1366: by a single helix fit.
1367:
1368:
1369: The overall reconstruction acceptance is 22.1 $\%$ for muon conversion
1370: events with the momentum above a threshold momentum of 103.6
1371: MeV/c.
1372:
1373:
1374: Comparing these results with the results of the reconstruction
1375: without background we get the difference in tracker
1376: resolution 1.5 \% and the difference 2.7 \% in overall acceptance
1377: (see Appendix C).
1378: Therefore the tracker resolution and overall acceptance are not
1379: affected significantly at the considered background level
1380: (explained in Section 3).
1381:
1382:
1383:
1384:
1385: A summary of the critical selection criteria used in the momentum
1386: reconstruction is shown in Table ~\ref{table:tab5}. The
1387: efficiencies are for the selection criteria applied in consecutive
1388: order.
1389:
1390:
1391: An overall acceptance for muon conversion events with momentum
1392: above threshold momentum $P_{th}$ is 22.1\% .
1393: We define a threshold
1394: momentum,
1395: above which events are considered as the useful ones by
1396: $P_{th} = P_{max} - \Delta $, where $P_{max}$ = 104.3 MeV/c
1397: the most probable reconstructed momentum. If $\Delta $ = 0.7 MeV/c
1398: is chosen then $P_{th}$ = 103.6 MeV/c.
1399:
1400: \begin{center}
1401: \begin{table}[htb!]
1402: \caption {A summary of the reconstruction selection criteria }
1403: \begin{tabular}{|l|c|}
1404: \hline
1405: Selection criterion & Efficiency\\
1406: \hline
1407: Calorimeter energy above 80 MeV & 0.53\\
1408: Required pitch angle at the tracker & 0.86\\
1409: At least 15 hits in the tracker & 0.87\\
1410: Position match in the calorimeter & 0.95\\
1411: Requirements on fit quality & 0.79\\
1412: Detected energy above 103.6 MeV & 0.74\\
1413: \hline
1414: Overall acceptance & 0.22\\
1415: \hline
1416: \end{tabular}
1417: \label{table:tab5}
1418: \end{table}
1419: \end{center}
1420:
1421: The main factors entering into the experimental sensitivity are
1422: the running time, the proton intensity, the probability per proton
1423: that a $\mu$ is produced, transported and stopped in the stopping
1424: target, the fraction of stopped muons that are captured (as opposed to decay),
1425: the trigger efficiency and the tracker reconstruction acceptance.
1426: We do not include in this table loss of events due to accidental
1427: cosmic ray vetoes, dead-time losses and losses due to straw
1428: chamber inefficiencies, all of which are expected to be small.
1429: According to our analysis taking into account the straw efficiency
1430: 97\% the overall acceptance is reduced from 22.1\% to 21.7\% .
1431:
1432:
1433:
1434: Table ~\ref{table:tab6} shows expected MECO sensitivity for a one
1435: year ($10^7$ s) run.
1436:
1437: \begin{center}
1438: \begin{table}[htb!]
1439: \caption {A summary of the expected MECO sensitivity.}
1440: \begin{tabular}{|l|r|}
1441: \hline
1442: Running time (s)& $10^7$\\
1443: Proton flux (Hz)& $4 \cdot 10^{13}$\\
1444: \hline
1445: Probability of $\mu$/p transported and stopped in target & 0.0025\\
1446: $\mu$ capture probability & 0.6 \\
1447: Fraction of $\mu$ which are captured in time window & 0.49 \\
1448: Trigger efficiency and the selection criteria & 0.22 \\
1449: \hline
1450: Detected events for $R_{\mu e} = 10^{-16}$ & 6.5\\
1451: \hline
1452: \end{tabular}
1453: \label{table:tab6}
1454: \end{table}
1455: \end{center}
1456:
1457: Muon DIO events are the most important background for the experiment.
1458: The main background from muon DIO events in the presence of background tracker hits
1459: was simulated and reconstructed.
1460: Based on the simulated DIO events
1461: in the momentum range
1462: above 100 MeV/c the track pattern recognition and momentum reconstruction
1463: were performed in the presence of background tracker hits
1464: (protons, neutrons, photons, DIO) by applying the selection criteria
1465: discussed above.
1466:
1467:
1468: \begin{figure}[htb!]
1469: \centerline{\hbox{\psfig{figure=dio_gt_100.eps,height=3.5in,clip=on}}}
1470: \caption{
1471: Distributions of DIO reconstructed momentum in the presence of the background
1472: above 100 MeV/c .
1473: }
1474: \label{fig:dio}
1475: \end{figure}
1476:
1477:
1478: The number of primary DIO events simulated in the momentum range above 100 MeV/c
1479: was 10 times more
1480: than expected and 3 background events were found (see Figure ~\ref{fig:dio}).
1481: So the background is expected to be 0.3 events.
1482:
1483: It is important to note that input reconstructed momenta for
1484: these three background events are very close to
1485: simulated input momenta. The input simulated
1486: momenta 103.48, 103.6 and 103.74 MeV/c for these events have to be
1487: compared with input reconstructed momenta 103.6, 103.76 and 103.8 MeV/c,
1488: respectively.
1489:
1490: At the present level of pattern recognition and momentum
1491: reconstruction studies we can expect the background from DIO
1492: events in the range above 100 MeV/c $\sim $0.3 events compared
1493: to 6.5 signal events for $R_{\mu e} = 10^{-16}$.
1494:
1495:
1496: \section{Conclusion}
1497:
1498: A study of the impact of background on the performance of the
1499: transverse tracker proposed for the MECO experiment is presented.
1500: Background from capture protons, neutrons and photons, and from
1501: muon decay in orbit was generated using GEANT3.
1502: The effective average straw tube rate from these sources was 800 kHz
1503: at the proposed muon beam intensity of $2\times 10^{11} \mu^{-}/sec$.
1504:
1505: A pattern recognition
1506: procedure based on a Kalman filter technique was developed to
1507: suppress background and assign hits to tracks.
1508: In the first stage of this procedure, straw hit center
1509: coordinates, without drift time information, were used to reduce
1510: the background by a factor $\sim$ 130. In the second stage, the full
1511: drift time information
1512: and a deterministic annealing filter were used to obtain an
1513: additional six-fold suppression. The total suppression of 800 reduces
1514: the number of background hits on average from an initial value of
1515: 300 to approximately 0.4 per event. About 0.8 hits of the 29 real hits
1516: typically recorded, or 2.7 $\%$, are lost in the process.
1517:
1518: It was found that in the presence
1519: of background the
1520: resolution of the tracker is $\sigma = 0.12$ MeV/c and the overall
1521: setup acceptance for muon conversion
1522: events with momentum above the threshold momentum 103.6 MeV/c is
1523: about 22 $\%$.
1524: At the considered background level the tracker resolution and the
1525: overall acceptance are not affected significantly by presence of
1526: the background: the tracker resolution is changed by 1.5 \% and the
1527: overall acceptance by 2.7 \%.
1528:
1529: Additional constraints on the background, not considered here, may be
1530: imposed if the drift time measurement is supplemented by a
1531: measurement of the pulse amplitude at the anode wire. A straightforward,
1532: crude measurement of the amplitude is sufficient to reduce significantly
1533: the background from heavily ionizing particles, i.e., the capture
1534: protons that comprise 30 \% of the background hits in the above study.
1535: We estimate too that a significant background suppression, of 10-20,
1536: would be achieved if resistive anode wires were used in place of
1537: conducting wires to obtain a measurement of the hit position along
1538: the wire. This would improve the resolution as well.
1539:
1540: At the present level of pattern recognition and momentum
1541: reconstruction studies the background from DIO events above 100 MeV/c
1542: is about 0.3 events. This is to be compared
1543: to 6.5 signal events for $R_{\mu e} = 10^{-16}$.
1544:
1545: The study carried out shows that the developed procedures of pattern
1546: recognition and momentum reconstruction in the case of the transverse tracker
1547: provide a required precision for lepton number violation search at a sensitivity
1548: level about $10^{-17}$.
1549:
1550: We wish to thank A. Mincer, P.Nemethy, J.Sculli and one of us (R.K.) thanks
1551: W.Willis for fruitful discussions and helpful remarks.
1552:
1553: \begin{references}
1554: \bibitem{kuno} J.D.Vergados, Phys.Rep. {\bf 133}, 1, (1986);
1555: Y.Kuno and Y.Okada, Rev.Mod.Phys.{\bf 73}, 151 ,(2001).
1556: \bibitem{psi} P.Wintz, Proc. of the 1st Int.Symp. on Lepton and Baryon
1557: Number Violation, 1998, ed. by H.V.Klapdor-Kleingrothaus and
1558: I.V.Krivosheina (Inst. of Physics Publishing, Bristol and Philadelphia), 534.
1559: \bibitem{psigold} A. van der Schaaf, J.Phys.G: Nucl.Part.Phys. {\bf 29}, 1503, (2003).
1560: \bibitem{rashid} R.M.Djilkibaev and V.M.Lobashev, Sov.J.Nucl.Phys. {\bf 49}, 384, (1989).
1561: \bibitem{abadj} V.Abadjev {\it et al}, MELC proposal, INR preprint 786/92, (1992).
1562: \bibitem{bnl} M.Bachman {\it et al}, a research proposal to BNL AGS, (1997);
1563: NSF RSVP proposal (MECO), (1999).
1564: \bibitem{prime} S.Machida {\it et al}, a letter of intent to the J-PARC 50-GeV Proton
1565: Synchrotron Experiment, The PRIME Working Group, (2003).
1566: \bibitem{daf} R. Fr\"uhwirth and A. Stradlie, Comp.Phys.Comm. {\bf 120}, 197, (1999).
1567: \bibitem{rkalm} R.E.Kalman, Transactions of the ASME: J.Basic Engineering, {\bf D82},
1568: 35, (1960).
1569: \bibitem{kalman} P.Billoir, Nucl.Instr.Meth. {\bf 225}, 352, (1984); R.Fr\"uhwirth,
1570: {\it ibid}. {\bf A262}, 444, (1987); P.Billoir and S.Qian, {\it ibid}. {\bf A294},
1571: 219, (1990); E.J.Wolin and L.L.Ho, {\it ibid}. {\bf A329}, 493, (1993).
1572: \bibitem{mank} R.Mankel, Hera B note (1997).
1573: \end{references}
1574:
1575: \newpage
1576:
1577:
1578:
1579: \section*{APPENDIX\ A. KALMAN FILTER AND ITS\ APPLICATION\ TO\ TRACK\ FITTING}
1580:
1581: The Kalman filter is an algorithm that processes measurements to deduce an
1582: optimum estimate of the past, present, or future state of \ a dynamic system
1583: by using a time sequence of measurements of the system behavior, plus a
1584: statistical model that characterizes the system and measurements errors,
1585: plus initial condition information.
1586:
1587: The Kalman filter addresses the general problem of trying to
1588: estimate at different points ($1\leq k\leq n$) the state
1589: $\bf{x}_{k}$ of a discrete process that is governed by the
1590: linear stochastic difference equation
1591:
1592: \begin{equation}
1593: \bf{x}_{k}=\bf{F}_{k-1}\bf{x}_{k-1}+\bf{w}_{k-1}
1594: \label{1ap}
1595: \end{equation}
1596:
1597: with a measurement $\bf{m}_{k}$ that is
1598:
1599: \begin{equation}
1600: \bf{m}_{k}=\bf{H}\bf{x}_{k}+\bf{\varepsilon} _{k}.
1601: \label{2ap}
1602: \end{equation}
1603:
1604:
1605:
1606:
1607: The system equation (\ref{1ap}) is not deterministic since the
1608: track experiences stochastic processes such as multiple
1609: scattering, bremsstrahlung, etc. These processes are taken into
1610: account by the process noise $\bf{w}_{k}$ .
1611: $\bf{\varepsilon} _{k}$ represents the measurement noise.
1612: $\bf{w}_{k}$ and $\bf{\varepsilon} _{k}$ are assumed to be
1613: independent of each other with zero expectation values:
1614:
1615: $E\{\bf{w}_{k}\}=0,\qquad
1616: cov\{\bf{w}_{k}\}=\bf{Q}_{k}$ , \qquad $1\leq k\leq n,$
1617:
1618: $E\{\bf{\varepsilon} _{k}\}=0,\qquad cov\{\bf{\varepsilon} _{k}\}=\bf{V}_{k}$ , \qquad $%
1619: 1\leq k\leq n,$
1620:
1621: where $\bf{Q}_{k}$ and $\bf{V}_{k}$ are process noise and
1622: measurement noise covariances, respectively.
1623:
1624:
1625: Eq.(\ref{1ap}) in the absence of the last term is the standard
1626: equation of motion with a propagator $\bf{F}_{k-1}$ (transport
1627: matrix). Note that at the moment $\bf{F}_{k-1}$ is assumed to
1628: be constant.
1629:
1630: Regarding a track in space as a dynamic system the filtering
1631: technique is applied to the track fitting. For example, in the
1632: case of a particle moving in magnetic field this can be done
1633: naturally by
1634: identifying the state vector $\bf{x}_{k}$ of the dynamic system with a vector $%
1635: \bf{x}_{k}=(x,y,\tan \theta _{x},\tan \theta _{y},1/p_{L})$ of
1636: 5 parameters uniquely describing the track at each point of the
1637: trajectory. The $\bf{F}$ matrix propagates the state vector on
1638: one plane to the state vector on the next plane combining position
1639: information with directional information. The transport matrix
1640: implicitly contains information about a gap between planes.
1641:
1642: In general the set of parameters $\bf{x}_{k}$ is not measured
1643: directly; only a function of $\bf{x}_{k}$ ,
1644: $\bf{H}\bf{x}_{k}$ is observed. For example, in the case
1645: of the transverse tracker one does not measure $\bf{x}_{k}$
1646: but $x\prime =x\cos \alpha +y\sin \alpha $ in the chamber
1647: coordinate system which corresponds to
1648:
1649: \begin{equation}
1650: \bf{H}=(\cos\alpha ,\sin \alpha ,0,0,0) \label{2ap1}
1651: \end{equation}
1652:
1653:
1654:
1655: There are three types of operations to be performed in the analysis of a
1656: track.
1657:
1658: \begin{itemize}
1659: \item \textbf{Prediction} is the estimation of the ``future" state
1660: vector at
1661: position ``k" using all the ``past" measurements up to and including ``k-1". $%
1662: \bf{x}_{k}^{k-1}$ is a prediction (a priori state estimation).
1663:
1664: \item \textbf{Filtering }is the estimation of the state vector at
1665: position ``k" based upon all ``past" and ``present" measurements
1666: up to and including ``k". $\bf{x}_{k}^{k}$ is a filtered state
1667: vector (a posteriori state estimation).
1668:
1669: \item \textbf{Smoothing }is the estimation of the ``past" state
1670: vector at
1671: position ``k" based on all ``n" measurements taken up to the present time. $%
1672: \bf{x}_{k}^{n}$ is a smoothed state vector.
1673: \end{itemize}
1674:
1675: The first step to estimate $\bf{x}_{k}$ is \textbf{the
1676: prediction} (time update):
1677:
1678: \begin{equation}
1679: \bf{x}_{k}^{k-1}=\bf{F}_{k-1}\bf{x}_{k-1}^{k-1}
1680: \label{3ap}
1681: \end{equation}
1682:
1683: \begin{equation}
1684: \bf{C}_{k}^{k-1}=\bf{F}_{k-1}\bf{C}_{k-1}^{k-1}\bf{F}_{k-1}^{T}+\bf{Q}_{k-1}
1685: \label{4ap}
1686: \end{equation}
1687:
1688: where Eq.(\ref{3ap}) projects the state ahead and Eq.(\ref{4ap}) projects
1689: the error covariance ahead.
1690:
1691: \textbf{The filtered estimate} (measurement update)
1692: $\bf{x}_{k}^{k}$ is calculated as a weighted mean of the
1693: prediction and the observation:
1694:
1695: \begin{equation}
1696: \bf{K}_{k}=\bf{C}_{k}^{k-1}\bf{H}_{k}^{T}[\bf{H}_{k}\bf{C}_{k}^
1697: {k-1}\bf{H}_{k}^{T}+\bf{V}_{k}]^{-1}
1698: \label{5ap}
1699: \end{equation}
1700:
1701: \begin{equation}
1702: \bf{x}_{k}^{k}=\bf{x}_{k}^{k-1}+\bf{K}_{k}[\bf{m}_{k}-\bf{H}_{k}\bf{x}_{k}^{k-1}]
1703: \label{6ap}
1704: \end{equation}
1705:
1706: \begin{equation}
1707: \bf{C}_{k}^{k}=[\bf{I}-\bf{K}_{k}\bf{H}_{k}]\bf{C}_{k}^{k-1}.
1708: \label{7ap}
1709: \end{equation}
1710:
1711: Eq.(\ref{5ap}) computes the Kalman gain matrix defining the
1712: correction to the predicted state due to the current observation.
1713: Eq.(\ref{6ap}) updates the prediction with the measurement and
1714: Eq.(\ref{7ap}) updates the error covariance. The error covariance
1715: may be also expressed in a computationally superior form
1716:
1717: \begin{equation}
1718: \bf{C}_{k}^{k}=[\bf{I}-\bf{K}_{k}\bf{H}_{k}]
1719: \bf{C}_{k}^{k-1}[\bf{I}-\bf{K}_{k}\bf{H}_{k}]^{T}+\bf{K}_{k}\bf{V}_{k}\bf{K}_{k}^{T}.
1720: \end{equation}
1721:
1722: The filtering is a recursive operation. The prediction step and
1723: the filtering step are repeated for the next plane proceeding
1724: progressively from plane ``1" to plane ``n". The state vector at
1725: the last filtered point contains always the full information from
1726: all points.
1727:
1728: At each step one can calculate the filtered residuals
1729: $\bf{r}_{k}^{k}$ , the covariance matrix of the filtered
1730: residuals $\bf{R}_{k}^{k}$\ and the filtered $\chi ^{2}$:
1731:
1732: \[
1733: \bf{r}_{k}^{k}=\bf{m}_{k}-\bf{H}_{k}\bf{x}_{k}^{k}
1734: \]
1735:
1736: \[
1737: \bf{R}_{k}^{k}=\bf{V}_{k}-\bf{H}_{k}\bf{C}_{k}^{k}\bf{H}_{k}^{T}
1738: \]
1739:
1740: \[
1741: \chi
1742: _{k}^{2}=\bf{r}_{k}^{kT}(\bf{R}_{k}^{k})^{-1}\bf{r}_{k}^{k}
1743: \]
1744:
1745: where $\chi _{k}^{2}$ is $\chi ^{2}$ - distributed with
1746: dim($\bf{m}_{k}$) degrees
1747: of freedom. The total $\chi ^{2}$ of the track is given by the sum of the $%
1748: \chi _{k}^{2}$ contributions for each plane.
1749:
1750: The system of equations defining the Kalman filter represents an
1751: asymptotically stable system, and therefore, the estimate of the state
1752: vector $\bf{x}_{k}^{k}$ becomes independent on the starting point $\bf{x}_{0}^{0}$ , $%
1753: \bf{C}_{0}^{0}$ as k is increased.
1754:
1755: When the last plane (nth) is taken into account the Kalman filter
1756: performs the final step which is a smoothing. The filter runs
1757: backward in time updating all filtered state vectors on the basis
1758: of information from all n planes. The equations describing
1759: \textbf{the smoothing} are given by
1760:
1761: \[
1762: \bf{A}_{k}=\bf{C}_{k}^{k}\bf{F}_{k}^{T}(\bf{C}_{k+1}^{k})^{-1}
1763: \]
1764:
1765: \[
1766: \bf{x}_{k}^{n}=\bf{x}_{k}^{k}+\bf{A}_{k}(\bf{x}_{k+1}^{n}-\bf{x}_{k+1}^{k})
1767: \]
1768:
1769: \[
1770: \bf{C}_{k}^{n}=\bf{C}_{k}^{k}+\bf{A}_{k}(\bf{C}_{k+1}^{n}-\bf{C}_{k+1}^{k})\bf{A}_{k}^{T}
1771: \]
1772:
1773: \[
1774: \bf{r}_{k}^{n}=\bf{m}_{k}-\bf{H}_{k}\bf{x}_{k}^{n}
1775: \]
1776:
1777: \[
1778: \bf{R}_{k}^{n}=\bf{V}_{k}-\bf{H}_{k}\bf{C}_{k}^{n}\bf{H}_{k}^{T}
1779: \]
1780:
1781: Until now it was assumed that the problem of estimation of a discrete-time
1782: process is described by a linear stochastic differential equation.
1783: However for example in the presence of a magnetic field the track
1784: propagator $\bf{F}$ is non-linear.
1785: Let's assume that the process of a particle propagation is governed by the
1786: non-linear stochastic differential equation
1787:
1788: \begin{equation}
1789: x_{k}=f(x_{k-1})+w_{k-1} \label{8ap}
1790: \end{equation}
1791:
1792: with a measurement m in the form Eq.(\ref{2ap}). f is a non-linear
1793: function. The Kalman filter can be applied to this system by
1794: linearizing the system for example about the estimated trajectory.
1795: If deviations between the estimated trajectory and the actual
1796: trajectory remain sufficiently small the linear approximation is
1797: valid.
1798: The non-linear equation (\ref{8ap}) can be written down in the linearized form
1799: as
1800:
1801: \begin{equation}
1802: \bf{x}_{k}=\bf{f}(\bf{x}_{k-1}^{k-1})+\bf{F}
1803: \cdot(\bf{x}_{k-1}-\bf{x}_{k-1}^{k-1})+\bf{w}_{k-1}\label{10ap}
1804: \end{equation}
1805:
1806: where as before $\bf{x}_{k}$,$\bf{m}_{k}$ are the actual
1807: state and measurement vectors, $\bf{x}_{k}^{k}$ is a filtered
1808: estimate of the state at step k. $\bf{F}$ is Jacobian matrix
1809:
1810: \begin{equation}
1811: \bf{F}_{ij}=\partial
1812: \bf{f}_{i}(\bf{x}_{k-1}^{k-1})/\partial x_{j} \label{11ap}
1813: \end{equation}
1814:
1815: Therefore the complete set of extended Kalman filter equations is
1816: given by Eqs.(\ref{4ap})-(\ref{7ap}),(\ref{10ap}) by using F in
1817: the form (\ref{11ap}).
1818:
1819: In order to apply the extended Kalman filter to a track fitting
1820: for a particle moving in uniform magnetic field (the magnetic
1821: field is in z direction) one has to choose the state vector
1822: parameters, define the initial state vector and calculate the
1823: transport matrix $\bf{F}$, the projection matrix $\bf{H}$, and
1824: the noise matrix $\bf{Q}$.
1825: As it was mentioned above in this case the state vector can be chosen
1826: in the form ${\bf{x}_{k}}=(x,y,t_{x},t_{y},1/p_{L})$ where x, y
1827: are the track coordinates in the tracker system,
1828: $t_{x}=p_{x}/p_{L}$, $t_{y}=p_{y}/p_{L}$ define the track
1829: direction.
1830: The projection matrix H is given by Eq.(\ref{2ap1}).
1831: Due to multiple scattering the absolute value of electron momentum
1832: remains unaffected, while the direction is changed. This
1833: deflection can be described using two orthogonal scattering
1834: angles, which are also orthogonal to the particle momentum ~\cite{mank}.
1835: In terms of these variables the noise
1836: matrix is given by
1837:
1838: %\begin{equation}
1839: $$
1840: \bf{Q}_{k}=<\Theta ^{2}>(t_{x}^{2}+t_{y}^{2}+1) \left.\left(
1841: \begin{array}{ccccc}
1842: 0 & 0 & 0 & 0 & 0 \\
1843: 0 & 0 & 0 & 0 & 0 \\
1844: 0 & 0 & t_{x}^{2}+1 & t_{x}t_{y} & t_{x}/p_{L} \\
1845: 0 & 0 & t_{x}t_{y} & t_{y}^{2}+1 & t_{y}/p_{L} \\
1846: 0 & 0 & t_{x}/p_{L} & t_{y}/p_{L} & \frac{(t_{x}^{2}+t_{y}^{2})}{
1847: p_{L}^{2}(t_{x}^{2}+t_{y}^{2}+1)}
1848: \end{array}
1849: \right.\right)$$
1850: %\end{equation}
1851:
1852: For the variance of the multiple scattering angle the well-known
1853: expression is used
1854:
1855: \begin{equation}
1856: <\Theta^{2}>=(13.6MeV/p)^{2}[1+0.038\ln(t/X_{R})]t/X_{R}
1857: \end{equation}
1858:
1859: where $X_{R}$ is a radiation length, t is a distance traveled by
1860: the particle inside a scatterer.
1861: Energy losses are taken into account by
1862:
1863: \begin{equation}
1864: p\prime = p - <dE/dx>t.
1865: \end{equation}
1866:
1867: \qquad
1868:
1869:
1870: \section*{APPENDIX\ B. DETERMINISTIC ANNEALING FILTER}
1871:
1872: Track reconstruction in modern high energy physics
1873: experiments faces a significant
1874: amount of noise hits in a detector. The track fit thus is
1875: confronted with several competing hits in detector's layers.
1876: The Kalman filter (see Appendix A) is
1877: now widely used
1878: for the reconstruction of the track parameters in high energy
1879: physics. However the
1880: application of the Kalman filter requires that the problem
1881: of assignment of the detector hits to track candidates has been
1882: entirely resolved by the preceding selection procedure.
1883: If this is not the case, the filter has to run on every
1884: possible assignment to select the best one by chi-square
1885: criterion. Obviously this approach is computationally expensive
1886: and practically unfeasible for a considerable amount of noise
1887: hits. For this reason the Deterministic Annealing Filter (DAF)
1888: was developed ~\cite{daf}. In DAF there is an additional
1889: validation feature eliminating hits which are not compatible
1890: with the predicted track position.
1891:
1892: The deterministic annealing filter itself is a Kalman filter
1893: with re-weighted observations. The propagation part of DAF is
1894: identical to the standard Kalman filter.
1895:
1896: \textbf{The filtered estimate} (measurement update)
1897: $\bf{x}_{k}^{k}$ at layer k is calculated as a weighted mean of the
1898: prediction $\bf{x}_{k}^{k-1}$ and the observations
1899: {$\bf m_{k}^{i}, i=1,2,...n_{k}$}:
1900:
1901: \begin{equation}
1902: \bf x_{k}^{k}=\bf x_{k}^{k-1}+\bf K_{k}\sum _{i=1}^{{n}_{k}}
1903: p_{k}^{i}[\bf{m}_{k}^{i}-\bf{H}_{k}\bf{x}_{k}^{k-1}]
1904: \label{1bp}
1905: \end{equation}
1906:
1907: where ${\bf p_{k}^{i}}$ is the assignment probability of observation $\bf m_{k}^{i}$.
1908: $\bf{K}_{k}$ is the Kalman gain matrix which is given by
1909:
1910: \begin{equation}
1911: \bf{K}_{k}=[[\bf{C}_{k}^{k-1}]^{-1}+p_{k}\bf{H}_{k}^{T}\bf{V}_{k}^{-1}\bf{H}_{k}]^{-1}
1912: \bf{H}_{k}^{T}\bf{V}_{k}^{-1}
1913: \label{2bp}
1914: \end{equation}
1915:
1916: where ${\bf p_{k}}$ is the sum over all weights ${\bf p_{k}^{i}}$, $\bf{H}_{k}$ is the
1917: measurement matrix, $\bf{V}_{k}$ is the variance of the observations.
1918:
1919: The covariance matrix $\bf{C}_{k}^{k}$ of the updated estimate
1920: $\bf{x}_{k}^{k}$ is written as
1921:
1922: \begin{equation}
1923: \bf{C}_{k}^{k}=[[\bf{C}_{k}^{k-1}]^{-1}+p_{k}\bf{H}_{k}^{T}\bf{V}_{k}^{-1}\bf{H}_{k}]^{-1}.
1924: \label{3bp}
1925: \end{equation}
1926:
1927: After completion of the forward filter a backward filter runs
1928: in opposite direction, using the same weights as the forward
1929: filter. By taking a weighted mean of the filtered states of
1930: both filters at every layer a prediction for the state vector
1931: $\bf x_{k}^{n*}$ along with its covariance matrix $\bf {C}_{k}^{n*}$
1932: is obtained, using all hits except the ones at layer k. (The asterisk
1933: indicates that the information from layer k is not used in
1934: this prediction.) Initially all assignment probabilities for the
1935: hits in each layer
1936: are set
1937: to be equal but based on the estimated state vector
1938: $\bf x_{k}^{n*}$ and its covariance
1939: matrix, the assignment probabilities of all competing hits are then
1940: recalculated in the following way:
1941:
1942: \begin{equation}
1943: {\bf {p}_{k}^{i}} \sim \varphi(\bf{m}_{k}^{i};
1944: \bf{H}_{k}\bf x_{k}^{n*}, \bf{V}_{k}+\bf{H}_{k}\bf{C}_{k}^{n*}\bf{H}_{k}^{T})
1945: \label{4bp}
1946: \end{equation}
1947:
1948: where $\varphi(\bf x ; {\bf \mu},\bf V$) is a multivariate Gaussian
1949: probability density with mean vector ${\bf \mu}$ and covariance
1950: matrix $\bf V$.
1951:
1952: If the probability falls below a certain threshold, the hit is
1953: considered as the false one and is excluded from the list of the
1954: hits assigned to the track.
1955:
1956: However at this step we cannot be sure in calculated probabilities
1957: especially in the initial phase due to insufficient information for
1958: the filter. This problem is overcome by adopting a simulated annealing
1959: iterative procedure. This is an additional feature of DAF.
1960:
1961: The simulated annealing optimization algorithm is based on an
1962: analogy between the behavior of a material heated past its
1963: melting point that is slowly cooled (annealed) to form a single
1964: crystal. If the cooling proceeds slowly enough, the crystalline
1965: state reached at zero temperature will have all the atoms fixed
1966: in a perfect lattice structure, corresponding to the lowest
1967: possible energy of the system (global minimum).
1968:
1969: In the same way in track fitting the simulated annealing allows
1970: to avoid a local minimum and find the global one corresponding to
1971: the minimum chi-square for the track.
1972:
1973: DAF annealing algorithm can be described in the following way.
1974: The annealing schedule is chosen for example in the form
1975: ${\bf V_{N}}={\bf V}(A/f^{N}+1)$ where the annealing factor
1976: $f>1$ and factor $A>>1$. This provides that the initial
1977: variance is well above the nominal value
1978: ${\bf V}$ of the observation error but the final one tends
1979: to ${\bf V}$. After each iteration the assignment
1980: probabilities exceeding the threshold are normalized to 1
1981: and used again as weights in the next iteration, and so on.
1982: The iterations generally are stopped if the relative change in
1983: chi-square is less than correspondent control parameter
1984: (typically of the order 0.01).
1985:
1986: Since we deal with the stochastic process the best result
1987: can be reached repeating the DAF procedure for a few different
1988: annealing factors f and then choosing the result
1989: corresponding to the minimum chi-square.
1990:
1991:
1992:
1993: \newpage
1994:
1995:
1996: \section*{APPENDIX\ C. Tracker Resolution}
1997:
1998: This appendix demonstrates the results of application of the pattern
1999: recognition and reconstruction procedure for the conversion events
2000: without background.
2001:
2002:
2003: The distribution in the difference between the input reconstructed momentum (Pin\_f) based on
2004: the Kalman filter and the simulated input momentum (Pin) is shown in
2005: Figure ~\ref{fig:pin_dif} in linear (a) and logarithmic (b) scale.
2006: According to this distribution the intrinsic tracker resolution is
2007: $\sigma$ = 0.12 MeV if one fits the distribution by a Gaussian in the range
2008: -0.3 - 0.7 MeV.
2009:
2010:
2011: \begin{figure}[htb!]
2012: \centerline{\hbox{%
2013: \psfig{figure=dif_lin.eps,height=3.0in,clip=on}
2014: \psfig{figure=dif_log.eps,height=3.0in,clip=on}}}
2015: \caption{
2016: Distribution in the difference between the input reconstructed momentum based on
2017: the Kalman filter and the simulated input momentum without background.
2018: }
2019: \label{fig:pin_dif}
2020: \end{figure}
2021:
2022:
2023: \begin{figure}[htb!]
2024: \centerline{\hbox{%
2025: \psfig{figure=pin_real_lin.eps,height=3.0in,clip=on}
2026: \psfig{figure=pin_real_log.eps,height=3.0in,clip=on}}}
2027: \caption{
2028: Distribution in the input momentum reconstructed by the
2029: Kalman filter without background.
2030: }
2031: \label{fig:pin_kal}
2032: \end{figure}
2033:
2034:
2035: Figure ~\ref{fig:pin_kal} shows a distribution in the input
2036: momentum (Pin\_f) reconstructed by the Kalman filter
2037: in linear (a) and logarithmic (b) scale without background.
2038: This distribution is characterized by the standard deviation $\sigma$ = 0.25 MeV of the
2039: reconstructed input momentum for a Gaussian fit
2040: in the range 104-106 MeV.
2041: The overall reconstruction acceptance is 22.7 $\%$ for muon conversion
2042: events with the momentum above a threshold momentum of 103.6
2043: MeV/c.
2044:
2045: Comparing these results with the results of the reconstruction
2046: in the presence of the background we get the difference in tracker
2047: resolution 1.5 \% and the difference 2.7 \% in overall acceptance.
2048: Therefore the tracker resolution and overall acceptance are not
2049: affected significantly at the considered background level.
2050:
2051: \end{document}
2052: