1: %
2: % References should be done using the \cite, \ref, and \label commands
3: %\section{\label{sec:introduction}Introduction}
4:
5: % figures should be put into the text as floats.
6: % Here is an example of the general form of a figure:
7: % Fill in the caption in the braces of the \caption{} command. Put the label
8: % that you will use with \ref{} command in the braces of the \label{} command.
9: % Use the figure* environment if the figure should span across the
10: % entire page. There is no need to do explicit centering.
11:
12: % \begin{figure}
13: % \includegraphics{}%
14: % \caption{\label{}}
15: % \end{figure}
16:
17: % tables should appear as floats within the text
18: % Here is an example of the general form of a table:
19: % Fill in the caption in the braces of the \caption{} command. Put the label
20: % that you will use with \ref{} command in the braces of the \label{} command.
21: % Insert the column specifiers (l, r, c, d, etc.) in the empty braces of the
22: % \begin{tabular}{} command.
23: % The ruledtabular enviroment adds doubled rules to table and sets a
24: % reasonable default table settings.
25: % Use the table* environment to get a full-width table in two-column
26: % Add \usepackage{longtable} and the longtable (or longtable*}
27: % environment for nicely formatted long tables. Or use the the [H]
28: % placement option to break a long table (with less control than
29: % in longtable).
30: % \begin{table}%[H] add [H] placement to break table across pages
31: % \caption{\label{}}
32: % \begin{ruledtabular}
33: % \begin{tabular}{}
34: % Lines of table here ending with \\
35: % \end{tabular}
36: % \end{ruledtabular}
37: % \end{table}
38:
39:
40: \section{\label{sec:data}Data Samples and Event Selection}
41:
42: \subsection{Colliding Beam Data}
43:
44: The colliding beam data used in this analysis were recorded during the
45: period March 2002 - August 2003, when the instantaneous Tevatron
46: luminosity ranged from 0.5 to 4.0~$\times 10^{31} ~{\rm cm^{-2}
47: s^{-1}}$.
48:
49: %Collision data are typically recorded for an uninterrupted
50: %interval of several hours, called a "Run". Each run has a separate set
51: %of calibrations for all detector components. Calibrations that may
52: %vary over a Run, such as beam luminosity and beam positions, are done
53: %for "Run Sections" of 500,000 events each. In order to insure precise
54: %understanding of our trigger and selection efficiencies, the quality
55: %of the data is examined in each run. All detector components must be
56: %within operating specifications in order for data for that run to be
57: %used in the analysis.
58:
59: Cherenkov light detectors in the very forward region ($|\eta|\geq
60: 3.7$) record information on the instantaneous and total integrated
61: luminosity of the Tevatron~\cite{clc}. The total integrated
62: luminosity for this period is $193 \pm 12~{\rm pb}^{-1}$; after quality
63: requirements on the silicon tracking, the data sample used for this
64: analysis amounts to $162 \pm 10~{\rm pb}^{-1}$ for CEM electrons and CMUP muons,
65: and $150 \pm 9~{\rm pb}^{-1}$ for CMX muons.
66:
67: For the primary data samples used in this analysis, the detector is
68: triggered on high momentum electrons and muons. The electron hardware
69: triggers require an XFT track with $p_T \geq 8 ~{\rm GeV/c}$ matched
70: to an EM cluster with $E_T \geq 16$ GeV and the ratio of hadronic to
71: electromagnetic energy less than 0.125. The muon hardware triggers
72: require an XFT track with $p_T \geq 8 ~{\rm GeV/c}$ matched to muon
73: stubs in the joint CMUP configuration or in the CMX. A complete
74: version of the offline lepton selection is performed online in the
75: last stage of triggering, and repeated in offline processing with
76: updated calibration constants. Other secondary datasets described in
77: Sec.~\ref{sec:secvtx} use a jet trigger with a certain $E_T$ threshold
78: or an electron trigger with relaxed $E_T$ requirements.
79:
80: \subsection{Monte Carlo Samples}
81:
82: The understanding of acceptances, efficiencies, and backgrounds relies on
83: detailed simulation of physics processes and the detector response.
84: Most measurements of acceptance and efficiency rely on
85: {\sc pythia}~v6.2~\cite{pythia} or {\sc herwig}~v6.4~\cite{herwig1,herwig2}.
86: These generators employ leading order matrix elements for the hard parton
87: scattering, followed by parton showering to simulate gluon radiation and
88: fragmentation. Each generator is used in conjunction with the CTEQ5L
89: parton distribution functions~\cite{cteq}. For heavy flavor jets, we
90: interface to {\sc qq}~v9.1~\cite{qq} to provide proper
91: modeling of $b$ and $c$ hadron decays.
92:
93: The estimate of the $b$-tagging backgrounds due to higher order QCD
94: processes such as $Wb\bar b$ requires special care.
95: %Although PYTHIA and
96: %HERWIG properly model the case where a gluon jet ``splits'' into
97: %collinear heavy flavor quarks, these generators fail in the
98: %non-radiative configurations where the quarks are widely separated.
99: This study of backgrounds in the $b$-tagged sample uses the
100: {\sc alpgen} program~\cite{alpgen}, which generates high multiplicity
101: partonic final states using exact leading-order matrix elements. The
102: parton level events are then passed to {\sc herwig} and {\sc qq} for
103: parton showering and $b$ and $c$ hadron decay. Further discussion of
104: {\sc alpgen} can be found in Sec.~\ref{sec:hffracs}.
105:
106: The CDF II detector simulation reproduces the response of the detector
107: to particles produced in $p\bar p$ collisions. The same detector
108: geometry database is used in both the simulation and the
109: reconstruction, and tracking of particles through matter is performed
110: with {\sc geant3}~\cite{geant3}. Charge deposition in the silicon
111: detectors is calculated using a simple geometrical model based on the
112: path length of the ionizing particle and an unrestricted Landau
113: distribution. The drift model for the COT uses a parameterization of
114: a {\sc garfield} simulation, with the parameters tuned to match COT
115: data~\cite{cot}. The calorimeter simulation uses the {\sc
116: gflash}~\cite{gflash} parameterization package interfaced with {\sc
117: geant3}. The {\sc gflash} parameters are tuned to test beam data
118: for electrons and high-$p_T$ pions, and they are checked by comparing
119: the calorimeter energy of isolated tracks in the collision data to
120: their momenta as measured in the COT. Further detail on the CDF II
121: simulation can be found elsewhere~\cite{paulini}.
122:
123: \subsection{$W$ + Jets Selection}
124:
125: The selection identifies events consistent with the $W$ + jets
126: signature containing a high-momentum electron or muon (hereafter
127: referred to as ``lepton,'' $\ell$), large missing transverse energy, and
128: hadronic jets. The event selection is summarized below.
129:
130: The offline electron selection requires an EM cluster with $E_T \geq
131: 20$ GeV matched to a track with $p_T \geq 10\,\mathrm{GeV}/c$. The
132: cluster is required to have an electromagnetic fraction and shower shape
133: consistent with an electron deposit. The extrapolated track is
134: required to match the shower location as measured in the shower
135: maximum strip detector, and to have a momentum consistent with the
136: shower energy. Finally, since the electron from $W$ decay is expected
137: to be isolated from other energy deposits in the calorimeter, the
138: energy in a cone of radius $\Delta R = 0.4$ around the electron
139: cluster, but not including the cluster itself, is measured, and the
140: isolation ratio of the energy in the cone to the energy of the
141: electron is required to be less than 0.1.
142:
143: Photon conversions in the detector material are a source of electron
144: backgrounds. A conversion is defined as a pair of tracks (one of them
145: the electron) satisfying the following cuts:
146: \begin{itemize}
147: \item oppositely charged,
148: \item $|\Delta(xy)|<2\,\mathrm{mm}$, and
149: \item $|\Delta(\cot\theta)|<0.04$,
150: \end{itemize}
151: \noindent where $\Delta(xy)$ is the distance between the tracks in the
152: $r-\phi$ plane at the point where they are parallel in that plane, and
153: $\Delta(\cot\theta)$ is the difference between the cotangents of the
154: polar angles of the two tracks.
155: Electrons that are part of an identified conversion
156: pair are not considered further in the electron selection.
157:
158: The offline muon selection requires a COT track with $p_T \geq
159: 20\,\mathrm{GeV}/c$ matched to a CMUP or CMX muon stub. The matching
160: is based on the extrapolated track position at the chambers, accounting
161: for the effects of multiple scattering. The
162: energy in the calorimeter tower containing the muon is required to be
163: consistent with the deposition expected from a minimum ionizing
164: particle. Backgrounds from cosmic rays are removed by requiring that
165: the track extrapolates to the origin, and that the minimum ionizing
166: tower energy deposit is within a narrow timing window around the beam
167: crossing.
168:
169: In these high momentum lepton samples, the signal of the neutrino from
170: $W \rightarrow \ell\nu$ is large missing transverse energy, $\met$.
171: The $\met$ is calculated as the vector sum of the energy in
172: each calorimeter tower multiplied by the azimuthal direction of the
173: tower. If isolated high momentum muons are found in the event, the
174: $\met$ is corrected by subtracting the muon energy in the calorimeter
175: and adding the muon $p_T$ to the vector sum. The selection finally
176: requires $\met \geq 20\,\mathrm{GeV}$.
177:
178: In addition to the direct $t \rightarrow e \nu_e b$ and
179: $t \rightarrow \mu \nu_{\mu} b$ modes, this event selection has a
180: small acceptance for top final states with $W \rightarrow \tau \nu$ and
181: a subsequent leptonic $\tau$ decay, or with high momentum semi-leptonic
182: b quark decays.
183: These are included in the signal acceptances calculated in Sec. IX.
184:
185: $Z$ bosons and top dilepton decays that contribute to the inclusive
186: high $p_T$ lepton dataset are removed by flagging the presence of a
187: second lepton. Any event with two leptons satisfying the lepton
188: identification is removed, as well as those events where the second
189: lepton is an electron in the plug calorimeter or a muon that fails the
190: CMUP requirement, but has one CMU or CMP muon segment. Finally, we attempt
191: to remove $Z$ bosons without a well identified second lepton by eliminating
192: events with one lepton and certain second objects which form an invariant
193: mass between 76 and 106~GeV/$c^2$ with the primary lepton. For primary
194: muons the other object is an
195: opposite-signed isolated track with $p_T > 10$ GeV/c. For primary
196: electrons the second object may be such a track, an electromagnetic
197: cluster, or a jet with $E_T > 15$ GeV and $|\eta| \leq 2.0$ that has
198: fewer than three tracks and an electromagnetic energy fraction greater
199: than 95\%. The correction for the residual $Z$ boson contribution to
200: the $W$ + jets sample is described in Section~\ref{sec:checks}. Small
201: contributions from $Z \rightarrow \tau \bar{\tau}$ where a $\tau$ is tagged
202: are treated as a separate background, and described in Section~\ref{sec:m2bkg}.
203:
204: \begin{figure}
205: \includegraphics[width=0.5\textwidth]{Wmass1jet.eps}%
206: \caption{\label{fig:wmass}Transverse mass of the identified lepton
207: and inferred neutrino, consistent with $W$ boson production
208: (162 $\mathrm{pb}^{-1}$ data sample).}
209: \end{figure}
210:
211: The number of jets produced in association with the leptonically
212: decaying $W$ in the event is measured by selecting jets of cone radius
213: $\Delta R = 0.4$, with $E_T \geq 15\,\mathrm{GeV}$ and $|\eta| \leq
214: 2.0$. The jets are clustered after removing towers associated with
215: the selected isolated electron from the leptonic $W$ decay, and after
216: correcting the tower $E_T$ for the location of the primary vertex $z$
217: coordinate. The number of events in each jet multiplicity bin is shown
218: in Table~\ref{tab:pretag}. The overall ${\rm acceptance \times
219: efficiency}$ of this selection for $t\bar t$ events in the
220: lepton+jets channel with three or more jets, including the leptonic
221: branching ratios, is roughly 4\% for the electron channel, 2\% for
222: muons in the CMUP, and 1\% for muons in the CMX.
223:
224: The presence of the $W$ boson in the selected events is verified by
225: calculating the transverse mass of the lepton and the missing energy:
226: $M_T = \sqrt{(E_T(\ell)+E_T(\nu))^2 -
227: (\overrightarrow{P_T}(\ell)+\overrightarrow{P_T}(\nu))^2}$. The
228: distribution of this variable for all events passing the requirement
229: of a lepton, missing energy, and at least one jet is shown in
230: Fig.~\ref{fig:wmass}, and displays the Jacobian edge associated with
231: $W$ production and decay.
232:
233: \begin{table*}
234: \caption{\label{tab:pretag}Number of events selected, before
235: $b$-tagging, for each jet multiplicity.}
236: \begin{ruledtabular}
237: \begin{tabular}{ccccccc}
238: & $W$ + 1 jet & $W$ + 2 jets & $W$ + 3 jets & $W$ + 4 jets & $W$ + 3 jets & $W$ + 4 jets \\
239: & & & \multicolumn{2}{c}{$H_T > 0$} & \multicolumn{2}{c}{$H_T >
240: 200$~GeV} \\ \hline
241: Electrons & 8828 & 1446 & 241 & 70 & 117 & 63 \\
242: Muons & 6486 & 1002 & 146 & 37 & 63 & 28 \\
243: \end{tabular}
244: \end{ruledtabular}
245: \end{table*}
246:
247: As a final optimization step, the selection will incorporate an
248: additional cut on the total transverse energy $H_T$ of all objects in
249: the event. Events from $t\bar t$ production have, on average, a
250: significantly greater total transverse energy than background events.
251: The optimization of this requirement and acceptance
252: corrections and uncertainties will be discussed in
253: Sec.~\ref{sec:optimization}.
254:
255: Because the $t\bar t$ signal is expected to contribute significantly to
256: the sample of events with $W$ + 3 jets or $W$ + $\geq 4$ jets, an excess
257: of observed events over the expected background with those jet multiplicities
258: is assumed to be entirely due to $t\bar t$ production. The observed results
259: for events with $W$ + 1 jet or $W$ + 2 jets, where the $t\bar t$ contribution
260: is negligible, serve as a check of the background prediction.
261:
262: In Reference~\cite{chicago}, the momentum spectrum of the leading jets in
263: the $W$ + 1 and 2 jet events is shown to be a reasonable model of the
264: backgrounds in the $W$ + 3 or 4 jet events, and is used in deriving a
265: completely independent estimate of the b-tag backgrounds to top
266: production in the $W$ + 3 or 4 jet channels. The estimated background,
267: $18 \pm 4$ events, is in good agreement with our overall
268: estimate of $23 \pm 3$ (for $H_T>0$), derived from an explicit calculation
269: for each contributing background process (see Sec.\ref{sec:m2bkg}). The
270: independence of these background estimates allows for a combined
271: cross section calculation which will appear in a future paper.
272: We note here that the technique of Reference~\cite{chicago} will
273: work only in the study of top quarks in the $W$ plus jet mode, and we
274: consider it a cross-check on our more general technique for
275: calculating b-tag backgrounds, which will be employed in other
276: b-tagging analyses at CDF.
277:
278: The final cross section calculation, $\sigma_{t\bar t} = (N_\mathrm{obs} -
279: N_\mathrm{bkg})/(\epsilon_{t\bar t} \times \mathcal{L})$, depends on the
280: product $\epsilon_{t\bar t}$ of signal acceptance and selection efficiency, the
281: expected number of non-$t\bar t$ background events $N_\mathrm{bkg}$, and
282: the integrated luminosity $\mathcal L$.
283:
284:
285:
286:
287:
288:
289:
290:
291:
292:
293:
294:
295: