1: \subsection{Neural Network Analysis}
2: \label{nn-analysis}
3: A neural network is a multivariate statistical technique for
4: separating signals from backgrounds.
5: We use the {\sc mlpfit}~\cite{mlpfit} package to construct and
6: implement the networks.
7: In order for a neural network to approach the
8: maximal signal-background separation, some optimization is
9: required. This occurs in three steps:
10: 1) judicious choice of signal and background pairs,
11: 2) selection of input variables, and
12: 3) optimization of training parameters.
13:
14: \subsubsection{Choice of Signal-Background Pairs}
15: \label{sec:nn-sig-back-pairs}
16: We have chosen to create networks trained on single
17: top quark signals against the two dominant backgrounds: $W$+jets
18: and {\ttbar}. For $W$+jets, we train using
19: a $Wb\bar{b}$ Monte Carlo sample as this process best
20: represents all $W$+jets processes. For {\ttbar},
21: we train on ${\ttbar}{\rar}\ell$+jets which is the
22: dominant background as opposed to the dilepton background
23: which is small.
24:
25: \subsubsection{Choice of Input Variables}
26: \label{sec:nn-input-vars}
27: We start from a set of discriminating variables that each
28: show some signal-background separation as discussed
29: in Sec.~\ref{discrim_vars}. Based on this, we
30: optimize the input variables for
31: each network by training with different combinations of variables
32: and choosing the combination that produces the
33: minimum testing error, which corresponds to the
34: best signal-background separation.
35:
36: We use the same variables
37: for the electron and muon channel. However, owing to different resolutions
38: and pseudorapidity ranges, we train the networks separately for the
39: two.
40:
41: \subsubsection{Neural Network Training}
42: Each network is composed of three
43: layers of nodes: input, hidden, and output. Testing and training event
44: sets are created from simulated signal and background samples.
45: We divide the input samples such that 60\% of the events are used for
46: training and the remaining 40\% for testing. Training is effected with
47: weighted events and the logarithm of all nonangular variables.
48: We use a technique called early stopping~\cite{early_stopping} to
49: determine the maximum number of epochs for training which prevents
50: over-training.
51:
52: Each network is further tuned by varying the number of hidden nodes
53: between 10 and 30
54: and then selecting the number of hidden nodes that
55: returns the smallest testing error.
56:
57: \subsubsection{Neural Network Results}
58: The above procedure produces eight unique networks:
59: two signals ($s$-channel, $t$-channel) $\times$ two
60: backgrounds ($Wb\bar{b}$, ${\ttbar}{\rar}\ell$+jets) $\times$ two lepton
61: flavors ($e$, $\mu$).
62:
63: Figures~\ref{nn-1d-output-training_tb} and~\ref{nn-1d-output-training_tqb}
64: show the output variable distributions
65: from the networks in the $s$-channel and $t$-channel searches for electrons
66: and muons. From the figures, it can be seen that these
67: networks are highly efficient at separating the single top quark signal from the
68: ${\ttbar}{\rar}\ell$+jets background. Studies have shown that these networks are
69: not as effective for the ${\ttbar}$ dilepton background, which is fortunately small.
70: The $s$-channel and $t$-channel networks are less efficient at separating
71: the single top quark signal from the $Wb\bar{b}$
72: background as compared to ${\ttbar}{\rar}\ell$+jets. In
73: addition, we find these networks are equally effective
74: in separating the $Wjj$ and the misidentified lepton background
75: as compared to the $Wb\bar{b}$ background.
76: It should be noted that the output variable from {\sc mlpfit} networks is
77: not restricted to lie between zero and one.
78:
79: Figures~\ref{nn-yield-compare-schan} and \ref{nn-yield-compare-tchan}
80: show comparisons of the summed
81: backgrounds to data for the $s$-channel and $t$-channel searches,
82: for electrons, muons, single-tagged, and double-tagged samples combined.
83: These distributions show that the background model reproduces the data very well.
84: From the figures, it can be seen that the ${\ttbar}{\rar}\ell$+jets
85: filters do indeed separate the ${\ttbar}$ background which clusters
86: near zero, but does not affect the $W$+jets and multijet backgrounds,
87: which cluster near one. Similarly, the $Wb\bar{b}$ filters
88: discriminate the $W$+jets and multijet backgrounds, which cluster
89: to the left of 0.5, but do not affect the ${\ttbar}$
90: background, which clusters to the right of 0.5.
91: They also show that separation of the single top quark signal from
92: background is not yet powerful enough since the background dominates
93: even in the regions where the signal peaks.
94:
95: Figure~\ref{nn-yield-2d-compare} shows the output of the
96: $tb$-${\ttbar}$ network versus the $tb$-$Wb\bar{b}$ network, and similarly
97: for the $tqb$ networks, again for electrons, muons, single-tagged, and
98: double-tagged events combined.
99:
100: \begin{figure*}[!h!tbp]
101: \includegraphics[width=0.40\textwidth]
102: {figures/nn_output/electron_wbb_filter_tb.eps}
103: \includegraphics[width=0.40\textwidth]
104: {figures/nn_output/electron_lepjets_filter_tb.eps}
105: \includegraphics[width=0.40\textwidth]
106: {figures/nn_output/muon_wbb_filter_tb.eps}
107: \includegraphics[width=0.40\textwidth]
108: {figures/nn_output/muon_lepjets_filter_tb.eps}
109: \caption[nn-1d-output-tb]{Neural network outputs in the $s$-channel.
110: This figure shows the signal-background separation for
111: (a) the filter for $Wb\bar{b}$ in the electron channel,
112: (b) the filter for ${\ttbar}{\rar}\ell$+jets in the electron channel,
113: (c) the filter for $Wb\bar{b}$ in the muon channel, and
114: (d) the filter for ${\ttbar}{\rar}\ell$+jets in the muon channel
115: where the background is the dashed-lined and the top quark
116: signal is the solid line. All the curves are
117: normalized to have equal area, so that the separation between signal
118: and background can be best seen.}
119: \label{nn-1d-output-training_tb}
120: \end{figure*}
121:
122: \begin{figure*}[!h!tbp]
123: \includegraphics[width=0.40\textwidth]
124: {figures/nn_output/electron_wbb_filter_tqb.eps}
125: \includegraphics[width=0.40\textwidth]
126: {figures/nn_output/electron_lepjets_filter_tqb.eps}
127: \includegraphics[width=0.40\textwidth]
128: {figures/nn_output/muon_wbb_filter_tqb.eps}
129: \includegraphics[width=0.40\textwidth]
130: {figures/nn_output/muon_lepjets_filter_tqb.eps}
131: \caption[nn-1d-output-tb]{Neural network outputs in the $t$-channel.
132: This figure shows the signal-background separation for
133: (a) the filter for $Wb\bar{b}$ in the electron channel,
134: (b) the filter for ${\ttbar}{\rar}\ell$+jets in the electron channel,
135: (c) the filter for $Wb\bar{b}$ in the muon channel, and
136: (d) the filter for ${\ttbar}{\rar}\ell$+jets in the muon channel
137: where the background is the dashed-lined and the top quark
138: signal is the solid line. All the curves are
139: normalized to have equal area, so that the separation between signal
140: and background can be best seen.}
141: \label{nn-1d-output-training_tqb}
142: \end{figure*}
143:
144: \begin{figure*}[!h!tbp]
145: \includegraphics[width=0.45\textwidth]
146: {figures/nn_output/Add_tbtt_filter_aw.eps}
147: \includegraphics[width=0.45\textwidth]
148: {figures/nn_output/Add_tbwbb_filter_aw.eps}
149: \caption{Comparison of signal, background, and
150: data for the neural network outputs in the $s$-channel, for the electron and muon
151: channels combined, requiring at least one $b$-tag. This figure shows
152: (a) the $\ttbar$ filter and
153: (b) the $Wb\bar{b}$ filter.
154: Signals are multiplied by ten.}
155: \label{nn-yield-compare-schan}
156: \end{figure*}
157:
158: \begin{figure*}[!h!tbp]
159: \includegraphics[width=0.45\textwidth]
160: {figures/nn_output/Add_tqbtt_filter_aw.eps}
161: \includegraphics[width=0.45\textwidth]
162: {figures/nn_output/Add_tqbwbb_filter_aw.eps}
163: \caption{Comparison of signal, background, and
164: data for the neural network outputs in the $t$-channel, for the electron and muon
165: channels combined, requiring at least one $b$-tag. This figure shows
166: (a) the $\ttbar$ filter and
167: (b) the $Wb\bar{b}$ filter.
168: Signals are multiplied by ten.}
169: \label{nn-yield-compare-tchan}
170: \end{figure*}
171:
172: \begin{figure*}[!h!tbp]
173: \includegraphics[width=0.45\textwidth]
174: {figures/nn_output/Add_tb_filter_all.eps}
175: \includegraphics[width=0.45\textwidth]
176: {figures/nn_output/Add_tqb_filter_all.eps}
177: \caption{Neural network outputs for both the $\ttbar$ versus $Wb\bar{b}$
178: filters in the (a) $s$-channel
179: (b) and $t$-channel analyses. The background sum is shown as the shaded
180: area, the signal as contour lines, and the data as stars.}
181: \label{nn-yield-2d-compare}
182: \end{figure*}
183:
184:
185: