0604:hep-ex0604020/nn

1: \subsection{Neural Network Analysis}

2: \label{nn-analysis}

3: A neural network is a multivariate statistical technique for

4: separating signals from backgrounds.

5: We use the {\sc mlpfit}~\cite{mlpfit} package to construct and

6: implement the networks.

7: In order for a neural network to approach the

8: maximal signal-background separation, some optimization is

9: required. This occurs in three steps:

10: 1) judicious choice of signal and background pairs,

11: 2) selection of input variables, and

12: 3) optimization of training parameters.

13:

14: \subsubsection{Choice of Signal-Background Pairs}

15: \label{sec:nn-sig-back-pairs}

16: We have chosen to create networks trained on single

17: top quark signals against the two dominant backgrounds: $W$+jets

18: and {\ttbar}.  For $W$+jets, we train using

19: a $Wb\bar{b}$ Monte Carlo sample as this process best

20: represents all $W$+jets processes.  For {\ttbar},

21: we train on ${\ttbar}{\rar}\ell$+jets which is the

22: dominant background as opposed to the dilepton background

23: which is small.

24:

25: \subsubsection{Choice of Input Variables}

26: \label{sec:nn-input-vars}

27: We start from a set of discriminating variables that each

28: show some signal-background separation as discussed

29: in Sec.~\ref{discrim_vars}.  Based on this, we

30: optimize the input variables for

31: each network by training with different combinations of variables

32: and choosing the combination that produces the

33: minimum testing error, which corresponds to the

34: best signal-background separation.

35:

36: We use the same variables

37: for the electron and muon channel. However, owing to different resolutions

38: and pseudorapidity ranges, we train the networks separately for the

39: two.

40:

41: \subsubsection{Neural Network Training}

42: Each network is composed of three

43: layers of nodes: input, hidden, and output.  Testing and training event

44: sets are created from simulated signal and background samples.

45: We divide the input samples such that 60\% of the events are used for

46: training and the remaining 40\% for testing. Training is effected with

47: weighted events and the logarithm of all nonangular variables.

48: We use a technique called early stopping~\cite{early_stopping} to

49: determine the maximum number of epochs for training which prevents

50: over-training.

51:

52: Each network is further tuned by varying the number of hidden nodes

53: between 10 and 30

54: and then selecting the number of hidden nodes that

55: returns the smallest testing error.

56:

57: \subsubsection{Neural Network Results}

58: The above procedure produces eight unique networks:

59: two signals ($s$-channel, $t$-channel) $\times$ two

60: backgrounds ($Wb\bar{b}$, ${\ttbar}{\rar}\ell$+jets) $\times$ two lepton

61: flavors ($e$, $\mu$).

62:

63: Figures~\ref{nn-1d-output-training_tb} and~\ref{nn-1d-output-training_tqb}

64: show the output variable distributions

65: from the networks in the $s$-channel and $t$-channel searches for electrons

66: and muons. From the figures, it can be seen that these

67: networks are highly efficient at separating the single top quark signal from the

68: ${\ttbar}{\rar}\ell$+jets background. Studies have shown that these networks are

69: not as effective for the ${\ttbar}$ dilepton background, which is fortunately small.

70: The $s$-channel and $t$-channel networks are less efficient at separating

71: the single top quark signal from the $Wb\bar{b}$

72: background as compared to ${\ttbar}{\rar}\ell$+jets. In

73: addition, we find these networks are equally effective

74: in separating the $Wjj$ and the misidentified lepton background

75: as compared to the $Wb\bar{b}$ background.

76: It should be noted that the output variable from {\sc mlpfit} networks is

77: not restricted to lie between zero and one.

78:

79: Figures~\ref{nn-yield-compare-schan} and \ref{nn-yield-compare-tchan}

80: show comparisons of the summed

81: backgrounds to data for the $s$-channel and $t$-channel searches,

82: for electrons, muons, single-tagged, and double-tagged samples combined.

83: These distributions show that the background model reproduces the data very well.

84: From the figures, it can be seen that the ${\ttbar}{\rar}\ell$+jets

85: filters do indeed separate the ${\ttbar}$ background which clusters

86: near zero, but does not affect the $W$+jets and multijet backgrounds,

87: which cluster near one.  Similarly, the $Wb\bar{b}$ filters

88: discriminate the $W$+jets and multijet backgrounds, which cluster

89: to the left of 0.5, but do not affect the ${\ttbar}$

90: background, which clusters to the right of 0.5.

91: They also show that separation of the single top quark signal from

92: background is not yet powerful enough since the background dominates

93: even in the regions where the signal peaks.

94:

95: Figure~\ref{nn-yield-2d-compare} shows the output of the

96: $tb$-${\ttbar}$ network versus the $tb$-$Wb\bar{b}$ network, and similarly

97: for the $tqb$ networks, again for electrons, muons, single-tagged, and

98: double-tagged events combined.

99:

100: \begin{figure*}[!h!tbp]

101: \includegraphics[width=0.40\textwidth]

102: {figures/nn_output/electron_wbb_filter_tb.eps}

103: \includegraphics[width=0.40\textwidth]

104: {figures/nn_output/electron_lepjets_filter_tb.eps}

105: \includegraphics[width=0.40\textwidth]

106: {figures/nn_output/muon_wbb_filter_tb.eps}

107: \includegraphics[width=0.40\textwidth]

108: {figures/nn_output/muon_lepjets_filter_tb.eps}

109: \caption[nn-1d-output-tb]{Neural network outputs in the $s$-channel.

110: This figure shows the signal-background separation for

111: (a) the filter for $Wb\bar{b}$ in the electron channel,

112: (b) the filter for ${\ttbar}{\rar}\ell$+jets in the electron channel,

113: (c) the filter for $Wb\bar{b}$ in the muon channel, and

114: (d) the filter for ${\ttbar}{\rar}\ell$+jets in the muon channel

115: where the background is the dashed-lined and the top quark

116: signal is the solid line. All the curves are

117: normalized to have equal area, so that the separation between signal

118: and background can be best seen.}

119: \label{nn-1d-output-training_tb}

120: \end{figure*}

121:

122: \begin{figure*}[!h!tbp]

123: \includegraphics[width=0.40\textwidth]

124: {figures/nn_output/electron_wbb_filter_tqb.eps}

125: \includegraphics[width=0.40\textwidth]

126: {figures/nn_output/electron_lepjets_filter_tqb.eps}

127: \includegraphics[width=0.40\textwidth]

128: {figures/nn_output/muon_wbb_filter_tqb.eps}

129: \includegraphics[width=0.40\textwidth]

130: {figures/nn_output/muon_lepjets_filter_tqb.eps}

131: \caption[nn-1d-output-tb]{Neural network outputs in the $t$-channel.

132: This figure shows the signal-background separation for

133: (a) the filter for $Wb\bar{b}$ in the electron channel,

134: (b) the filter for ${\ttbar}{\rar}\ell$+jets in the electron channel,

135: (c) the filter for $Wb\bar{b}$ in the muon channel, and

136: (d) the filter for ${\ttbar}{\rar}\ell$+jets in the muon channel

137: where the background is the dashed-lined and the top quark

138: signal is the solid line. All the curves are

139: normalized to have equal area, so that the separation between signal

140: and background can be best seen.}

141: \label{nn-1d-output-training_tqb}

142: \end{figure*}

143:

144: \begin{figure*}[!h!tbp]

145: \includegraphics[width=0.45\textwidth]

146: {figures/nn_output/Add_tbtt_filter_aw.eps}

147: \includegraphics[width=0.45\textwidth]

148: {figures/nn_output/Add_tbwbb_filter_aw.eps}

149: \caption{Comparison of signal, background, and

150: data for the neural network outputs in the $s$-channel, for the electron and muon

151: channels combined, requiring at least one $b$-tag.  This figure shows

152: (a) the $\ttbar$ filter and

153: (b) the $Wb\bar{b}$ filter.

154: Signals are multiplied by ten.}

155: \label{nn-yield-compare-schan}

156: \end{figure*}

157:

158: \begin{figure*}[!h!tbp]

159: \includegraphics[width=0.45\textwidth]

160: {figures/nn_output/Add_tqbtt_filter_aw.eps}

161: \includegraphics[width=0.45\textwidth]

162: {figures/nn_output/Add_tqbwbb_filter_aw.eps}

163: \caption{Comparison of signal, background, and

164: data for the neural network outputs in the $t$-channel, for the electron and muon

165: channels combined, requiring at least one $b$-tag.  This figure shows

166: (a) the $\ttbar$ filter and

167: (b) the $Wb\bar{b}$ filter.

168: Signals are multiplied by ten.}

169: \label{nn-yield-compare-tchan}

170: \end{figure*}

171:

172: \begin{figure*}[!h!tbp]

173: \includegraphics[width=0.45\textwidth]

174: {figures/nn_output/Add_tb_filter_all.eps}

175: \includegraphics[width=0.45\textwidth]

176: {figures/nn_output/Add_tqb_filter_all.eps}

177: \caption{Neural network outputs for both the $\ttbar$ versus $Wb\bar{b}$

178: filters in the (a) $s$-channel

179: (b) and $t$-channel analyses. The background sum is shown as the shaded

180: area, the signal as contour lines, and the data as stars.}

181: \label{nn-yield-2d-compare}

182: \end{figure*}

183:

184:

185: