R. D. Field - Recent Publications

A Topological Analysis of the Top Quark Signal and Background at Hadron Colliders

R. D. Field, Y. Kanev, and M. Tayebnejad

Institute for Fundamental Theory, Department of Physics, University of Florida, Gainesville, FL 32611

Published in Phys. Rev. D55, 5685 (1997).

Abstract

We investigate the event topology of the lvb decay mode of top-pair production in proton-antiproton collisions at 1.8 TeV. Neural networks and Fisher discriminates are used in conjunction with modified Fox-Wolfram "shape" variables to help distinguish the top-pair signal from the W+jets and b+jets background. Instead of requiring at least four jets in the event, we employ a cut on the number of calorimeter cells with transverse energy greater than some minimum. By combining these cell cuts with the event shape information, we are able to obtain a signal to background ratio of around 9 (for an ideal detector), without tagging b-quarks. B-quark tagging would, of course, further enhance the signal to background ratio.

I. Introduction

The top quark decays into a b-quark and a W boson (tbW). The W boson decays into a lepton (electron or muon) and a neutrino about 22% (2/9) of the time and into a quark-antiquark pair about 67% (6/9) of the time. This implies that when top-pairs are produced in hadron-hadron collisions, t+X, both of the W bosons decay into a lepton and neutrino only about 5% of the time resulting in the final state consisting of two leptons, two neutrinos, and two b-quarks (llvvb). This distinctive final state constitutes the "discovery" mode of the top quark at hadron colliders [1,2]. On the other hand, it is considerable more likely for one of the W bosons to decay into a quark-antiquark pair resulting in a final state consisting of a lepton, a neutrino, a b, and a pair. The lvb mode shown in Fig. 1a occurs about 35% of the time or about 7 times more often than the purely leptonic mode. The backgrounds are larger for this decay mode, but so is the signal. When each of the four outgoing quarks produce a distinct jet, then the resulting event contains a lepton, a neutrino, and four jets (lvjjjj). This decay mode is used to analyze the properties of the top quark in more detail and to determine, for example, the top mass [3-5]. The purely hadron six jet decay mode occurs about 60% of the time, but it is buried underneath "ordinary" QCD multijet production.

(a) Top-Pair Production

(b) Event Topology

Figure 1. (a) Illustration of top-pair production in proton- antiproton collisions in which one of the W bosons decays leptonically and the other decays hadronically resulting in a final state consisting of a lepton, a neutrino, plus a b

and a

pair. (b) Shows the event topology for the top- pair signal. If each outgoing quark produces a distinct jet then the final state containes a lepton, a neutrino (missing E_T), and four jets.

In this paper, we concentrate on the lvb decay mode of top-pair production in proton-antiproton collisions at 1.8 Tev and investigate ways to optimize this signal over the backgrounds. The event topology of the signal is shown in Fig. 1b and consists of a lepton, a neutrino, and four outgoing quarks which manifest themselves as "jets". In the center-of-mass of a 175 Gev top quark, the W boson and b-quark decay products each have a momentum of about 70 Gev. Furthermore, in the CM frame of the W boson, the quark and antiquark decay products each have a momentum of about 40 Gev. The top-pair are produced near threshold resulting in a typical event that is rather spherical in shape with all six decay products, lvb, having large transverse energy. The background comes from the "ordinary" QCD production of large transverse momentum W bosons plus multiple jets as shown in Fig. 2a and from the production of b-quark pairs plus associated jets as illustrated in Fig. 2b. We begin our analysis of the signal and backgrounds in Section II with a discussion of the data selection and initial cuts. In Section III, we reconstruct the invariant mass of the top-pair and compare it with the true parton-parton CM energy. In Section IV, we introduce modified Fox-Wolfram "shape" variables and apply them to the outgoing jets and in Section V the use of neural networks and Fisher discriminates is explored. Section VI is reserved for summary and conclusions.

(a) W+Jets Background

(b) b+Jets Background

Figure 2. Illustration the (a) W+jets background and the (b) b

+jets background to the top-pair production in proton-antiproton collisions shown in Fig. 1.

II. Event Generation, Data Selection and Cuts

ISAJET version 7.06 is used to generate top quarks with a mass of 175 GeV in 1.8 Tev proton-antiproton collisions. At this energy, 175 Gev top-pairs are produced via quark-antiquark annihilation, t, about 88% of the time and by gluon-gluon fusion, gg t, the remaining 12%. We refer to this as the "signal". We have normalized the top cross section to be 7.5 pb corresponding to 750 events with an integrated luminosity of 100/pb [3-5]. The "background" consists of single W boson events generated with the hard-scattering transverse momentum, k_T, greater than 25 Gev. Single W bosons are produced at large transverse momentum via the "ordinary'' QCD subprocesses qgWq, gW , and Wg. These subprocesses, of course, generate addition gluons via bremsstrahlung off both incident and outgoing color non-singlet partons, resulting in multiparton final states which subsequently fragment into hadrons. This is referred to as the W+jets background. Another background is b-quark pairs produced via the subprocesses b and b plus the accompanying radiation. This is referred to as the b+jets background.

We do not attempt to do a detailed simulation of the CDF or D0 detector [3,4]. Events are analyzed by dividing the solid angle into "calorimeter'' cells having size = 0.1x7.5°, where and are the pseudorapidity and azimuthal angle, respectively. Our simple calorimeter covers the range || < 4 and has 3840 cells. A single cell has an energy (the sum of the energies of all the particles that hit the cell excluding neutrinos) and a direction given by the coordinates of the center of the cell. From this the transverse energy of each cell is computed from the cell energy and direction. We have taken the energy resolution to be perfect, which means that the only resolution effects are caused by the lack of spatial resolution due to the cell size. Large transverse momentum leptons are analyzed separately and are not included when computing the energy of a cell.

Lepton plus Missing Transverse Energy Trigger

The "zero-level'' trigger is designed to select large transverse momentum W bosons that have decayed into a charged lepton and a neutrino. This first cut is made by demanding that the event contain at least one isolated high transverse momentum charged lepton (electron or muon) in the central region satisfying:

P_T(l) > 15 GeV, |(l)| < 2.5.

"Isolated" leptons are defined by demanding that the total transverse energy within a distance R_l of the lepton in

space be less than E_T^l(max). For this analysis,

R_l=0.2, E_T^l(max)=5 GeV.

In addition, the event must have large missing transverse energy, E_T(miss), and an overall lepton-neutrino transverse momentum, P_T(lv), given by:

E_T(miss) > 20 GeV, P_T(lv) > 25 GeV,

where the missing transverse momentum 2-vectoris determined from the transverse energy grid (the calorimeter) and

where the x-axis and y-axis are perpendicular to the colliding beams and the z-axis is parallel.

Table 1. 175 GeV Top quark pairs produced in 1.8 TeV proton-antiproton collisions. The table shows the number of events (with L=100/pb) for the top-pair signal and the W+jets background. The b+jets backgrond is shown in paranthesis. The "zero-level" lepton plus missing E_T trigger is used as a reference point and is normalized to 100%. The enhancement factor is defined to be the percentage of signal divided by the percentage of background surviving the given set of cuts. Both the overall and relative enhancement factors are shown.
	Top Signal			W+jets Background
		Efficiency			Efficiency		Enhancement
Cut or Selection	Events	Relative	Overall	Events	Relative	Overall	Relative	Overall	Sig/Bak
"Zero-Level" Lepton plus Missing E_T Trigger: P_T(l) > 15 GeV E_T(miss) > 20 GeV P_T(lv) > 25 GeV	165	100%	100%	7,044 (470)	100%	100%	1.0	1.0	0.0234
Calorimeter Cell Cuts: N(cell) > 7 (E_T(cell) > 5 GeV) E_T(sum) > 100 GeV	113	69%	69%	49 (3)	0.7%	0.7%	99.7	99.7	2.3
Fisher Cut: F > 0.75	49	44%	30%	6	12%	0.1%	3.7	373	8.7

This selection of P_T(l^±) > 15 GeV, E_T(miss) > 20 GeV, and P_T(lv) > 25 GeV is referred to as the lepton plus missing E_T trigger. Table 1 shows that about 165 top-pair events survive this selection criterion (22% of the overall top signal). For illustration, we take the integrated luminosity to be 100/pb. Table 1 also shows that about 7,000 W+jets and roughly 500 b+jets background events also survive this "zero-level" cut. The lepton isolation requirements do a good job removing most of the b+jets background, so we will concentrate primarily on the W+jets background. At this stage, the background and is about 43 times the signal.

In order to quantify how various additional cuts enhance the signal above the background, we define the enhancement, F(enh), and the efficiency, F(eff), of a given set of cuts as follows:

F(enh) = (% of signal surviving cut)/(% of background surviving cut )
F(eff) = % of signal surviving cut .

We define the "zero-level" trigger to be the reference point and the fraction of events escaping this cut is set to 100% in Table 1. Similarly, all "enhancement'' factors are set to one at this level as we measure the effectiveness of all other additional cuts from this point. The overall enhancement and efficiency is determined relative to the "zero-level" trigger, while relative enhancements and efficiencies are determined by examining the number of events before and after the particular cut.

Figure 3. Shows the multiplicity of calorimeter cells containing at least 5 GeV of transverse energy for the of the top-pair signal and the W+jets background. In all cases, the events have survived the "zero-level" lepton plus missing E_T trigger. The plot shown the percentage of events with N cells with E_T(cell) > 5 GeV. The position of our cell cut is marked by the dotted line.

Figure 4. Shows the total transverse energy of all the calorimeter cells with E_T(cell) > 5 GeV for the of the top-pair signal and the W+jets background. In all cases, the events have survived the "zero-level" lepton plus missing E_T trigger. The plot shown the percentage of events with E_T(sum) within a 25 GeV bin. The position of our cell cut is marked by the dotted line.

Calorimeter Cell Cuts

At this stage in the analysis, one normally demands that the event contain at least four jets [3-5]. Cutting on the number of jets is a way to preferencially select the top-pair signal over the background. However, we have found that it is faster and better to simply cut on the number of calorimeter cells, N(cell), with transverse energy greater than some minimum, E_T(cell-min). Fig. 3 shows the calorimeter cell multiplicity with E_T(cell-min)=5 GeV for the top-pair signal and the W+jets background. On the average, the top-pair signal populates a larger number of cells than does the background. Obviously this is because the top-pair signal produces more jets, however, one does not have to define a "jet" to see this topology. The top-pair signal produces transverse energy flying out in all directions and this can be seen directly from the calorimeter cell multiplicity.

The top-pair signal also produces more global transverse energy than the background. This is shown in Fig. 4, where we define E_T(sum) to be the sum of the transverse energy of all the calorimeter cells with E_T > E_T(cell-min). As illustrated in Fig. 3 and Fig. 4, we make the following calorimeter cell cuts:

N(cell) > 7 with E_T(cell-min)=5 GeV
E_T(sum) > 100 GeV

At this stage one could cut harder on E_T(sum) and remove more background. However, we want to avoid as much as possible cuts that cause the background invariant mass to peak at the same place as the top-pair signal. For this reason, we will use event "shape" variables to further improve the signal to background ratio.

Table 1 shows that of the 165 top-pair events passing the "zero-level" lepton plus missing E_T cut roughly 69% also pass the calorimeter cell cuts. On the other hand, less than 1% of the W+jets background events survive the cell cuts. The calorimeter cell cuts produce an enhancement of 0.69/0.007 or about 100 over the W+jets background with a 69% efficiency, resulting in a signal to background ratio of about 2. The N(cell) > 7 with E_T(cell-min)=5 GeV cut produces more than a factor of two better enhancement than the traditional "jet cuts'' (i.e. N(jet) > 3). Adding the E_T(sum) > 100 GeV cut gives an additional relative enhancement of more than a factor of three.

Figure 5. Shows the multiplicity of jets with transverse energy greater than 15 GeV for the of the top-pair signal and the W+jets background. In all cases the events have survived the "zero-level" lepton plus missing E_T trigger and the calorimeter cells cuts. The plot shown the percentage of events with N jets with E_T(jet) > 15 GeV.

III. Reconstructing the Top-Pair Invariant Mass

Reconstructing the neutrino momentum

Ideally one would like to reconstruct the invariant mass of the top-pair from its decay products: lepton, neutrino, and four jets. However, the neutrino is not detected and its presence must be inferred by examining the missing transverse momentum. If we set the transverse momentum components of the neutrino equal to the missing transverse momentum,

, and assume that the charged lepton and the neutrino are the result of a W decay (and neglect the W width) then the longitudinal momentum of the neutrino is given by one of the two solutions:

where E_l, p^l_L, and p^l_T, are the energy, longitudinal momentum, and transverse momentum, respectively, of the charged lepton, and p^v_T is the transverse mementum of the neutrino. The quantity A is given by

where

is the azimuthal angle between the transverse momentum vector of the charged lepton and the neutrino. We include both solutions in our determination of the top-pair invariant mass.

Adding in the momentum and energy of the jets

We have not used jets in our event trigger, however, we do use jets to reconstruct the top-pair invariant mass. In addition, we use the jet topology to help further distinguish the signal from the backgrounds. Jets are defined using a simple algorithm. One first considers the "hot'' cells (those with transverse energy greater than 5 GeV). Cells are combined to form a jet if they lie within a specified "radius'',

R_j² = ² + ²,

space from each other. Jets have an energy given by the sum of the energy of each cell in the cluster and a momentum given by the vector sum of the momentums of each cell. The invariant mass of a jet is simply

M_j² = E_j²-p_j².

In this analysis, we take the jet radius to be R_j=0.4 and require jets to have at least 15 GeV of transverse energy. Namely,

R_j=0.4 and E_T(jet) > 15 GeV.

The top-pair invariant mass, M(t

), is constructed from the energy and momentum of the charged lepton, the energy and momentum of the reconstructed neutrino, and the overall momentum vector of the associated jets as follows:

where

and

The overall jet energy, E_jets, and momentum, p_jets, is constructed by summing over all jets with transverse energy greater than 15 GeV. We do not require the event to have a minimum of four jets. The calorimeter cell cuts have replaced the need to make a jet multiplicity cut. This can be seen in Fig. 5 which shows the multiplicity of jets with E_T(jet) > 15 GeV for the top-pair signal and the W+jets background after the lepton plus missing E_T trigger and the calorimeter cell cuts. The cell cuts have forced the signal and background jet multiplicities to look similar and one does not gain much by making an additional jet multiplicity cut. (At this stage, requiring N(jet) > 3 would result in an additional relative enhancement of about 2 with an efficiency of 81%.)

Figure 6. Shows the reconstructed top-pair invariant mass, M(t

), for 175 GeV top quarks produced in 1.8 TeV proton-antiproton collisions (solid curve). The plot contains only the top-pair signal and corresponds to the number of events per year (with L=100/pb) in a 50 GeV. The events have survived the "zero-level" lepton plus missing E_T trigger and the calorimeter cell cuts. Also shown in the true parton-parton CM energy of the event (not directly observable experimentally).

Figure 7. Shows the reconstructed top-pair invariant mass, M(t

), for 175 GeV top quarks produced in 1.8 TeV proton-antiproton collisions together with the W+jets background. The plot shown the sum of the signal plus background and corresponds to the number of events per year (with L=100/pb) in a 50 GeV. The events have survived the "zero-level" lepton plus missing E_T trigger and the calorimeter cell cuts.

Comparing with the parton-parton CM energy

The top-pair invariant mass, M(t), corresponds to the center-of-mass energy, E_cm, of the underlying parton-parton two-to-two subprocess which has a threshold at twice the mass of the top quark, E_cm > 2M_top. This is seen clearly in Fig. 6 which compares the true t and ggt CM energy, E_cm (not experimentally observable), with the reconstructed top-pair invariant mass, M(t). If the neutrino momentum could be precisely determined from the missing E_T and if we knew exactly which particles to include in the jets then the two curves in Fig. 6 would agree. Although one cannot precisely reconstruct the parton-parton CM energy, there still remains a nice peak in the reconstructed top-pair invariant mass at twice the top mass. In this paper, we use the observation of this peak as a measure of how well one can determine the top quark mass and we would like to remove as much background as possible from the peak.

Fig. 6 includes only the top-pair signal with no background. Fig. 7 shows the reconstructed parton-parton CM energy for the top-pair signal and the W+jets background after the "zero level" lepton plus missing E_T trigger and the calorimeter cell cuts. The plot shown the sum of the signal and the background. At this stage the signal is about twice the background. However, the signal to background ratio can be improved by examining in more detail the "shape" of the events.

IV. Variables that Characterize the Event Shape

Fox-Wolfram Moments

In 1979 Geoffrey Fox and Stephen Wolfram [6] constructed a complete set of rotationally invariant observables, H_l which can be used to characterize the "shapes" of the final states in electron-positron annihilations. They are constructed from the momentum vectors of all the final state particles as follows,

where the inner sum is over the particles produced and Y^m_l are the spherical harmonics. Here one must choose a particular set of axes to evaluate the angles,

_i = (

_i,

_i), of the final state particles, but the values of the H_l are independent of this choice. These moments lie in the range 0 < H_l < 1 and if energy conserved in the final state then H₀=1 ( neglecting the masses). If momentum is conserved in the final state then H₁=0.

The Fox-Wolfram observables (or moments) constitute a complete set of shape parameters. For example, the collinear "two-jet" final state results in H_l near 1 for even l and H_l near 0 for odd l. Events that are completely spherically symmetric give H_l near 0 for all l.

Table 2. Shows the mean value and standard deviation from the mean of six of the modified Fox-Wolfram moments applied to the jets in the event with transverse energy greater than 15 GeV. Results are shown for the top-pair signal and the W+jets background. Also shown are the resulting Fisher coefficients.
	Top Signal		W+jets Background
Moment	Mean	Stdev	Mean	Stdev	Fisher Coefficient
H₁	0.2382	0.1775	0.3598	0.2457	-0.500
H₂	0.2815	0.1563	0.4381	0.2211	-1.282
H₃	0.2757	0.1367	0.4031	0.1941	-1.088
H₄	0.2833	0.1279	0.4099	0.1838	-0.978
H₅	0.2856	0.1207	0.3994	0.1744	-0.544
H₆	0.2891	0.1156	0.3989	0.1735	-0.069

Modified Fox-Wolfram Moments Applied the Jets

In hadron-hadron collisions spherical symmetry is lost and we are interested more in the shape of events in the transverse plane. For example, the Fox-Wolfram moments when applied directly to hadron-hadron collisions would interpret a minimum bias event as a "two-jet" event. Whereas, we would like to have a minimum bias event treated more like a spherically symmetric e⁺e^- final state (i.e. no structure). To accomplish this, we define the following modified Fox-Wolfram moments for hadron-hadron collisions,

where the inner sum is now over all the jets in the event with transverse energy, E_Tⁱ, greater than 15 GeV and

_i = (

_i,

_i) represent the angular location jet. Here, E_T(sum) is the sum of the transverse energy of all the jets that are included in the sum. These modified moments also lie in the range 0 to 1 and by definition

₀=1. Furthermore, if the transverse momentum of the jets in the event is conserved then

₁=0. In this case, however, events that are "cylindrically symmetric" about the beam axis give

_l near 0 for all l.

Figure 8. Shows the modified Fox-Wolfram moment,

₁, calculated using the jets in the event with transverse energy greater than 15 GeV for top-pair signal and for the W+jets background. The plot showns the percentage of events in a 0.05 bin. The events have survived the "zero-level" lepton plus missing E_T trigger and the calorimeter cell cuts. (If the vector sum of the monentum of all the jets in the events is zero then

₁=0.)

Figure 9. Shows the modified Fox-Wolfram moment,

₂, calculated using all the jets in the event with transverse energy greater than 15 GeV for top-pair signal and for the W+jets background. The plot showns the percentage of events in a 0.05 bin. The events have survived the "zero- level" lepton plus missing E_T trigger and the calorimeter cell cuts.

Figure 10. Shows the modified Fox-Wolfram moment,

₄, calculated using all the jets in the event with transverse energy greater than 15 GeV for top-pair signal and for the W+jets background. The plot showns the percentage of events in a 0.05 bin. The events have survived the "zero-level" lepton plus missing E_T trigger and the calorimeter cell cuts.

Table 2 shows the mean values and standard deviations for six of the modified Fox-Wolfram moments calculated using all jets with E_T(jet) > 15 GeV for events that have survived the "zero-level" lepton and missing E_T trigger and the calorimeter cell cuts. There are clearly still some differences between the jet topologies of the top-pair signal and the W+jets background. The mean values of the six moments ₁,... , ₆ are smaller for the signal than the background indicating that the jets originating from the top- pair signal form a more cylindrically symmetric pattern when they emerge from the event than do the background jets. The top-pair jets are more spread out in - space. This can be seen in Figs. 8, 9 ,and 10 which show the ₁, ₂, and ₄ distributions, respectively, for the signal and background. At this stage one could simply make a linear cut on, for example, ₂. Requiring ₂ < 0.3 gives an additional signal to background enhancement of about 2 with a relative efficiency of around 60%. One can do a little better, however, by using the information of all six of the _l's simultaneously. This can be done by constructing a neural network or by using Fisher discriminates.

V. Neural Network Analysis

Neural networks are an excellent tool for separating patterns into categories (e.g. signal and background). Our neural networks [7] consist of a set of N_in inputs, {x}, which can have any value and one output, z_net, which is restricted to the range, 0 < z_net < 1. The net output is a function of the input set {x} and the network "memory" parameters as follows:

z_net = F_net({x},{w},{T}),

where the network memory consists of a set of weights, {w}, and a set of thresholds {T}. The goal is to construct a network that can distinguish between two patterns of input data, "signal" events and "background" events, where each event is characterized by the N_in variables. A "perfect" network responds with z_net near one for a signal input and with z_net near zero for a background input and a single cut can be made on this network output which will enhance the signal over the background.

Figure 11. Shows the network response, z_net, for the sample of signal and background events used in the training. The plot corresponds to the percentage of events with z_net within a 0.05 bin for the top-pair signal and the W+jets background. The events have survived the "zero-level" lepton plus missing E_T trigger and the calorimeter cell cuts.

Figure 12. Shows the enhancement versus the efficiency for the training sample of events for a 6-12-1 neural network with 97 memory parameters. Each point in the plot corresponds to a different choice for the network cut-off with the lower efficiencies and higher enhancements corresponding to larger values of z_cut. The network enhancements are compared with the enhancements arrived at by the use of Fisher discriminates.

Network Inputs and Training

Of course, the key to a good network lies in the selection of the input variables. These variables must characterize the differences between the signal and the background. We choose the first six modified Fox-Wolfram variables ( applied to the jets) as the network inputs:

x₁=₁, x₂=₂, x₃=₃, x₄=₄, x₅=₅, x₆=₆.

The network is trained on a sample of 4,000 top-pair signal and 3,814 W+jets background events using the six inputs shown above and where both signal and background events have already satisfied the "zero-level" lepton plus missing E_T trigger and the calorimeter cell cuts. To get this training sample, it was necessary to generate 50,000 top-pair events and 1,200,000 W+jet events.

There is no systematic procedure that provides the best network topology for a given problem. One looks for the simplest network that can discriminate signal from background. Here we use a simple network with only one "hidden layer". We use a 6-12-1 net which has 97 memory parameters (see Ref. [7] for notation). Fig. 11 shows the network response (i.e. z_net) for the sample of signal and background events used in the training. The situation is far from the ideal. There are some events around z_net=0.5 for which the net cannot distinguish between signal and background. Nevertheless, the net does allow for some separation of signal and background. The net clearly recognizes some events as signal or background, while for other events there is an overlap and the net cannot distinguish between the two.

The next step is to perform a network cut-off and assign any event with z_net > z_cut to be signal and events with z_net < z_cut to be background. The enhancement and efficiency of the network cut-off depends on the value chosen for z_cut, where the network enhancement and efficiency are defined as follows:

F_net(enh) = (% of signal with z_net > z_cut)/(% of background with z_net > z_cut)
F_net(eff) = % of signal with z_net > z_cut.

The overall network performance can be characterized by the single curve of the network enhancement versus the network efficiency shown in Fig. 12. Each point in Fig. 12 corresponds to a different choice for the network cut-off with the lower efficiencies and higher enhancements corresponding to larger values of z_cut. For example, a net cut of z_cut=0.75 corresponds to an additional enhancement of about 4 with a relative efficiency of about 47%.

Figure 13. Shows the "shifted" Fisher response,

, for the sample of signal and background events used in the training of the neural network. The plot corresponds to the percentage of events with

within a 0.05 bin for the top-pair signal and the W+jets background. The events have survived the "zero-level" lepton plus missing E_T trigger and the calorimeter cell cuts. The position of our "Fisher cut" is marked by the dotted line.

Fisher Discriminates

A simplier method of separating signal and background is to use Fisher discriminates. This method is analogous to a neural network with no hidden layers. Here as with the network, one inputs a set of N_in variables, x_i, and there is one output, F. However, in this case F is a linear function of the inputs,

where the Fisher coefficients,

_i, are chosen to maximize the separation between signal and background in F-space,

where

and

are the mean and the standard deviation, respectively, of the Fisher output for the signal (sig) and background (bak) sample. The Fisher coefficients are given by

where

is the inverse matrix and

_i is the mean of the distribution x_i,

and V is the covariance matrix,

Here the sum over n corresponds to the sum over the training sample.

In this case training consists of calculating the Fisher coefficients which involves inverting an N_inx N_in matrix, but this is much easier than training a network. Once this is done the situation is similar to the network (with F replacing z_net), for each input of N_in variables there is one output F. We have determined the Fisher coefficients for the sample of signal and background events used to train our network and the Fisher response for these events is shown in Fig. 13. In plotting the Fisher responce in Fig. 13, we have shifted, F, to lie between zero and one as follows:

In this analysis all the inputs, x_i, lie between zero and one and all the Fisher coefficients turn our negative which implies that

The separation between signal and background in Fig. 13 is about the same as the network. As with the network, the overall Fisher performance can be characterized by the single curve of the Fisher enhancement versus the Fisher efficiency which is shown in Fig. 12 together with the network performance. Each point corresponds to a different choice for the Fisher cut-off.

Figure 14. Shows the reconstructed top-pair invariant mass, M(t

Using the Fisher Cut-off

Fig. 12 shown that Fisher discriminates have essencialy the same performance curve as does the neural network and since it is simplier to calculate the Fisher function, we complete our analysis by making a cut on as follows:

> 0.75.

As can be seen in Table 1, this Fisher cut provides an additional enhancement of around 4 with a relative efficience of about 44% resulting in an overall signal to background ratio of about 9.

Fig. 14 shows the reconstructed parton-parton CM energy for the top-pair signal and the W+jets background after the "zero level" lepton plus missing E_T trigger and the calorimeter cell cuts and the Fisher cut. The plot shows the sum of the signal and the background.

VI. Summary and Conclusions

We have developed a procedure that enhances the top quark signal (lvb decay mode) over the W+jets and the b+jets background in hadron-hadron collisions. Our technique can be summarized by the following series of selections and cuts:

Lepton and missing transverse energy trigger.
Calorimeter cell cuts.
Modified Fox-Wolfram shape parameters (applied to the jets).
Fisher discriminate or neural network {cut-off}.

We do not use a conventional "jet trigger". Instead of requiring at least four jets in the event, we find that it is faster and better to simply cut on the number of calorimeter cells with transverse energy greater than some minimum. Our N(cell) > 7 with E_T(cell-min) = 5 GeV cut produces more than a factor of two better enhancement than the traditional, N(jet) > 3, cut. Adding an E_T(sum) > 100 GeV cut gives an additional relative enhancement of more than a factor of three.

In addition, we use Neural networks and Fisher discriminates in conjunction with modified Fox-Wolfram "shape" variables to further distinguish the top-pair signal from background. For example, using the first six Fox-Wolfram moments ( applied to the jets) together with a Fisher cut-off, > 0.75, provides an additional enhancement of around 4 with a relative efficiency of about 44%. By combining the calorimeter cell cuts with the event shape information, we are able to obtain an overall signal to background enhancement of around 370 with an efficiency of 30%, and a signal to background ratio of around nine.

References

F. Abe et al. (The CDF Collaboration), Phys. Rev. Lett. 73, 225,1994; Phys. Rev. D50, 2966,1994.
S. Abachi et al. (The D0 Collaboration), Phys. Rev. Lett. 74, 2632,1995.
F. Abe et al. (The CDF Collaboration), Phys. Rev. Lett. 74, 2626, 1995.
S. Abachi et al. (The D0 Collaboration), Phys. Rev. Lett. 74, 2422, 1995.
E.W. Varnes (for the D0 Collaboration, presented at 1996 Annual Divisional Meeting (DPF 96) of the Division of Particles and Fields of the American Physical Society, Minneapolis, MN, 10-15 Aug 1996.
G. C. Fox and S. Wolfram, Phys. Rev. Lett. 41, 1581 (1978); Nucl. Phys. B149, 413 (1979); Phys. Lett. B82, 134 (1979).
R. D. Field, Y. Kanev, M. Tayebnejad, and P. A. Griffin, Phys. Rev. D53, 2296 (1996). (Click for abstract)

This research was supported in part by a grant from the U.S. Department of Energy (DoE).

Click here to return to Recent Publications.