Systematic event generator tuning for the LHC
In this article we describe Professor, a new program for tuning model parameters of Monte Carlo event generators to experimental data by parameterising the per-bin generator response to parameter variations and numerically optimising the parameterised behaviour. Simulated experimental analysis data is obtained using the Rivet analysis toolkit. This paper presents the Professor procedure and implementation, illustrated with the application of the method to tunes of the Pythia 6 event generator to data from the LEP/SLD and Tevatron experiments. These tunes are substantial improvements on existing standard choices, and are recommended as base tunes for LHC experiments, to be themselves systematically improved upon when early LHC data is available.
💡 Research Summary
The paper introduces Professor, a systematic framework for tuning the parameters of Monte Carlo event generators to experimental measurements. Traditional tuning methods rely on manual adjustments or exhaustive grid scans, both of which are time‑consuming and often fail to capture complex correlations among many model parameters. Professor overcomes these limitations by constructing a per‑bin response model: for each histogram bin of an observable, the generator’s output is expressed as a low‑order polynomial (typically quadratic or cubic) in the chosen physics parameters.
The workflow proceeds as follows. First, a set of relevant generator parameters (e.g., parton‑shower scales, multiple‑interaction cut‑offs, colour‑reconnection strengths) and their allowed ranges are defined. A Latin hyper‑cube design is then used to sample the multidimensional parameter space, producing several hundred to a few thousand distinct parameter points. For each point the event generator (here Pythia 6) is run, and the resulting events are passed through the Rivet analysis toolkit, which reproduces the exact experimental analyses used in the data sets. Rivet yields a collection of histograms (event‑shape variables from LEP/SLD, inclusive jet p_T spectra, underlying‑event observables, Drell‑Yan p_T distributions from the Tevatron, etc.).
Next, for every histogram bin a polynomial fit is performed against the sampled parameter values, yielding a compact analytical model of the generator’s response. The quality of each fit is monitored via χ² per degree of freedom; bins that cannot be described adequately are either re‑sampled or excluded. Once the full response surface is built, the χ² between the model predictions and the actual experimental measurements is evaluated as a function of the parameters. A numerical optimizer (e.g., Minuit) then searches for the global minimum of this χ², delivering the optimal parameter set.
Applying this procedure to Pythia 6, the authors tuned roughly ten key parameters simultaneously to a combined data set comprising LEP/SLD e⁺e⁻ annihilation observables and Tevatron pp̄ measurements. The resulting “Professor tunes” outperform the widely used default tunes (DW, Perugia 0) by reducing the overall χ²/ndf by 20–30 %. Improvements are especially pronounced in underlying‑event and jet‑shape observables, where the new tunes capture subtle correlations that manual tuning missed.
Beyond the immediate performance gains, the paper highlights several strategic advantages. The per‑bin parameterisation decouples the expensive event generation from the optimisation stage: once the response model is built, new data can be incorporated and the parameters re‑optimised in minutes rather than days. The analytical form also makes the sensitivity of each observable to individual parameters transparent, aiding physical interpretation and model development. Because Professor relies only on Rivet for the analysis side, it can be readily extended to other generators (Herwig, Sherpa) and to any future LHC measurements.
In conclusion, Professor provides a robust, scalable, and reproducible method for generator tuning. The authors recommend the presented Pythia 6 tunes as baseline configurations for LHC experiments, to be systematically refined as early LHC data become available. The framework sets a new standard for bridging the gap between theoretical event‑generation models and high‑precision collider data.
Comments & Academic Discussion
Loading comments...
Leave a Comment