Draft:Linear Template Fit
Review waiting, please be patient.
This may take 3 months or more, since drafts are reviewed in no specific order. There are 4,419 pending submissions waiting for review.
Where to get help
How to improve a draft
You can also browse Wikipedia:Featured articles and Wikipedia:Good articles to find examples of Wikipedia's best writing on topics similar to your proposed article. Improving your odds of a speedy review To improve your odds of a faster review, tag your draft with relevant WikiProject tags using the button below. This will let reviewers know a new draft has been submitted in their area of interest. For instance, if you wrote about a female astronomer, you would want to add the Biography, Astronomy, and Women scientists tags. Editor resources
Reviewer tools
|
| Part of a series on |
| Regression analysis |
|---|
| Models |
| Estimation |
| Background |
In statistics, Linear Template Fit (LTF) is a method for simulation-based parameter estimation applicable to both univariate and high-dimensional multivariate analyses, in cases where the underlying model function cannot be evaluated continuously but is only available at a discrete set of reference values of the parameters of interest[1]. The LTF combines a linear regression with a (generalized) least squares approach to produce a fully closed-form analytic expression for the best estimator, enabling direct and computationally efficient parameter determination.
The method addresses situations where the residuals cannot be expressed analytically or are too computationally expensive to evaluate repeatedly, as is often the case in iterative minimization algorithms. In the Linear Template Fit, the residuals are estimated from the random variables and from a linear approximation of the underlying true model, while the true model needs to be provided for at least n+1 distinct reference values (where n is the number of estimators). The true distribution is then approximated by a linear regression using pre-computed templates, yielding estimators that are determined directly from the data. This closed-form structure enables fully analytic error propagation, allowing uncertainty quantification that is valuable in complex physics analyses.
The Linear Template Fit is used in particle physics, where it has been employed to determine the W-boson's mass,[2] the top quark's mass and width,[3] and the strong coupling constant in quantum chromodynamics.[1]
Problem setup
The Linear Template Fit considers a least squares problem with m observations (data points) di, i = 1, ..., m, which are assumed to follow a normal (Gaussian) probability distribution. The objective function is written in terms of a χ² function:
where:
- is the m-vector of observed data values
- is the m-vector of the theoretical model predictions, dependent on the n-dimensional parameter vector
- is the m×m covariance matrix incorporating all uncertainty sources
The best estimators are found by minimizing .
In many practical applications, particularly in high energy physics, the model is computationally intensive to evaluate, or may not even be available for arbitrary values of . Instead, model predictions are only provided at a finite set of j reference values . These predictions are called templates:
The Linear Template Fit exploits only these pre-computed template predictions.
Algorithm
Linear regression step
The first step of the Linear Template Fit is to construct a continuous approximation of the model from the discrete templates. In every bin i, the model is approximated by a linear function:
where and are determined by linear regression of the template values at the reference values .
The regression matrix M is constructed from the reference values and a column of ones:
The best estimators for the regression parameters for the i-th bin are given by the least squares solution:
where is the Moore–Penrose pseudoinverse of M. A key simplification in the Linear Template Fit is that the bin-wise regression can be treated as unweighted. This is because all templates are typically generated using the same methodology (e.g., the same Monte Carlo statistics), leading to approximately equal uncertainties across templates that cancel in the regression. Thus:
The matrix is a 2×j matrix that is universal — it depends only on the reference values , not on the template values themselves, and is therefore the same in every bin i.
Master formula
The pseudoinverse is decomposed into two j-dimensional column vectors:
where and are j-vectors derived from the columns of .
The template matrix Y is an m×j matrix constructed from the column vectors of all template distributions:
Substituting the regression coefficients into the linear approximation (eq. 4) and using the vectors , and matrix Y, the model is expressed as:
Substituting this into the χ² objective function (eq. 1):
where is the inverse covariance matrix.
Since χ² is quadratic in , the best estimator is found at the stationary point , yielding the master formula of the Linear Template Fit:
Introducing the generalized inverse matrix , the estimator simplifies to , and this matrix enables fully analytic error propagation and uncertainty analysis. The variance of the best estimator is:
|14}}
Given that the data follow a normal distribution and the linear approximation holds, is a best linear unbiased estimator (BLUE) according to the Gauss–Markov theorem.
Multivariate Linear Template Fit
For models depending on k parameters , the Linear Template Fit generalizes naturally. The regression matrix M becomes a j × (n+k) matrix,
and the pseudoinverse decomposition becomes
where is now a j × k matrix. The linearized model as a function of all parameters is:
and after analytic minimization, the closed-form expression for the best estimator $\hat{\boldsymbol{\alpha}}$ of the parameters of interest becomes:
Introducing the generalized inverse matrix , the estimator simplifies to , and this matrix enables fully analytic error propagation and uncertainty analysis.
Further variants
The Linear Template Fit can be formulated in further variants:
- Systematic uncertainties with full bin-to-bin correlations can be included by treating them as nuisance parameters that are also determined analytically in closed form. This allows for detailed insights into possible biases from each uncertainty component.
- The log-normal variant reformulates the equations for log-normal-distributed quantities instead of normally distributed ones. Log-normal uncertainties represent normally distributed relative uncertainties, which is often a reasonable assumption for systematic effects such as luminosity uncertainties. This variant applies the logarithm to both data and template values before performing the fit, and considers relative uncertainties in the covariance matrices.
- When detector effects such as resolution or acceptance need to be accounted for, the template matrix can be modified by incorporating a detector response matrix A, i.e., , where A represents the detector migration and Y contains the templates at particle level. This "forward folding" approach avoids the need for iterative or regularized unfolding.
Error propagation
A key advantage of the Linear Template Fit is its closed-form analytic expression, which enables comprehensive and straightforward error propagation of all uncertainty components. Each uncertainty source can be propagated separately to the fit results using standard linear error propagation . Fully bin-to-bin correlated systematic uncertainties are propagated as .
Uncertainties in the templates can likewise be propagated to the best estimator. In addition, uncertainties that are not included in the fit can be propagated, for example, theoretical uncertainties that cannot be constrained by data in the fit.
Quadratic template fit
When the model exhibits significant non-linearity in the parameters, the linear approximation may be insufficient. The quadratic template fit extends the method using second-degree polynomials for the parameter dependence of the model.[1]
In each bin i, the model is approximated as:
|26}}
which requires at least three reference values for a univariate problem (or at least n²+2n for n parameters in the multivariate case).
Since the χ² function becomes of order in , no closed-form solution exists. The quadratic template fit employs an iterative algorithm:
- The Linear Template Fit is performed to obtain an initial estimator .
- The Newton algorithm is employed with a few m iterations to obtain improved estimators , where the Hesse matrix is analytically calculable.
- The best estimator and error calculation are obtained using linearized approximations in the equations of the Linear Template Fit.
The first step provides a starting point close to the minimum. Since the starting point is already in the vicinity of the minimum, the Hesse matrix is commonly positive definite, and the Newton method has excellent convergence for nearly quadratic functions. The algorithm typically converges in just a few iterations.
Beyond correcting for non-linearity, the quadratic template fit also provides an important consistency check: agreement between the linear and quadratic estimators validates that the underlying model is sufficiently linear within the region of interest.
Consistency checks and validation
The quality of a Linear Template Fit result can be ensured by appropriate selection of the template reference values: the best estimator should lie within the interval spanned by the templates, and the spacing between reference points should be comparable to or smaller than the expected uncertainty of the estimator. Several cross-checks are available to validate the result. An alternative estimator can be obtained by fitting a parabola to the χ² values computed at each template reference point; its consistency with the LTF result serves as a diagnostic for the linearity of the problem. Additionally, the expected distance to the minimum (EDM) from a Newton-step evaluation quantifies non-linear effects, and an independent fit using the quadratic template fit provides a further check — agreement between all three estimators indicates a well-constrained, sufficiently linear fit.
Applications
The Linear Template Fit has found various applications in high-energy particle physics and other fields. Some examples are:
- W-boson mass measurement: The CMS collaboration used LTF to extract the W-boson mass from hadronic jet mass distributions in boosted W boson decays at TeV, exploiting the closed-form analytic structure for detailed uncertainty quantification[2].
- Top quark mass measurement: CMS determined the top quark mass from the differential tt¯ production cross section as a function of the jet mass in hadronic decays of boosted top quarks at TeV, where LTF provided a computationally efficient alternative to iterative numerical minimization[3].
- Top quark mass and width determination: Amoroso et al. applied LTF to simultaneously determine the top-quark mass and width including NLO parton-shower effects, demonstrating the method's capability for multi-parameter fits with correlated systematic uncertainties[4].
- Strong coupling constant determination: The original LTF paper demonstrated the method by extracting from inclusive jet cross sections at TeV, where templates were available only at discrete values of due to the computational cost of NLO QCD calculations[1].
See also
- Least squares
- Linear regression
- Ordinary least squares
- Weighted least squares
- Generalized least squares
- Nonlinear least squares
- Maximum likelihood estimation
References
- ^ a b c d Britzger, Daniel (2022). "The Linear Template Fit". European Physical Journal C. 82 (8): 731. arXiv:2112.01548. Bibcode:2022EPJC...82..731B. doi:10.1140/epjc/s10052-022-10581-w. S2CID 244896511.
- ^ a b CMS Collaboration (2026). "Measurement of the jet mass in hadronic decays of boosted W bosons at 13 TeV and extraction of the W boson mass". arXiv:2603.19963 [hep-ex].
- ^ a b CMS Collaboration (2022). "Measurement of the differential tt¯ production cross section as a function of the jet mass and extraction of the top quark mass in hadronic decays of boosted top quarks". arXiv:2211.01456 [hep-ex].
- ^ Amoroso, S. (2024). "NLOPS off-shell effects in precise determinations of the top-quark mass and width at the LHC". Proc. Sci. EPS-HEP2023: 313. arXiv:2407.16280.
Category:Least squares Category:Regression analysis Category:Statistical estimation Category:Statistical dispersion
Content Disclaimer
Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.
- The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
- There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
- It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
- Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
- Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.
