Unbiased Estimation of Central Moments in Unbalanced Two- and Three-Level Models
This paper derives closed-form unbiased estimators of central moments in multilevel random-effects models with unbalanced group sizes. In a two-level model, we provide unbiased estimators for the second, third, and fourth central moments under both group-level and observation-level averaging. In a three-level model, we provide unbiased estimators for the second and third central moments.
š” Research Summary
**
The paper develops closedāform unbiased estimators for central moments of latent components in hierarchical randomāeffects models when the panel is unbalanced, i.e., group sizes differ across clusters. The authors treat twoālevel and threeālevel linear mixed models and consider two natural weighting schemes: (i) groupālevel averaging, where each cluster receives equal weight, and (ii) observationālevel averaging, where each individual observation receives equal weight.
In the twoālevel setting the observed outcome is written as yᵢⱼ = uįµ¢ + vᵢⱼ, with uįµ¢ (groupālevel random effect) and vᵢⱼ (withināgroup error) mutually independent, identically distributed within each level, and with zero means. Sample means are defined for each cluster (ȳᵢ), for the whole sample using groupālevel weights (ȳ_g), and using observationālevel weights (ȳ_o). LemmaāÆA.1, proved in the appendix, gives the expectations of powers of a weighted sum of i.i.d. zeroāmean variables. Applying this lemma to the withināgroup deviations yᵢⱼāÆāāÆČ³įµ¢ yields unbiased estimators for the second, third, and fourth central moments of v (µāᵄ, µāᵄ, µāᵄ). The formulas involve simple sums of powers of the deviations divided by combinatorial factors that depend on the cluster size Jįµ¢ (e.g., Ī£ (Jįµ¢ā1) for the variance, Ī£ Jįµ¢(Jįµ¢ā1)(Jįµ¢ā2) for the third moment).
Having obtained unbiased estimates of the vāmoments, the authors turn to the groupālevel random effect u. By expressing the betweenāgroup deviation ȳᵢāÆāāÆČ³_g (or ȳᵢāÆāāÆČ³_o) as a weighted sum of uįµ¢ and the alreadyāestimated vāaverages, they derive linear systems that link the unknown moments of u (µāᵤ, µāᵤ, µāᵤ) to observable quantities and the previously estimated vāmoments. The systems are 2āÆĆāÆ2 for the variance and fourthāorder moment, and 3āÆĆāÆ3 for the thirdāorder moment. The coefficients a_{Ā·} in these systems are explicit functions of the cluster sizes (Jįµ¢) and the number of clusters (n). Solving the systems yields closedāform unbiased estimators for µāᵤ, µāᵤ, and µāᵤ under both weighting schemes.
The threeālevel model extends the framework to yᵢⱼā = uįµ¢ + vᵢⱼ + wᵢⱼā, adding a lowerālevel random component w. Analogous definitions of clusterālevel, groupālevel, and observationālevel means are introduced, now involving the additional dimensions Kᵢⱼ (size of the lowestālevel subāclusters). LemmaāÆA.1 is applied first to the withināsubācluster deviations to obtain unbiased estimators of µā_w and µā_w. These are then used to adjust the betweenāsubācluster and betweenāgroup deviations, leading to unbiased estimators of µā_v, µā_v, µā_u, and µā_u under both averaging schemes. The resulting formulas again involve sums of powers of centered observations divided by combinatorial factors that depend on both Jįµ¢ and Kᵢⱼ.
The paperās contribution is threefold. First, it provides the first known closedāform unbiased estimators for thirdāorder and fourthāorder central moments in unbalanced hierarchical models. Second, it treats both groupālevel and observationālevel averaging within a unified algebraic framework, allowing researchers to match the estimator to their sampling design. Third, it demonstrates that even for the fourth momentāwhere crossāproduct terms appearāthe problem can be reduced to solving a small linear system, making the estimators computationally tractable.
Practical implications are highlighted through examples where skewness and kurtosis of latent shocks matter, such as earnings dynamics, firm growth distributions, and insurance risk. The authors note that while the fourthāorder formulas are manageable, extending the approach to fifthāorder and higher moments would lead to increasingly complex systems, suggesting caseābyācase derivations or alternative methods (e.g., Bayesian priors or shrinkage) for future work. The paper thus equips applied econometricians with a rigorous toolkit for unbiased highāorder moment estimation in realistic, unbalanced multilevel data settings.
Comments & Academic Discussion
Loading comments...
Leave a Comment