Neyman allocation

Neyman allocation, also known as optimum allocation, is a method of sample size allocation in stratified sampling developed by Jerzy Neyman in 1934. This technique determines the optimal sample size for each stratum to minimize the variance of the estimated population parameter for a fixed total sample size and cost.

Theory

In stratified sampling, the population is divided into L mutually exclusive and exhaustive strata, and independent samples are drawn from each stratum. Neyman allocation determines the sample size nh for each stratum h that minimizes the variance of the estimated population mean or total. The Neyman allocation formula is:

where:

  • nh is the sample size for stratum h
  • n is the total sample size
  • Nh is the population size for stratum h
  • Sh is the standard deviation of the variable of interest in stratum h
  • Σ represents the sum over all strata

Mathematical derivation

The derivation of Neyman allocation follows from minimizing the variance of the stratified mean estimator subject to a fixed total sample size constraint. The variance of the stratified mean estimator is:

where fh = nh/Nh is the sampling fraction in stratum h. Using the method of Lagrange multipliers to minimize this variance subject to the constraint Σnh = n leads to the Neyman allocation formula.

Advantages

Neyman allocation offers several advantages over other allocation methods:

  • It provides the most statistically efficient allocation for estimating population means and totals when costs are equal across strata.
  • It takes into account both the size and variability of each stratum.
  • It generally results in smaller standard errors compared to proportional allocation.

Limitations

Despite its optimality properties, Neyman allocation has some practical limitations:

  • It requires prior knowledge of stratum standard deviations, which may not be available in practice.
  • The allocated sample sizes may not be integers and need to be rounded.
  • Very small strata may receive insufficient sample sizes for reliable estimation.
  • It may not be optimal when estimating multiple parameters simultaneously.

Applications

Neyman allocation is widely used in large-scale surveys and statistical studies, particularly in:

  • Official statistics and government surveys
  • Market research studies
  • Environmental sampling
  • Quality control in manufacturing
  • Educational assessment studies

When sampling costs differ across strata, the allocation can be modified to account for these differences, leading to cost-optimal allocation formulas.

See also

References

  • Neyman, J. (1934). "On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection". Journal of the Royal Statistical Society. 97 (4): 558–625.
  • Cochran, W. G. (1977). Sampling Techniques (3rd ed.). New York: John Wiley & Sons.

Content Disclaimer

Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.

  1. The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
  2. There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
  3. It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
  4. Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
  5. Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.