A NEW METHOD FOR DETERMINATION OF ADEQUATE SAMPLE SIZE
Keywords:
Coefficient of Variation, Clumped, Density, Normal Distribution, Random, Sample Size, UniformAbstract
Most population variables such as density seldom appear to be normally distributed. Therefore, the equations for normal distributions may overestimate the sample size required to obtain an accurate estimate of the variable. In this research, the total required sample size for obtaining the density with 5 and 10% precision was determined in random, uniform and clumped distribution patterns. A significant power relationship (p< 0.0001) was found between the actual sample size and the square of coefficient of variation for achieving 5 and 10% precision. The sample size calculated for normal distribution based on 95% confidence level and 5 and 10% precision was respectively 1 to 10 and 1 to 4 times the actual sample size obtained in the different distribution patterns. The overestimation increased with an increase in the values of coefficient of variation. A new method is presented based on population and sample coefficient of variation for estimating the required sample size. The new method can be used to obtain a reliable estimate of sample size using a wide range of data types in different study designs.
Downloads
References
Alwin, T.G., Fox, M.G. and Spence Cheruvelil, K.. (2010). Estimating lake- wide watermilfoil weevil (Euhrychiopsis lecontei) density: the roles of quadrat size, sample size, and effort. Journal of Aquatic Plant Management (JAPM), 48, p.96.
Anderson, S. and Marcus, L.F. (1993). Effect of quadrat size on measurements of species density, Journal of Biogeography, p.421-428.
Bartlett, M.S. (1947). The use of transformations, Biometrics, 3(1), p.39-52.
Becker, D.A. and Crockett, J.J. (1973). Evaluation of sampling techniques on tall-grass prairie, Journal of Range Management, 26, p.61-65.
Bonham, C. D. (1989). Measurements for Terrestrial Vegetation, NY: John Wiley and Sons, 338.
Castano-Meneses, G., García-Franco, J.G. and Palacios-Vargas, J.G. (2003). Spatial distribution patterns of Tillandsia violacea (Bromeliaceae) and support trees in an altitudinal gradient from a temperate forest in Central Mexico, Selbyana, 24 (1) p. 71-77.
Charan, J. and Biswas, T. (2013). How to calculate sample size for different study designs in medical research?. Indian Journal of Psychological Medicine, 35(2), p.121-126.
Chiang, Y.C., Li, D. and Jane, H.A. (2017). Wild or tended nature? The effects of landscape location and vegetation density on physiological and psychological responses, Landscape and Urban Planning, 167, p.72-83.
Chow, Shein-Chung, Jun Shao, Hansheng Wang, and Yuliya Lokhnygina (2017). Sample Size Calculations in Clinical Research, Chapman and Hall/CRC Biostatistics Series.
Clark, D.L. (2001). Stabilization of the mean as a demonstration of sample adequacy, Proceedings of America Society of Mining and Reclamation, p.65- 69.
Cochran, W.G. (1977). Sampling Techniques. John Wiley & Sons.
Cowling, R.M., Gallien, L., Richardson, D.M. and Ojeda, F. (2018). What predicts the richness of seeder and resprouter species in fire‐prone Cape fynbos: Rainfall reliability or vegetation density?, Austral Ecology, 43, 614- 622.
Cundill, B. and Alexander, N.D. (2015). Sample size calculations for skewed distributions, BMC Medical Research Methodology, 15(1), p.28 (9 pages).
Curran-Everett, D. (2018). Explorations in Statistics: the log transformation, Advances in Physiology Education, 42(2), p. 343-347.
Dacier, A., de Luna, A.G., Fernandez‐Duque, E. and Di Fiore, A. (2011). Estimating population density of Amazonian titi monkeys (Callicebus discolor) via playback point counts, Biotropica, 43(2), p.135-140.
Eng, J. (2003). Sample size estimation: how many individuals should be studied?. Radiology, 227(2), p. 309-313.
Fitzner, K. and Heckinger, E. (2010). Sample size calculation and power analysis: a quick review, The Diabetes Educator, 36(5), p.701-707.
Gardner, W., Mulvey, E.P. and Shaw, E.C. (1995). Regression analyses of counts and rates: Poisson, overdispersed Poisson and negative binomial models, Psychological Bulletin, 118(3), p.392.
Gregoire, T.G. and Affleck, D.L. (2018). Estimating desired sample size for simple random sampling of a skewed population, The American Statistician, 72(2), p.184-190.
Haukos, D.A., Sun, H.Z., Wester, D.B. and Smith, L.M. (1998). Sample size, power and analytical considerations for vertical structure data from profile boards in wetland vegetation, Wetlands, 18(2), p.203-215.
Heady, H.F. and Van Dyne, G.M. (1965). Prediction of weight composition from point samples on clipped herbage, Journal of Range Management, 18, p. 144-148.
Hinkley, D.V. (2014). Angular Transformation, Wiley Stats Ref: Statistics Reference Online.
Hoda, M.N. (2018). The square root transformation, International Journal oMathematical Archive EISSN 2229-5046, 9(4), p. 96-97.
Kelley, K. (2007). Sample size planning for the coefficient of variation from the accuracy in parameter estimation approach, Behavior Research Methods, 39(4), p. 755-766.
Kim, K.Y., Chung, H.C. and Rha, S.Y. (2009). A weighted sample size for microarray datasets that considers the variability of variance and multiplicity, Journal of Bioscience and Bioengineering, 108(3), p.252-258.
Kuno, E. (1968). Use of the regression of mean crowding on mean density for estimating sample size and the transformation of data for the analysis of variance, Researches on Population Ecology, 10(2), p.210-214.
Lang, T. (2004). Twenty statistical errors even you can find in biomedical research articles, Croatian Medical Journal, 45, p. 361-370
Larson, E.R., DiStefano, R.J., Magoulick, D.D. and Westhoff, J.T. (2008). Efficiency of a quadrat sampling technique for estimating riffle-dwelling crayfish density, North American Journal of Fisheries Management, 28(4), p. 1036-1043.
Lickfeldt, D.W., Voigt, T.B. and Hamblin, A.M., 2002. Cultivar composition and spatial patterns in Kentucky bluegrass blends. Crop science, 42(3), pp.842-847.
Ludwig, J.A., Quartet, L. and Reynolds, J.F. (1988). Statistical Ecology: A Primer in Methods and Computing (Vol. 1), John Wiley & Sons.
Lyon, L.J. (1968). An evaluation of density sampling methods in a shrub community, Journal of Range Management, 21, p.16-20.
Mahajan, M. and Fatima, S. (2017). Frequency, abundance, and density of plant species by list count quadrat method, International Journal of Multidisciplinary Research, 3(7), p.1-8.
Malone, C.R. (1968). Determination of peak standing crop biomass of herbaceous shoots by the harvest method, American Midland Naturalist, 79, p.429-435.
Malterud, K., Siersma, V.D. and Guassora, A.D. (2016). Sample size in qualitative interview studies: guided by information power, Qualitative Health Research, 26(13), p.1753-1760.
McIntyre, G.A. (1953). Estimation of plant density using line transects, Journal of Ecology, 41(2), p. 319-330.
Morisita, M. (1962). I σ-Index, a measure of dispersion of individuals, Researches on Population Ecology, 4(1), p.1-7.
Mousaei Sanjerehei, M, and Basiri, M. (2008). Comparison and evaluation of density measurement methods on Artemisia sieberi shrublands in Yazd province, Iranian Journal of Natural Resources, 61, p. 235–251.
Mousaei Sanjerehei, M. M. (2011). Determination of an appropriate quadrat size and shape for detecting association between plant species, Ecological modeling, 222(10), p. 1790-1792.
Mousaei Sanjerehei, M.M. and Bassiri, M. (2007). comparison and evaluation of indices of dispersion patterns of plants on artemisia sieberi shrublands in Yazd province, JWSS-Isfahan University of Technology, 11(40), p.483-495.
Mousaei Sanjerehei, M.M., Jafari, M., Mataji, A., Meybodi, N.B. and Bihamta, M.R. (2011). Facilitative and competitive interactions between plant species (an example from Nodushan rangelands, Iran). Flora-Morphology, Distribution, Functional Ecology of Plants, 206(7), p.631-637.
Olen, S.M., Bookhagen, B. and Strecker, M.R. (2016). Role of climate and vegetation density in modulating denudation rates in the Himalaya, Earth and Planetary Science Letters, 445, p.57-67.
Otypkova, Z. and Chytry, M. (2006). Effects of plot size on the ordination of vegetation samples, Journal of Vegetation Science, 17(4), p.465-472.
Papanastasis, V.P. (1977). Optimum size and shape of quadrat for sampling herbage weight in grasslands of northern Greece, Journal of Range Management, 30(6), p.446-448.
Picard, N., Nouvellet, Y. and Sylla, M.L. (2004). Relationship between plot size and the variance of the density estimator in West African savannas, Canadian Journal of Forest Research, 34(10), p.2018-2026.
Rasmussen, H. E., Ma, R., & Wang, J. J. (2019). Controlling type 1 error rate for sequential, bioequivalence studies with crossover designs, Pharmaceutical Statistics, 18(1), p. 96-105.
Roraas, T., Petersen, P.H. and Sandberg, S. (2012). CIs and power calculations for within-person biological variation: effect of analytical imprecision, number of replicates, number of samples and number of individuals, Clinical Chemistry, 58(9), p. 1306-1313.
Signorini, D. F. (1991). Sample size for Poisson regression, Biometrika, 78(2), p. 446-450.
Singh, H. P. (1986). A note on the estimation of variance of sample mean using the knowledge of coefficient of variation in normal population, Communications in Statistics-Theory and Methods, 15(12), p. 3737-3746.
Strong, C.W. (1966). An improved method of obtaining density from line‐transect data, Ecology, 47(2), p. 311-313.
Taranets, I. P., Smurov, A. V. and Kuznetsova, N. A. (2012). Different size sample representativeness in population accounting and estimation of horizontal distribution of soil-dwelling Collembola, Moscow University Biological Sciences Bulletin, 67(3-4), p. 126-129.
Van Belle, G. and Martin, D. C. (1993). Sample size as a function of coefficient of variation and ratio of means, The American Statistician, 47(3), p. 165-167.
Ward, M., Güntert, A., Campbell, J. and Pike, I. (2009). Proteomics for brain disorders—the promise for biomarkers, Annals of the New York Academy of Sciences, 1180(1), p. 68-74.
Zhu, H. and Lakkis, H. (2014). Sample size calculation for comparing two negative binomial rates, Statistics in Medicine, 33(3), p. 376-387.