Question 1

How do I determine the right sample size for my study?

Accepted Answer

Sample size depends on four key factors: (1) Confidence level - typically 95% for biological research, meaning you want to be 95% confident your results reflect the true population. (2) Margin of error - the acceptable range of uncertainty, usually 3-5% for surveys. (3) Population proportion - if unknown, use 50% as it gives the maximum (most conservative) sample size. (4) Population size - for small populations, a finite population correction reduces the required sample. For clinical trials, you also need to consider effect size, power, and expected dropout rate. Start with the statistical requirements and then add a buffer of 10-20% for non-response or data quality issues.

Question 2

What is margin of error and how does it affect sample size?

Accepted Answer

Margin of error (also called confidence interval width) defines how close your sample estimate will be to the true population value. A margin of error of plus or minus 3% means if your sample shows 60%, the true value is likely between 57% and 63%. Reducing margin of error dramatically increases required sample size: going from 5% to 3% nearly triples the sample size, and going from 5% to 1% increases it 25-fold. In biological research, acceptable margins depend on the precision needed. Drug efficacy studies may need plus or minus 2%, while ecological surveys may accept plus or minus 10%.

Question 3

Why does using p=50% give the most conservative sample size?

Accepted Answer

The formula includes p*(1-p), which is maximized when p=0.50 (giving 0.25). If you know the true proportion is near 10% or 90%, p*(1-p)=0.09, requiring much fewer samples. Using p=50% guarantees your sample is large enough regardless of the actual proportion. However, if you have strong prior evidence about the proportion (from pilot studies or previous research), using a more realistic estimate can significantly reduce your required sample size and save resources. For biostatistics studies where the outcome prevalence is known to be rare (e.g., 5%), using p=0.05 can reduce sample requirements by 75%.

Question 4

How should I account for non-response or dropout in my sample size?

Accepted Answer

Always inflate your calculated sample size to account for anticipated non-response or dropout. The adjusted size is n_adjusted = n / response_rate. Common response rates: mailed surveys 30-50%, online surveys 10-30%, clinical trials 70-90%, in-person interviews 60-80%. For a 12-month clinical trial expecting 20% dropout, multiply your sample by 1.25 (divide by 0.80). For multi-year longitudinal studies, compound the dropout rate: if 10% drop out each year over 3 years, retention is 0.9^3 = 72.9%. Always report both your target and achieved sample sizes in publications.

Sample Size Calculator

Formula

Worked Examples

Example 1: Survey of Hospital Patients

Example 2: Ecological Bird Population Study

Frequently Asked Questions

How do I determine the right sample size for my study?

What is margin of error and how does it affect sample size?

Why does using p=50% give the most conservative sample size?

How should I account for non-response or dropout in my sample size?

References