Biostatistics Practice Questions

Q: Which statement is TRUE about the standard normal distribution curve?

Standard Deviation = 1, Mean = 0. ### Explanation The **Standard Normal Distribution (SND)**, also known as the **Z-distribution**, is a specific type of normal distribution used in biostatistics to compare different sets of data by converting raw scores into standard scores (Z-scores). **Why Option B is Correct:** By definition, a Standard Normal Distribution is a normal distribution that has been "standardized" to have a **Mean ($\mu$) of 0** and a **Standard Deviation ($\sigma$) of 1**. This allows researchers to determine the probability of a value occurring within a certain number of standard deviations from the mean using a universal Z-table. **Analysis of Incorrect Options:** * **Option A:** This is mathematically incorrect. A standard deviation cannot be 0 in a distribution (as there would be no variation), and the mean must be 0 for standardization. * **Options C & D:** A normal distribution (and by extension, the SND) is **perfectly symmetrical** and bell-shaped. By definition, it has **zero skewness**. In a skewed distribution, the mean, median, and mode do not coincide; however, in an SND, Mean = Median = Mode = 0. **High-Yield Clinical Pearls for NEET-PG:** * **Z-score Formula:** $Z = (x - \mu) / \sigma$. It indicates how many standard deviations a value is from the mean. * **Area under the curve:** * Mean ± 1 SD: **68.2%** of values * Mean ± 2 SD: **95.4%** of values * Mean ± 3 SD: **99.7%** of values * **Total Area:** The total area under the curve is always **1 (or 100%)**. * **Point of Inflection:** In an SND, the curve changes from convex to concave at ±1 SD.

Q: If each value of a given group of observations is multiplied by 10, what is the standard deviation of the resulting observations?

Original standard deviation x 10. ### Explanation **1. Why Option A is Correct:** Standard Deviation (SD) is a measure of dispersion that quantifies the spread of data points around the mean. In biostatistics, the properties of SD regarding mathematical operations are high-yield: * **Multiplication/Division:** If every observation in a data set is multiplied or divided by a constant ($k$), the new standard deviation is the original standard deviation multiplied or divided by that same constant ($k$). * **Reasoning:** Since SD is expressed in the same units as the original data, scaling the data by 10 scales the spread (distance between points) by exactly 10. **2. Why Other Options are Incorrect:** * **Option B:** This would only occur if every observation were divided by 10. * **Option C:** Standard deviation is never affected by subtraction or addition in this manner. * **Option D:** The SD remains the same only if a constant is **added or subtracted** from every observation. This is because adding a constant shifts the entire distribution (changing the mean) but does not change the distance between the values (the spread). **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Change of Origin vs. Scale:** * SD is **independent** of change of origin (addition/subtraction). * SD is **dependent** on change of scale (multiplication/division). * **Variance:** If observations are multiplied by $k$, the **Variance** (which is $SD^2$) increases by $k^2$. In this question, the variance would increase by 100 ($10^2$). * **Coefficient of Variation (CV):** If every value is multiplied by a constant, the CV remains **unchanged** (because both the Mean and SD increase proportionately). * **Standard Error (SE):** SE is calculated as $SD / \sqrt{n}$. If SD increases 10-fold and sample size remains the same, the SE also increases 10-fold.

Q: The list of all units in a population is called:

Sampling Frame. ### Explanation **Correct Answer: B. Sampling Frame** In biostatistics, the **Sampling Frame** is the actual list or register of all the individual units (elements) from which a sample is drawn. It serves as the operational definition of the target population. For example, if a researcher wants to study the prevalence of hypertension in a specific village, the electoral roll or the village health register containing the names of all residents acts as the sampling frame. **Analysis of Incorrect Options:** * **A. Random Sampling:** This is a **technique** or method of selecting a sample where every unit has an equal and known chance of being selected. It is a process, not a list. * **C. Bias:** This refers to a **systematic error** in the design, conduct, or analysis of a study that results in a mistaken estimate of an exposure's effect on the risk of disease. * **D. Parameter:** This is a **numerical value** (like mean or proportion) that describes a characteristic of the entire population (e.g., the true mean blood pressure of all Indians). Values derived from a sample are called "Statistics." **High-Yield Clinical Pearls for NEET-PG:** * **Sampling Unit:** The individual entity chosen from the sampling frame (e.g., a person, a household, or a hospital bed). * **Sampling Fraction:** The ratio of the sample size ($n$) to the total population size ($N$). Formula: $n/N$. * **Probability vs. Non-Probability Sampling:** Random sampling (Simple, Stratified, Systematic, Cluster, Multi-stage) allows for the calculation of sampling error, whereas non-probability sampling (Quota, Convenience, Snowball) does not. * **Gold Standard:** Simple Random Sampling is the most basic probability sampling design where every unit has an equal probability of inclusion.

Q: According to the WHO recommended Expanded Programme on Immunization (EPI) cluster sampling method for assessing primary immunization coverage, what is the specified age group of children to be surveyed?

12-23 months. ### Explanation **1. Why 12-23 months is the Correct Answer:** The primary goal of the WHO EPI cluster sampling survey is to assess **primary immunization coverage**. According to the National Immunization Schedule, a child is considered "fully immunized" only after receiving all primary vaccines (BCG, 3 doses of DPT/Pentavalent, 3 doses of OPV, and Measles/MR) by the age of 12 months. Therefore, to evaluate if a child has successfully completed this cycle, the survey targets children who have just passed this milestone—the **12-23 month age group**. This ensures that the data reflects the most recent completion of the primary schedule. **2. Analysis of Incorrect Options:** * **0-12 months (Option A):** Children in this age group are still in the process of receiving their primary vaccines. Including them would lead to an underestimation of coverage, as many would not yet be eligible for the Measles/MR vaccine (given at 9-12 months). * **6-12 months (Option B) & 9-12 months (Option C):** These ranges are too narrow and exclude children who may have completed their schedule slightly late. They do not provide a statistically representative window for assessing "completed" status. **3. High-Yield Clinical Pearls for NEET-PG:** * **The 30 x 7 Design:** The EPI cluster survey traditionally uses **30 clusters**, with **7 children** sampled from each cluster (Total N = 210). * **Sampling Technique:** It utilizes **Two-Stage Stratified Cluster Sampling**. The first stage (selecting clusters) is based on **Probability Proportional to Size (PPS)**. * **Primary Objective:** It is designed to estimate immunization coverage with a precision of **+/- 10%** and a **95% confidence level**. * **Recent Update:** While the classic EPI method uses 30x7, modern WHO surveys (2018 onwards) often use larger sample sizes and more complex designs, but for NEET-PG, the **12-23 months** and **30x7** remain the gold standard facts.

Q: What is the denominator used in the calculation of the maternal mortality rate?

None of the above. ### Explanation The correct answer is **None of the above** because the denominator for the **Maternal Mortality Rate (MMR)** is **100,000 live births**. In biostatistics and public health, it is crucial to distinguish between a "Ratio" and a "Rate." Despite its name, the Maternal Mortality Rate is technically a **ratio** because the numerator (maternal deaths) is not a subset of the denominator (live births). #### Analysis of Options: * **A. 1,000 live births:** This is the multiplier used for the Infant Mortality Rate (IMR) and Neonatal Mortality Rate (NMR), not MMR. * **C. 1,000 total births:** Total births (live births + stillbirths) are used as the denominator for the **Perinatal Mortality Rate**. * **D. Mid-year population:** This is the denominator for the **Crude Death Rate** or **Maternal Mortality Ratio (per mid-year population)** in some older demographic contexts, but it is not the standard for MMR. #### High-Yield Clinical Pearls for NEET-PG: * **Definition of Maternal Death:** Death of a woman while pregnant or within **42 days** of delivery, irrespective of the duration and site of pregnancy, from any cause related to or aggravated by the pregnancy. * **MMR Formula:** (Number of maternal deaths / Total number of live births) × **100,000**. * **Maternal Mortality Ratio vs. Rate:** In some advanced texts, "Maternal Mortality Rate" uses the number of women of reproductive age (15–49 years) as the denominator, while "Maternal Mortality Ratio" uses live births. However, in the context of standard Indian health statistics (like SRS), the term "Rate" is often used interchangeably with the 100,000 live birth denominator. * **Current Trend:** Always remember the latest SRS (Sample Registration System) data for India's MMR for potential image-based or fact-based questions.

Q: What is the number of degrees of freedom in a 4x4 contingency table?

9. ### Explanation **1. Why the Correct Answer is Right** In biostatistics, the **Degrees of Freedom (df)** represents the number of values in a final calculation that are free to vary. For a contingency table used in a Chi-square test, the formula to calculate degrees of freedom is: **$df = (r - 1) \times (c - 1)$** * Where **$r$** = number of rows * Where **$c$** = number of columns For a **4x4 table**: $df = (4 - 1) \times (4 - 1)$ $df = 3 \times 3 = \mathbf{9}$ Conceptually, this means if you know the marginal totals (row and column sums) of a 4x4 table, you only need to know 9 cell values to determine the remaining 7 cells. **2. Why the Other Options are Wrong** * **Option A (4):** This is simply the number of rows or columns ($r$ or $c$), which does not account for the interaction between them. * **Option B (8):** This is often a result of adding $(r-1) + (c-1)$, which is $3 + 3 = 6$ (incorrectly calculated here as 8), or confusing the formula. * **Option D (16):** This is the total number of cells ($r \times c$). It ignores the fact that the row and column totals constrain the variability of the data. **3. Clinical Pearls & High-Yield Facts for NEET-PG** * **Chi-Square Test:** The most common application of this formula is the Chi-square test, used to compare **proportions** or test the **association between two categorical variables**. * **2x2 Table:** The most high-yield table in exams. Its $df$ is always **1** $[(2-1) \times (2-1)]$. * **Yates’ Correction:** Applied only to a 2x2 contingency table when the expected frequency in any cell is **< 5**. * **Standard Normal Curve:** The $df$ for a t-test is **$n - 1$** (for a single sample) or **$(n1 + n2) - 2$** (for two independent samples).

Q: Rejecting the null hypothesis when it is actually true is known as:

Type I error. ### Explanation In biostatistics, hypothesis testing involves making a decision about a population based on sample data. The **Null Hypothesis ($H_0$)** typically states that there is no difference or association between variables. **Why Type I Error is Correct:** A **Type I error** occurs when we **reject the null hypothesis when it is actually true**. In clinical terms, this is a "False Positive" result—concluding that a treatment works or a difference exists when, in reality, it does not. The probability of committing a Type I error is denoted by **$\alpha$ (alpha)**, which is usually set at 0.05 (5%) in medical research. **Analysis of Incorrect Options:** * **Type II error ($\beta$):** This occurs when we **fail to reject a null hypothesis that is actually false**. This is a "False Negative"—concluding there is no effect when one actually exists. * **Power ($1-\beta$):** This is the probability of correctly rejecting a false null hypothesis (detecting a difference that truly exists). It represents the study's ability to avoid a Type II error. * **Specificity:** While related to diagnostic testing, in the context of hypothesis testing, the probability of correctly failing to reject a true null hypothesis ($1-\alpha$) is analogous to specificity (correctly identifying those without the disease). **NEET-PG High-Yield Pearls:** * **$\alpha$ (Alpha):** Maximum tolerable probability of Type I error (Level of significance). * **$\beta$ (Beta):** Probability of Type II error. * **Confidence Level ($1-\alpha$):** Probability of correctly accepting a true null hypothesis. * **Power ($1-\beta$):** Ideally should be $\geq 80\%$. It is increased by increasing the sample size. * **Memory Aid:** Type **I** is **I**ncorrectly rejecting; Type **II** is **I**ncorrectly accepting (failing to reject).

Question 1

Which of the following are non-random sampling methods?

Accepted Answer

Cluster Sampling

Answer

Quota sampling

Answer

Stratified random sampling

Answer

Convenience Sampling

Question 2

In a community of 5000 people, the crude birth rate is 30 per 1000 people. What is the number of pregnant females?

Accepted Answer

150

Answer

65

Answer

175

Answer

200

Question 3

Which statement is TRUE about the standard normal distribution curve?

Accepted Answer

Standard Deviation = 1, Mean = 0

Answer

Mean = -1, Standard Deviation = 0

Answer

Curve skews towards the left

Answer

Curve skews towards the right

Question 4

If each value of a given group of observations is multiplied by 10, what is the standard deviation of the resulting observations?

Accepted Answer

Original standard deviation x 10

Answer

Original standard deviation / 10

Answer

Original standard deviation - 10

Answer

Original standard deviation itself

Question 5

The list of all units in a population is called:

Accepted Answer

Sampling Frame

Answer

Random sampling

Answer

Bias

Answer

Parameter

Question 6

According to the WHO recommended Expanded Programme on Immunization (EPI) cluster sampling method for assessing primary immunization coverage, what is the specified age group of children to be surveyed?

Accepted Answer

12-23 months

Answer

0-12 months

Answer

6-12 months

Answer

9-12 months

Question 7

What is the denominator used in the calculation of the maternal mortality rate?

Accepted Answer

None of the above

Answer

1,000 live births

Answer

1,000 total births

Answer

Mid-year population

Question 8

Which of the following is NOT true about cluster sampling?

Accepted Answer

The sample size is the same as that of simple random sampling.

Answer

It is a two-stage sampling method.

Answer

It is cheaper than other methods of sampling.

Answer

It has the disadvantage of higher sampling error.

Question 9

What is the number of degrees of freedom in a 4x4 contingency table?

Accepted Answer

9

Answer

4

Answer

8

Answer

16

Question 10

Rejecting the null hypothesis when it is actually true is known as:

Accepted Answer

Type I error

Answer

Type II error

Answer

Power

Answer

Specificity

Biostatistics — MCQs

Biostatistics — MCQs

On this page

Practice by Chapter

Want unlimited practice?