Biostatistics Practice Questions

Q: If one wants to compare variability between two characteristics with different means, which measurement should be used?

Coefficient of variation. ### Explanation **1. Why Coefficient of Variation (CV) is correct:** The **Coefficient of Variation (CV)** is the most appropriate measure for comparing variability between two groups with different units or significantly different means. It is a **relative measure** of dispersion, calculated as: $$CV = \frac{\text{Standard Deviation (SD)}}{\text{Mean}} \times 100$$ By expressing the SD as a percentage of the mean, it "normalizes" the data, allowing for a fair comparison. For example, comparing the variability of birth weight (in kg) versus adult height (in cm) is only possible using CV. **2. Why the other options are incorrect:** * **Standard Deviation (SD):** This is an absolute measure of dispersion. It is expressed in the same units as the mean. It is useful for describing variability within a single distribution but cannot be used to compare two groups with different units or vastly different scales (e.g., comparing the weight of elephants vs. mice). * **Variance:** This is simply the square of the SD ($SD^2$). Like SD, it is an absolute measure and is sensitive to the scale of the data. * **Percentile:** This is a measure of **position**, not variability. It indicates the value below which a certain percentage of observations fall (e.g., the 50th percentile is the median). **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Unitless Measure:** CV is a pure number (unitless), making it ideal for comparing different parameters. * **Precision:** In laboratory medicine, a lower CV indicates higher **precision** of a diagnostic test. * **Standard Error (SE):** Do not confuse SD with SE. SD measures the scatter of observations around the mean, while SE measures the scatter of "sample means" around the "population mean" (used for calculating Confidence Intervals). * **Normal Distribution:** In a Gaussian curve, Mean = Median = Mode. SD is used here to define the 68-95-99.7 rule.

Q: Anemia is classified into mild, moderate, and severe on which scale?

Ordinal. ### Explanation **1. Why Ordinal is Correct:** The **Ordinal scale** is used for data that can be categorized into distinct groups with a **natural rank or order**, but where the exact mathematical difference between the categories is not defined. In this case, anemia is classified into Mild, Moderate, and Severe. There is a clear progression in severity (Severe > Moderate > Mild), but the "distance" between mild and moderate is not necessarily the same as the distance between moderate and severe. **2. Why the Other Options are Incorrect:** * **Nominal (B):** This scale is for naming or labeling categories without any inherent order (e.g., Blood groups A, B, AB, O; Gender; or Yes/No). Since anemia severity has a specific rank, it is not nominal. * **Interval (A):** This scale has a defined order and equal intervals between values, but **no absolute zero** (e.g., Temperature in Celsius). Anemia categories do not have equal mathematical intervals. * **Ratio (D):** This is the highest level of measurement. It has equal intervals and a **true zero point** (e.g., Height, Weight, or the actual Hemoglobin value in g/dL). While Hemoglobin concentration itself is a ratio scale, the *classification* into mild/moderate/severe is ordinal. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Mnemonic for Scales:** **NOIR** (Nominal < Ordinal < Interval < Ratio) in increasing order of statistical power. * **Qualitative Data:** Includes Nominal and Ordinal scales. * **Quantitative Data:** Includes Interval and Ratio scales. * **Key Distinction:** If you can say "A is more than B" but cannot say "by exactly how much," it is **Ordinal**. * **Common Ordinal Examples in Exams:** TNM Staging of cancer, APGAR score, Glasgow Coma Scale (GCS), and Likert scales (Strongly agree to Strongly disagree).

Q: The weights of 9 students in a class are as follows: 72, 74, 76, 72, 76, 78, 80, 73, and 72. What is the mode?

72. ### Explanation **1. Why the Correct Answer is Right:** In biostatistics, the **Mode** is defined as the value that occurs with the highest frequency in a given data set. It represents the most "popular" or common observation. To find the mode, we count the frequency of each value in the provided dataset: * **72:** Appears **3 times** (72, 72, 72) * **73:** Appears 1 time * **74:** Appears 1 time * **76:** Appears 2 times * **78:** Appears 1 time * **80:** Appears 1 time Since the value **72** occurs most frequently (3 times), it is the mode of this distribution. **2. Why the Other Options are Incorrect:** * **Option B (74):** This value occurs only once. It is close to the mean but does not meet the criteria for the mode. * **Option C (76):** This value occurs twice. While more frequent than others, it is still less frequent than 72. * **Option D (78):** This value occurs only once. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Measures of Central Tendency:** The three main measures are Mean (average), Median (middle value), and Mode (most frequent). * **Unimodal vs. Bimodal:** A dataset with one mode is unimodal. If two values tied for the highest frequency, it would be called **bimodal**. * **Effect of Outliers:** The Mode is the **least affected** by extreme values (outliers), whereas the Mean is the most affected. * **Qualitative Data:** The Mode is the only measure of central tendency that can be used for **nominal (qualitative) data** (e.g., most common blood group in a population). * **Relationship in Normal Distribution:** In a perfectly symmetrical bell-shaped curve, **Mean = Median = Mode**.

Q: Which of the following is true about a right-skewed distribution curve?

Predominance of higher values. ### Explanation In biostatistics, the "skewness" of a distribution refers to its asymmetry. A **Right-Skewed Distribution** (also known as a **Positively Skewed Distribution**) occurs when the tail of the curve extends toward the right side (higher values). **1. Why Option A is Correct:** In a right-skewed distribution, the majority of data points are clustered at the lower end of the scale, but there are a few **extreme outliers with very high values**. These outliers "pull" the mean toward the right. Therefore, the distribution is characterized by a **predominance of higher values** in the tail, which dictates the direction of the skew. **2. Why the other options are incorrect:** * **Option B:** While the "bulk" or mode of the data is at the lower end, the term "skewed" specifically describes the direction of the outliers/tail. A predominance of lower values in the tail would be a left-skew. * **Option C:** A right-skewed distribution has a **longer tail to the right**. A longer tail to the left indicates a negatively skewed distribution. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **The "Mean-Median-Mode" Rule:** In a right-skewed distribution, the relationship is always: **Mean > Median > Mode**. The Mean is most affected by outliers and is pulled furthest toward the tail. * **Clinical Example:** Income distribution or incubation periods of most infectious diseases (e.g., Salmonellosis) are typically right-skewed. * **Memory Aid:** The tail tells the tale. If the tail points to the **Right** (Positive side of the X-axis), it is **Right/Positively** skewed.

Q: Which of the following statements about a standard normal distribution curve is true?

Mean equals median.. ### Explanation **1. Why the Correct Answer is Right:** A **Standard Normal Distribution (Z-distribution)** is a specific type of normal distribution that is perfectly **symmetrical** and bell-shaped. In any perfectly symmetrical distribution, the central tendencies—**Mean, Median, and Mode—are all equal** and coincide at the peak of the curve (the center). This divides the area under the curve into two equal halves of 0.5 each. **2. Why the Incorrect Options are Wrong:** * **Option A:** In a normal distribution, the Mean is equal to the Median, not twice its value. A mean significantly different from the median indicates a "skewed" distribution. * **Option C:** In a *Standard* Normal Distribution, the **Median is 0**, while the **Variance is 1**. Therefore, they are not equal. * **Option D:** Variance is the square of the Standard Deviation ($\sigma^2$). For a Standard Normal Distribution, both the Standard Deviation ($\sigma$) and Variance ($\sigma^2$) are equal to **1**. Thus, the SD is not twice the variance. **3. High-Yield Clinical Pearls for NEET-PG:** * **Parameters of Standard Normal Distribution:** Mean ($\mu$) = 0 and Standard Deviation ($\sigma$) = 1. * **Total Area:** The total area under the curve is always **1 (or 100%)**. * **The 68-95-99.7 Rule (Empirical Rule):** * Mean ± 1 SD covers **68.2%** of values. * Mean ± 2 SD covers **95.4%** of values. * Mean ± 3 SD covers **99.7%** of values. * **Z-score:** This represents the number of standard deviations a data point is from the mean. It is calculated as: $Z = (X - \mu) / \sigma$.

Q: The rate adjusted to allow for the age distribution of the population is:

Age standardized mortality rate. ### Explanation **Correct Answer: D. Age standardized mortality rate** **Why it is correct:** Mortality is heavily influenced by the age structure of a population; for instance, a population with more elderly individuals will naturally have a higher number of deaths. To compare the health status of two different populations (e.g., India vs. Japan), we must eliminate the confounding effect of age. **Age standardization** (or adjustment) is a statistical technique that applies the age-specific death rates of a local population to a "Standard World Population." This ensures that any observed difference in mortality is due to actual health factors rather than differences in the age makeup. **Why the other options are incorrect:** * **A. Perinatal mortality rate:** This measures late fetal deaths (after 28 weeks) and early neonatal deaths (first week of life). It is a specific indicator of obstetric and pediatric care, not a measure adjusted for the general population's age distribution. * **B. Crude mortality rate (CDR):** This is the simplest measure of mortality (Total deaths / Mid-year population × 1000). It is "crude" precisely because it **does not** account for age or sex distribution, making it unsuitable for comparing different regions. * **C. Fertility rate:** This measures the reproductive performance of a population (births), not mortality. **High-Yield Clinical Pearls for NEET-PG:** * **Standardized Mortality Ratio (SMR):** Used in **Indirect Standardization**. Formula: (Observed Deaths / Expected Deaths) × 100. * **Direct Standardization:** Used when age-specific death rates of the population under study are known. * **Age** is the most important confounding factor in epidemiology when comparing disease or death rates across different geographic areas. * The **"Standard World Population"** (Segi’s or WHO) is the most common reference used for calculating these rates.

Q: Systolic blood pressure of a group of individuals follows a normal distribution curve with a mean of 120. What percentage of values are above the mean?

50%. ### Explanation **1. Why the Correct Answer is Right:** The fundamental property of a **Normal Distribution (Gaussian Distribution)** is that it is perfectly **symmetrical** and bell-shaped. In a normal distribution, the **Mean, Median, and Mode are all equal** and located at the center of the curve. Because the median represents the 50th percentile, exactly half (50%) of the observations lie below the mean and the other half (**50%**) lie above the mean. Regardless of the specific numerical value of the mean (in this case, 120 mmHg), the area under the curve to the right of the center always represents 50% of the total population. **2. Why the Incorrect Options are Wrong:** * **Option A (25%):** This represents the first quartile (Q1) or the area beyond 0.67 standard deviations from the mean, not the division at the mean itself. * **Option B (75%):** This would represent the area below the third quartile (Q3). * **Option D (100%):** This would encompass the entire population under the curve. By definition, a mean cannot have 100% of values above it unless the distribution is extremely skewed or degenerate. **3. NEET-PG High-Yield Clinical Pearls:** * **Symmetry:** In a Normal Distribution, Skewness = 0 and Kurtosis = 3. * **The 68-95-99.7 Rule (Empirical Rule):** * Mean ± 1 SD covers **68.2%** of values. * Mean ± 2 SD covers **95.4%** of values. * Mean ± 3 SD covers **99.7%** of values. * **Standard Normal Distribution:** A special case where the Mean = 0 and Standard Deviation = 1. * **Z-score:** Indicates how many standard deviations a value is from the mean. A value at the mean has a Z-score of 0.

Question 1

If one wants to compare variability between two characteristics with different means, which measurement should be used?

Accepted Answer

Coefficient of variation

Answer

Standard deviation

Answer

Percentile

Answer

Variance

Question 2

Anemia is classified into mild, moderate, and severe on which scale?

Accepted Answer

Ordinal

Answer

Interval

Answer

Nominal

Answer

Ratio

Question 3

The weights of 9 students in a class are as follows: 72, 74, 76, 72, 76, 78, 80, 73, and 72. What is the mode?

Accepted Answer

72

Answer

74

Answer

76

Answer

78

Question 4

Which of the following is true about a right-skewed distribution curve?

Accepted Answer

Predominance of higher values

Answer

Predominance of lower values

Answer

Longer tail to the left

Answer

All of the above

Question 5

Which of the following statements about a standard normal distribution curve is true?

Accepted Answer

Mean equals median.

Answer

Mean is twice the median.

Answer

Median equals variance.

Answer

Standard deviation equals twice the variance.

Question 6

In a population of 1 lakh, there are 4000 live births per annum and the under-5 population is 15000. If there are 280 infant deaths per annum, what is the under-5 mortality rate?

Accepted Answer

26.50%

Answer

40%

Answer

10%

Answer

69%

Question 7

The rate adjusted to allow for the age distribution of the population is:

Accepted Answer

Age standardized mortality rate

Answer

Perinatal mortality rate

Answer

Crude mortality rate

Answer

Fertility rate

Question 8

Which of the following is NOT true about Kaplan-Meier analysis?

Accepted Answer

It is used as a mortality indicator.

Answer

It is a type of survival analysis.

Answer

It is used to see how patient survival is increasing or benefiting with treatment.

Answer

It is a non-parametric statistic used to estimate the survival function from lifetime data.

Question 9

What does the value calculated as (True Positive) / (True Positive + False Positive) x 100 represent in a diagnostic test?

Accepted Answer

Sensitivity

Answer

Specificity

Answer

Positive Predictive Value (PPV)

Answer

Negative Predictive Value (NPV)

Question 10

Systolic blood pressure of a group of individuals follows a normal distribution curve with a mean of 120. What percentage of values are above the mean?

Accepted Answer

50%

Answer

25%

Answer

75%

Answer

100%

Biostatistics — MCQs

Biostatistics — MCQs

On this page

Practice by Chapter

Want unlimited practice?