Biostatistics Practice Questions

Q: What is a case-control study?

Retrospective study.

### Explanation

**1. Why Option B is Correct:** A **Case-Control Study** is fundamentally a **retrospective study** because it starts with the effect (disease) and looks backward in time to identify the cause (exposure). In this design, researchers identify individuals with a specific condition (**Cases**) and compare them to individuals without the condition (**Controls**). By reviewing medical records or conducting interviews, they determine the frequency of exposure to a suspected risk factor in both groups. Because the direction of inquiry is "backward" from outcome to exposure, it is classified as retrospective.

**2. Why Other Options are Incorrect:**

* **Option A (Prospective study):** These studies (like Cohort studies) start with a group of exposed and non-exposed individuals and follow them forward in time to see who develops the disease.
* **Option C (Cross-sectional study):** These are "snapshot" studies that measure exposure and outcome simultaneously at a single point in time. They cannot establish a temporal relationship (which came first).

**3. NEET-PG High-Yield Clinical Pearls:**

* **Measure of Association:** The **Odds Ratio (OR)** is the key statistic derived from Case-Control studies. (Remember: *C*ase-Control = *O*dds Ratio).
* **Suitability:** This is the best study design for **rare diseases** (e.g., specific cancers) because you start with the cases already diagnosed.
* **Bias:** These studies are highly prone to **Recall Bias** (patients with the disease are more likely to remember past exposures than healthy controls) and **Selection Bias**.
* **Matching:** This technique is used in case-control studies to eliminate the effects of confounding variables.

Q: If the annual growth rate of a population is 1.5-2%, what number of years will be required to double the population?

35-47 years.

### Explanation

The correct answer is **B. 35-47 years**.

**1. Underlying Concept: The Rule of 70**

In demography and biostatistics, the time required for a population to double is calculated using the **"Rule of 70."** This is a simplified formula derived from the natural logarithm of 2: since $\ln 2 \approx 0.693$, the doubling time is about $69.3 / r$ with $r$ in percent, which is rounded to 70. The formula is:

\[ \text{Doubling Time (T)} = \frac{70}{\text{Annual Growth Rate (r)}} \]

**Calculation for the given range:**

* **At 2% growth rate:** $70 / 2 = 35$ years.
* **At 1.5% growth rate:** $70 / 1.5 \approx 46.7$ (rounded to 47) years.

Therefore, at a growth rate of 1.5–2%, the population will double in approximately **35–47 years**.

**2. Analysis of Incorrect Options**

* **Option A (70-47 years):** This would correspond to a much lower growth rate of 1% to 1.5%.
* **Option C (35-28 years):** This corresponds to a higher growth rate of 2% to 2.5% ($70/2.5 = 28$).
* **Option D (28-23 years):** This corresponds to a very high growth rate of 2.5% to 3% ($70/3 \approx 23.3$).

**3. Clinical Pearls & High-Yield Facts for NEET-PG**

* **Demographic Gap:** The phase in the Demographic Cycle where the death rate falls while the birth rate remains high, leading to rapid population growth (Stage 2).
* **Net Reproduction Rate (NRR):** The goal for population stabilization is an **NRR of 1**. This is achieved when the Total Fertility Rate (TFR) reaches **2.1** (Replacement level fertility).
* **India’s Status:** India is currently in **Stage 3** of the demographic cycle (Late expanding), characterized by a falling birth rate and a low death rate.
* **Vital Statistics:** Always remember that the "Rule of 70" is the standard for doubling time, though some textbooks occasionally use the "Rule of 69" for more precise continuous compounding. For NEET-PG, 70 is the gold standard.
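The Rule of 70 arithmetic above can be checked with a short script. This is a minimal sketch: the function names are illustrative, and the "exact" version assumes growth compounds once per year, which the question does not state.

```python
import math

def doubling_time_rule_of_70(rate_percent: float) -> float:
    """Approximate doubling time in years using the Rule of 70."""
    return 70.0 / rate_percent

def doubling_time_exact(rate_percent: float) -> float:
    """Doubling time in years, assuming annual compounding (an assumption)."""
    return math.log(2) / math.log(1 + rate_percent / 100.0)

# Compare the shortcut with the compounding formula for the two rates in the question.
for rate in (1.5, 2.0):
    approx = doubling_time_rule_of_70(rate)
    exact = doubling_time_exact(rate)
    print(f"{rate}% growth: rule of 70 = {approx:.1f} y, exact = {exact:.1f} y")
```

Both approaches land in the 35–47 year range, which is why the shortcut is adequate for exam purposes.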

Q: Observe the following curves. What will happen to Sensitivity and Specificity if the curve changes from Blue to Red?

Both Sensitivity and Specificity increase.

***Both Sensitivity and Specificity increase***

- When the **ROC curve** shifts from blue to red (higher **AUC**), the diagnostic test becomes inherently better at discriminating between disease and non-disease states.
- This represents an **improvement in test performance**: at any given specificity the red curve achieves a higher true positive rate (**sensitivity**), and at any given sensitivity a higher true negative rate (**specificity**), so a cut-off can be chosen at which both increase.

*Both Sensitivity and Specificity decrease*

- This would occur if the ROC curve moved closer to the **diagonal line of no discrimination** (AUC approaching 0.5).
- A curve shift from blue to red represents **improved performance**, not deterioration of both metrics.

*Sensitivity increases and Specificity decreases*

- This describes moving the **cut-off point** along a single ROC curve toward the upper-right, trading specificity for sensitivity.
- The question shows a **curve shift** (different test performance), not a threshold adjustment on the same curve.

*Sensitivity decreases and Specificity increases*

- This describes moving the **cut-off point** along a single ROC curve toward the lower-left, trading sensitivity for specificity.
- Again, this represents **threshold adjustment** rather than the fundamental improvement in test discrimination shown by the curve shift.
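The difference between moving a cut-off along one curve and switching to a better test can be illustrated numerically. The sketch below uses made-up test scores (they are not taken from the figure) and assumes a score at or above the cut-off counts as a positive result.

```python
def sens_spec(diseased, healthy, cutoff):
    """Return (sensitivity, specificity) when scores >= cutoff are called positive."""
    true_pos = sum(score >= cutoff for score in diseased)
    true_neg = sum(score < cutoff for score in healthy)
    return true_pos / len(diseased), true_neg / len(healthy)

# Hypothetical scores for a weaker test: the two groups overlap.
diseased = [4, 5, 6, 7, 8]
healthy = [1, 2, 3, 4, 5]

# Moving the cut-off on the SAME test trades one metric for the other.
for cutoff in (3, 5, 7):
    print(cutoff, sens_spec(diseased, healthy, cutoff))

# Hypothetical scores for a better test: the groups no longer overlap,
# so a cut-off exists where both metrics are higher than before.
diseased_better = [6, 7, 8, 9, 10]
print(sens_spec(diseased_better, healthy, 6))
```

With the overlapping scores, raising the cut-off lowers sensitivity while raising specificity; only the better-separated scores allow both to rise together.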

Q: A study was conducted to analyze the degrees of freedom for a dataset. The data points for 'Material Location' were recorded as (X, Y) coordinates: Glass (8, 23), Cupboard (56, 3), and Metal (1, 14). What is the calculated degree of freedom for this dataset?

2.

### Explanation

**1. Why the Correct Answer is Right:** In biostatistics, **Degrees of Freedom (df)** refers to the number of independent values or quantities which can be assigned to a statistical distribution. For a simple dataset consisting of a single sample of size '$n$', the formula is:

**$df = n - 1$**

In this study, the dataset consists of three distinct categories/locations:

1. Glass
2. Cupboard
3. Metal

Here, $n = 3$. Therefore, $df = 3 - 1 = \mathbf{2}$. The $(X, Y)$ coordinates provided are the specific data values (observations) within those categories, but they do not change the number of independent categories being compared. Once two categories are determined, the third is fixed relative to the total, leaving only 2 "free" to vary.

**2. Why Incorrect Options are Wrong:**

* **Option A (1):** This would be the $df$ if there were only 2 categories (e.g., Case vs. Control).
* **Option C (3):** This represents the total number of observations ($n$). It fails to subtract the one degree of freedom lost when estimating the sample mean.
* **Option D (4):** This is mathematically incorrect for a sample size of 3.

**3. Clinical Pearls & High-Yield Facts for NEET-PG:**

* **Chi-Square Test ($r \times c$ table):** $df = (r - 1) \times (c - 1)$. This is a frequent NEET-PG calculation.
* **Paired t-test:** $df = n - 1$ (where $n$ is the number of pairs).
* **Unpaired t-test:** $df = n_1 + n_2 - 2$.
* **Concept:** $df$ is essentially the "mathematical elbow room." It represents the number of observations minus the number of constraints (parameters being estimated).
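The degrees-of-freedom formulas listed above reduce to simple arithmetic. The helper names below are illustrative only, and the sample sizes in the last two calls are hypothetical.

```python
def df_one_sample(n: int) -> int:
    """Single sample or paired t-test: n observations (or pairs) minus 1."""
    return n - 1

def df_chi_square(rows: int, cols: int) -> int:
    """Chi-square test on an r x c contingency table."""
    return (rows - 1) * (cols - 1)

def df_unpaired_t(n1: int, n2: int) -> int:
    """Unpaired (two-sample) t-test with group sizes n1 and n2."""
    return n1 + n2 - 2

print(df_one_sample(3))       # three categories, as in the question
print(df_chi_square(2, 2))    # a standard 2 x 2 table
print(df_unpaired_t(10, 12))  # hypothetical group sizes
```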

Q: A standard normal distribution has which of the following characteristics?

A mean of 0 and a standard deviation of 1.

### Explanation

In Biostatistics, a **Standard Normal Distribution** (also known as the **Z-distribution**) is a specific type of normal distribution used to standardize different sets of data for comparison.

**1. Why Option B is Correct:** By definition, a standard normal distribution is a normal distribution that has been "standardized" such that the **Mean ($\mu$) is 0** and the **Standard Deviation ($\sigma$) is 1**. This allows any value ($x$) from a normal distribution to be converted into a **Z-score** using the formula: $Z = (x - \mu) / \sigma$. This transformation ensures that the center of the curve sits at zero and the spread is measured in units of 1.

**2. Why the Other Options are Incorrect:**

* **Option A:** While the standard deviation is 1, the mean must be 0. A mean of 1 would shift the entire bell curve to the right.
* **Option C:** This is not a defining characteristic. In a standard normal distribution, the mean (0) is actually *smaller* than the standard deviation (1).
* **Option D:** In a normal distribution, only approximately **68%** of scores fall within one standard deviation ($\pm 1\sigma$) of the mean, not all scores.

**3. NEET-PG High-Yield Clinical Pearls:**

* **Empirical Rule (68-95-99.7 Rule):**
  * $\pm 1\sigma$ covers **68.2%** of the area.
  * $\pm 2\sigma$ covers **95.4%** of the area.
  * $\pm 3\sigma$ covers **99.7%** of the area.
* **Key Properties:** The curve is symmetrical, bell-shaped, and the **Mean = Median = Mode**.
* **Z-score:** Z-scores of $\pm 1.96$ mark the boundaries that enclose the central 95% of the area, which is why 1.96 is used for the 95% confidence interval (two-tailed).
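The Z-score formula and the empirical rule above can be verified with the standard library. This is a sketch under stated assumptions: the function names are illustrative, the example value (130 with mean 100 and SD 15) is hypothetical, and the area calculation uses the identity that the area within $\pm k\sigma$ of a normal distribution equals $\operatorname{erf}(k/\sqrt{2})$.

```python
import math

def z_score(x: float, mu: float, sigma: float) -> float:
    """Standardize x from a normal distribution with mean mu and SD sigma."""
    return (x - mu) / sigma

def area_within(k: float) -> float:
    """Fraction of a normal distribution lying within +/- k standard deviations."""
    return math.erf(k / math.sqrt(2))

# Hypothetical example: a value of 130 when the mean is 100 and the SD is 15.
print(z_score(130, 100, 15))

# Empirical rule, plus the 1.96 cut-off used for 95% confidence intervals.
for k in (1, 2, 3, 1.96):
    print(f"within +/-{k} SD: {area_within(k) * 100:.1f}%")
```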

Q: In a study, 30 out of 50 smokers developed lung cancer, and 10 out of 50 non-smokers developed lung cancer. What is the odds ratio?

6.

### Explanation

**1. Why the Correct Answer (C) is Right**

The **Odds Ratio (OR)** is a measure of association used primarily in case-control studies to quantify the relationship between an exposure and an outcome. It is calculated as the ratio of the odds of exposure in cases to the odds of exposure in controls, or more simply, using a 2x2 contingency table:

| | Disease (+) | Disease (-) | Total |
| :--- | :---: | :---: | :---: |
| **Exposed (Smokers)** | 30 (a) | 20 (b) | 50 |
| **Non-exposed (Non-smokers)** | 10 (c) | 40 (d) | 50 |

* **a (Exposed cases):** 30
* **b (Exposed non-cases):** 50 - 30 = 20
* **c (Non-exposed cases):** 10
* **d (Non-exposed non-cases):** 50 - 10 = 40

**Formula:** $OR = \frac{a \times d}{b \times c}$

**Calculation:** $OR = \frac{30 \times 40}{20 \times 10} = \frac{1200}{200} = \mathbf{6}$

An OR of 6 indicates that the odds of developing lung cancer are 6 times higher in smokers compared to non-smokers.

**2. Why Other Options are Wrong**

* **Option A (4):** This is a distractor that results only from a calculation error. Note that the crude ratio of diseased individuals is $30/10 = 3$, not 4.
* **Option B (2.8):** This value is close to the **Relative Risk (RR)**. $RR = \frac{\text{Incidence in exposed}}{\text{Incidence in non-exposed}} = \frac{30/50}{10/50} = \frac{0.6}{0.2} = 3$.
* **Option D (7):** Incorrect calculation; does not correspond to any standard epidemiological measure for this data.

**3. NEET-PG Clinical Pearls**

* **Odds Ratio** is the only measure of association that can be calculated in **Case-Control studies**.
* **Relative Risk (RR)** is calculated in **Cohort studies**.
* When a disease is rare, the OR is a good approximation of the RR.
* **Attributable Risk (AR):** Indicates the amount of disease that can be prevented by removing the exposure. $AR = \frac{I_e - I_u}{I_e} \times 100$.
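The 2x2 table calculations above can be reproduced in a few lines. This is a minimal sketch with illustrative function names; the cell labels a, b, c, d follow the table in the explanation, and the attributable-risk function uses the percentage formula quoted in the pearls.

```python
def odds_ratio(a: int, b: int, c: int, d: int) -> float:
    """Cross-product ratio ad / bc from a 2x2 table."""
    return (a * d) / (b * c)

def relative_risk(a: int, b: int, c: int, d: int) -> float:
    """Incidence in exposed divided by incidence in non-exposed."""
    return (a * (c + d)) / (c * (a + b))

def attributable_risk_percent(a: int, b: int, c: int, d: int) -> float:
    """(Ie - Iu) / Ie x 100, using incidence in exposed (Ie) and unexposed (Iu)."""
    incidence_exposed = a / (a + b)
    incidence_unexposed = c / (c + d)
    return (incidence_exposed - incidence_unexposed) / incidence_exposed * 100

# Cells from the question: 30/50 smokers and 10/50 non-smokers developed lung cancer.
a, b, c, d = 30, 20, 10, 40
print("OR:", odds_ratio(a, b, c, d))
print("RR:", relative_risk(a, b, c, d))
print("AR%:", round(attributable_risk_percent(a, b, c, d), 1))
```

The gap between the OR (6) and the RR (3) here reflects that the outcome is common in this dataset; the two converge only when the disease is rare.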

Q: Which of the following is true regarding case-control studies?

The odds ratio can be calculated.

In epidemiology, a **Case-Control Study** is an observational, analytical study used to identify the association between an exposure and an outcome.

### Why the Correct Answer is Right

**Option B (Odds Ratio):** Since case-control studies start with people who already have the disease (cases), we cannot determine the actual risk of developing the disease. Instead, we calculate the **Odds Ratio (OR)**, which is the ratio of the odds of exposure among cases to the odds of exposure among controls. It serves as an estimate of the relative risk.

### Why Other Options are Wrong

* **Option A:** The statement is true of the design: a case-control study looks backward from effect (disease) to cause (exposure), which is why it is described as **retrospective**. It is not the accepted answer here because Option B is the definitive statistical hallmark of this study design.
* **Option C:** **Incidence cannot be calculated** in case-control studies because the denominator (population at risk) is unknown. Incidence can only be calculated in **Cohort Studies**.
* **Option D:** Case-control studies are ideal for **rare diseases** and typically require a **smaller sample size** compared to cohort studies, making them inexpensive and quick to conduct.

### High-Yield NEET-PG Pearls

* **Direction:** Backward (Effect $\rightarrow$ Cause).
* **Measure of Association:** Odds Ratio ($ad/bc$).
* **Best for:** Rare diseases or diseases with long latency periods (e.g., Cancer).
* **Main Bias:** **Recall Bias** (cases remember past exposures more vividly than controls).
* **Matching:** Done to eliminate the effects of **confounding variables**.

Question 1

Regarding the chi-square test, which of the following statements is true?

Accepted Answer

It measures the significance of the difference between two proportions.

Answer

The null hypothesis states that there is no difference.

Answer

It does not test for significance.

Answer

It tests for correlation and regression.

Question 2

What is a case-control study?

Accepted Answer

Retrospective study

Answer

Prospective study

Answer

Cross-sectional study

Answer

None of the above

Question 3

If the annual growth rate of a population is 1.5-2%, what number of years will be required to double the population?

Accepted Answer

35-47 years

Answer

70-47 years

Answer

35-28 years

Answer

28-23 years

Question 4

Observe the following curves. What will happen to Sensitivity and Specificity if the curve changes from Blue to Red?

Accepted Answer

Both Sensitivity and Specificity increase

Answer

Both Sensitivity and Specificity decrease

Answer

Sensitivity increases and Specificity decreases

Answer

Sensitivity decreases and Specificity increases

Question 5

A study was conducted to analyze the degrees of freedom for a dataset. The data points for 'Material Location' were recorded as (X, Y) coordinates: Glass (8, 23), Cupboard (56, 3), and Metal (1, 14). What is the calculated degree of freedom for this dataset?

Accepted Answer

2

Answer

1

Answer

3

Answer

4

Question 6

Which statistical test is used to compare Kaplan-Meier survival curves?

Accepted Answer

Log rank test

Answer

T-test

Answer

Chi-square test

Answer

Wilcoxon rank-sum test

Question 7

A standard normal distribution has which of the following characteristics?

Accepted Answer

A mean of 0 and a standard deviation of 1

Answer

A mean of 1 and a standard deviation of 1

Answer

A mean larger than its standard deviation

Answer

All scores within one standard deviation of the mean

Question 8

What is the best method to remove confounding?

Accepted Answer

Stratified randomization

Answer

Randomization

Answer

Restriction

Answer

Multivariate analysis

Question 9

In a study, 30 out of 50 smokers developed lung cancer, and 10 out of 50 non-smokers developed lung cancer. What is the odds ratio?

Accepted Answer

6

Answer

4

Answer

2.8

Answer

7

Question 10

Which of the following is true regarding case-control studies?

Accepted Answer

The odds ratio can be calculated.

Answer

It proceeds from effect to cause.

Answer

Incidence can be calculated.

Answer

It requires a large number of patients.
