Biostatistics Practice Questions

Q: In a population with a standard deviation of 2, what is the confidence limit?

95.40%. ### Explanation This question tests the fundamental understanding of the **Normal Distribution (Gaussian Curve)** and its properties regarding standard deviation (SD) and confidence limits. #### Why 95.40% is Correct In a normal distribution, the area under the curve represents the probability or percentage of observations. The relationship between the mean and standard deviation follows the **Empirical Rule**: * **Mean ± 1 SD** covers approximately **68.3%** of the data. * **Mean ± 2 SD** covers approximately **95.4%** of the data. * **Mean ± 3 SD** covers approximately **99.7%** of the data. Since the question asks for the confidence limit associated with a standard deviation of **2**, the corresponding value is **95.40%**. #### Analysis of Incorrect Options * **A. 68.30%:** This represents the confidence limit for **1 SD**. * **C. 99.70%:** This represents the confidence limit for **3 SD**. * **D. 76.20%:** This is a distractor and does not correspond to standard integer SD values in a normal distribution. #### NEET-PG High-Yield Pearls 1. **Z-score:** The number of standard deviations a point is from the mean is called the Z-score. For this question, Z = 2. 2. **95% vs. 95.4%:** In medical research, we often use **1.96 SD** to define the **95% Confidence Interval**. However, for exactly **2 SD**, the value is **95.4%**. 3. **Normal Distribution Characteristics:** It is bell-shaped, symmetrical, and the Mean, Median, and Mode all coincide at the center. 4. **Standard Normal Distribution:** A specific normal distribution where the Mean = 0 and SD = 1.

Q: A study investigated the role of herbal tea consumption in the prevention of the common cold. Data was collected on individuals who developed a cold and those who did not, categorized by their herbal tea consumption. The results are presented in the following table: | | Had cold | Did not have a cold | |----------------|----------|---------------------| | Herbal tea consumed | 12 | 34 | | Herbal tea not consumed | 23 | 38 | Which of the following analytical tests is appropriate for analyzing this data?

Chi square test. ### Explanation **1. Why Chi-square Test is Correct:** The Chi-square ($\chi^2$) test is the statistical test of choice when comparing **proportions** or determining the association between two **categorical (qualitative) variables**. In this study, both the exposure (Herbal tea: Yes/No) and the outcome (Cold: Yes/No) are nominal/categorical data. The data is presented in a **2x2 contingency table**. Since we are comparing the frequency of cold occurrence between two independent groups to see if a significant association exists, the Chi-square test is the most appropriate tool. **2. Why Other Options are Incorrect:** * **Z-test (Option A):** While a Z-test can compare two proportions, it is typically reserved for very large sample sizes (usually $n > 30$ per group). In clinical research involving contingency tables, Chi-square is the standard. * **Student’s t-test (Unpaired) (Option D):** This test is used to compare the **means** of continuous (quantitative) data between two independent groups (e.g., comparing mean blood pressure between males and females). It cannot be used for categorical frequencies. * **Student’s t-test (Paired) (Option C):** This is used to compare the **means** of two sets of observations from the same group (e.g., "before and after" treatment measurements). **3. NEET-PG High-Yield Pearls:** * **Qualitative + Qualitative data:** Use Chi-square test (or Fischer’s Exact test if any cell value is 2 groups):** Use ANOVA (Analysis of Variance). * **Quantitative + Quantitative:** Use Correlation/Regression. * **Non-parametric alternative to Unpaired t-test:** Mann-Whitney U test.

Q: A drug trial shows 60% improvement in the test group compared to 40% improvement in a standard group. What is the most appropriate statistical test to determine the significance of this result?

Chi-square test. **Explanation** The core of this question lies in identifying the **type of data** being analyzed. In this drug trial, the outcome is "improvement," which is expressed as a percentage (60% vs. 40%). This represents **Qualitative (Categorical) Data**, where patients are classified into two categories: "Improved" or "Not Improved." **1. Why Chi-square test is correct:** The Chi-square ($\chi^2$) test is the standard non-parametric test used to compare the **proportions or frequencies** of two or more independent groups. Since we are comparing the proportion of improvement in the test group versus the standard group, the Chi-square test is the most appropriate tool to determine if the observed difference is statistically significant or due to chance. **2. Why other options are incorrect:** * **Student’s T-test:** Used to compare the **means** of two independent groups (e.g., comparing mean blood pressure levels). It requires Quantitative (Numerical) data. * **Paired T-test:** Used for Quantitative data in **dependent** samples, such as "before and after" studies on the same group of individuals. * **Test for variance (F-test):** Used to compare the spread or distribution of data between two groups, rather than comparing their proportions or means. **Clinical Pearls for NEET-PG:** * **Qualitative Data (Proportions/Percentages) →** Use Chi-square test or Z-test for proportions. * **Quantitative Data (Means) →** Use T-test (for 2 groups) or ANOVA (for >2 groups). * **Small Samples:** If any cell frequency in a 2x2 table is less than 5, use **Fisher’s Exact Test** instead of Chi-square. * **Memory Aid:** **C**hi-square is for **C**ategorical data.

Q: Which statistical test is used to determine the association between two variables?

Correlation. **Explanation:** In biostatistics, the primary goal of **Correlation** is to determine the strength and direction of a linear relationship (association) between two continuous variables. It is quantified by the correlation coefficient (r), which ranges from -1 to +1. A value near zero indicates no association, while values near ±1 indicate a strong association. **Why the other options are incorrect:** * **Chi-squared test ($\chi^2$):** This test is used to compare proportions or to determine the association between two **categorical** (qualitative) variables (e.g., smoking status and presence of lung cancer). While it tests for association, "Correlation" is the more specific term used for the general degree of relationship between variables in this context. * **Regression:** While related to correlation, regression is used to **predict** the value of a dependent variable based on the value of an independent variable. It establishes a functional relationship (cause-effect) rather than just a simple association. * **None of the above:** Incorrect, as Correlation is the standard statistical tool for measuring association. **High-Yield Clinical Pearls for NEET-PG:** * **Correlation (r):** Measures *association*. It does not imply causation. * **Coefficient of Determination ($r^2$):** Represents the proportion of variance in one variable that is predictable from the other. * **Scatter Diagram:** The best visual method to represent the correlation between two numerical variables. * **P-value:** Used to determine if the observed association is statistically significant or due to chance.

Q: The mean of the plasma volumes of 25 patients is 12.5 litres. The standard deviation is 0.25. Calculate the standard error.

0.05. ### **Explanation** The core concept tested here is the **Standard Error of Mean (SEM)**, which measures the dispersion of sample means around the population mean. It indicates how much the mean of a single sample is likely to vary from the true population mean. **1. Why Option A is Correct:** The formula for Standard Error (SE) is: $$\text{SE} = \frac{\text{Standard Deviation (SD)}}{\sqrt{n}}$$ Where: * **SD** = 0.25 * **n (Sample size)** = 25 * **$\sqrt{n}$** = $\sqrt{25}$ = 5 **Calculation:** $$\text{SE} = \frac{0.25}{5} = \mathbf{0.05}$$ Thus, the standard error is 0.05. Note that the "Mean" (12.5) provided in the question is a distractor and is not required for this specific calculation. **2. Why Other Options are Incorrect:** * **Option B (0.5):** This is the result if you divide the SD by $\sqrt{n}$ but misplace the decimal, or if you mistakenly use $n=0.25$ in the denominator. * **Option C (0.01):** This result occurs if you divide the SD by $n$ (0.25/25) instead of the square root of $n$. * **Option D (0.1):** This is a calculation error often made by incorrectly estimating the square root or decimal division. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **SD vs. SE:** Standard Deviation describes the **variability within a single sample**, while Standard Error describes the **uncertainty of the sample mean** compared to the population. * **Sample Size Impact:** As the sample size ($n$) increases, the Standard Error decreases. This means larger studies provide more precise estimates of the population mean. * **Confidence Intervals (CI):** SE is used to calculate CI. For a 95% CI, the formula is: $\text{Mean} \pm (1.96 \times \text{SE})$. * **Standard Error of Proportion:** If the data is in percentages/proportions, the formula changes to $\sqrt{pq/n}$.

Q: When observations are made before and after the exposure to a factor, what type of statistical test is it?

Paired T-Test. ### Explanation The correct answer is **Paired T-Test**. **1. Why Paired T-Test is Correct:** The Paired T-test (also known as the dependent t-test) is used to compare the means of two related groups. In medical research, this typically involves a **"before-and-after"** scenario where the same set of individuals is measured twice (e.g., blood pressure before and after starting an antihypertensive drug). Because the observations are made on the same subjects, the data points are "paired," and the test evaluates whether the mean difference between these pairs is statistically significant. **2. Why the Other Options are Incorrect:** * **Chi-square test:** This is a non-parametric test used for **qualitative (categorical) data** to compare proportions or associations (e.g., comparing the number of smokers vs. non-smokers in two groups). * **Unpaired T-Test (Independent T-test):** This is used to compare the means of **two independent groups** (e.g., comparing the hemoglobin levels of Group A vs. Group B). * **ANOVA (Analysis of Variance):** This is used when comparing the means of **three or more independent groups**. **3. High-Yield Clinical Pearls for NEET-PG:** * **Data Type:** T-tests and ANOVA are used for **Quantitative (Numerical)** data that follows a normal distribution. * **Sample Size:** T-tests are generally preferred for small sample sizes ($n 30$). * **Memory Aid:** * **P**aired = **P**re and **P**ost (same person). * **U**npaired = **U**nrelated groups. * **Non-parametric alternative:** If the "before-and-after" data is not normally distributed, the **Wilcoxon Signed-Rank Test** is used instead of the Paired T-test.

Q: What is the most common measure of dispersion used in biostatistics?

Standard deviation. **Explanation:** **Why Standard Deviation (SD) is the Correct Answer:** In biostatistics, the **Standard Deviation** is the most frequently used measure of dispersion because it summarizes how much individual observations vary around the arithmetic mean. Unlike variance, SD is expressed in the **same units** as the original data, making it clinically intuitive. It is the essential component for calculating the Standard Error and is fundamental to the "Normal Distribution" curve, where approximately 95% of values fall within Mean ± 2 SD. **Analysis of Incorrect Options:** * **A. Mean:** This is a measure of **Central Tendency**, not dispersion. It represents the average value but tells us nothing about how spread out the data points are. * **B. Range:** While simple to calculate (Maximum value – Minimum value), it is the most unstable measure of dispersion because it only considers two extreme values and ignores the rest of the dataset. * **C. Variance:** Variance is the square of the Standard Deviation. While mathematically important in ANOVA tests, it is less commonly used in descriptive clinical reports because its units are squared (e.g., $mg^2/dl^2$), making it difficult to interpret clinically. **High-Yield Clinical Pearls for NEET-PG:** * **Most sensitive measure of dispersion:** Standard Deviation (uses all observations). * **Best measure of dispersion for skewed data:** Interquartile Range (IQR). * **Coefficient of Variation:** Used to compare the relative dispersion between two series with different units (e.g., comparing height in cm vs. weight in kg). * **Standard Error (SE):** Measures the variation of the sample mean from the true population mean ($SE = SD / \sqrt{n}$).

Question 1

In a population with a standard deviation of 2, what is the confidence limit?

Accepted Answer

95.40%

Answer

68.30%

Answer

99.70%

Answer

76.20%

Question 2

Reliability means:

Accepted Answer

Number of times the same results are obtained on repeated trials

Answer

Number of times variation is seen

Answer

Extent of accuracy

Answer

Level of simplicity

Question 3

A pharmaceutical company develops a new antihypertensive drug. Samples of 24 hypertensive patients, randomly selected from a large population of hypertensive people, are randomly divided into 2 groups of 12. One group is given the new drug over a period of 1 month; the other group is given a placebo according to the same schedule. Neither the patients nor the treating physicians are aware of which patients are in which group. At the end of the month, measurements are made of the patient’s blood pressures. What type of study is this?

Accepted Answer

A randomized controlled clinical trial

Answer

A crossover design

Answer

A single-blind experiment

Answer

A prospective study

Question 4

A study investigated the role of herbal tea consumption in the prevention of the common cold. Data was collected on individuals who developed a cold and those who did not, categorized by their herbal tea consumption. The results are presented in the following table:

|                | Had cold | Did not have a cold |
|----------------|----------|---------------------|
| Herbal tea consumed | 12       | 34                  |
| Herbal tea not consumed | 23       | 38                  |

Which of the following analytical tests is appropriate for analyzing this data?

Accepted Answer

Chi square test

Answer

Z test

Answer

Student's t-test (paired)

Answer

Student's t-test (unpaired)

Question 5

The positive predictive value of a diagnostic test is not dependent upon which of the following factors?

Accepted Answer

Incidence of the disease

Answer

Specificity of the test

Answer

Prevalence of the disease

Answer

Sensitivity of the test

Question 6

A drug trial shows 60% improvement in the test group compared to 40% improvement in a standard group. What is the most appropriate statistical test to determine the significance of this result?

Accepted Answer

Chi-square test

Answer

Student's T-test

Answer

Paired T-test

Answer

Test for variance

Question 7

Which statistical test is used to determine the association between two variables?

Accepted Answer

Correlation

Answer

Chi-squared test (X2)

Answer

Regression

Answer

None of the above

Question 8

The mean of the plasma volumes of 25 patients is 12.5 litres. The standard deviation is 0.25. Calculate the standard error.

Accepted Answer

0.05

Answer

0.5

Answer

0.01

Answer

0.1

Question 9

When observations are made before and after the exposure to a factor, what type of statistical test is it?

Accepted Answer

Paired T-Test

Answer

Chi-square test

Answer

Unpaired T-Test

Answer

ANOVA test

Question 10

What is the most common measure of dispersion used in biostatistics?

Accepted Answer

Standard deviation

Answer

Mean

Answer

Range

Answer

Variance

Biostatistics — MCQs

Biostatistics — MCQs

On this page

Practice by Chapter

Want unlimited practice?