Biostatistics Practice Questions

Q: What is mean deviation?

A measure of dispersion. **Explanation:** **Mean Deviation** is a statistical tool used to quantify the spread or variability of data points around a central value (usually the mean). In Biostatistics, it is defined as the arithmetic average of the absolute differences (ignoring plus or minus signs) between each observation and the mean. **Why Option A is correct:** In medical research, we need to know how much individual values (e.g., blood pressure readings in a population) vary from the average. **Measures of dispersion** describe this "scatter." Mean deviation, along with Range, Variance, and Standard Deviation, falls into this category because it measures the extent to which data points are dispersed around the center. **Why other options are incorrect:** * **B. A ratio:** A ratio expresses the relationship between two independent quantities (e.g., Maternal Mortality Ratio). Mean deviation is an absolute measure of spread, not a comparison of two distinct groups. * **C. A range:** Range is the simplest measure of dispersion, calculated as the difference between the maximum and minimum values. Mean deviation is more complex as it involves all observations in the dataset. * **D. An average:** While the *calculation* involves taking an average of deviations, the term "average" usually refers to measures of central tendency (Mean, Median, Mode). Mean deviation is a measure of *variability*, not the center. **High-Yield Clinical Pearls for NEET-PG:** * **Standard Deviation (SD):** The most commonly used measure of dispersion in medical literature. It is the square root of the variance. * **Coefficient of Variation (CV):** Used to compare the relative dispersion of two sets of data with different units (e.g., comparing height in cm vs. weight in kg). * **Normal Distribution:** In a normal curve, Mean ± 1 SD covers **68%** of values, Mean ± 2 SD covers **95%**, and Mean ± 3 SD covers **99.7%**.

Q: Which of the following is NOT a measure of central tendency?

Range. **Explanation:** In biostatistics, data is summarized using two primary types of descriptive statistics: **Measures of Central Tendency** and **Measures of Dispersion**. **Why "Range" is the correct answer:** The **Range** is a **Measure of Dispersion** (variability). It represents the simplest way to measure the spread of data by calculating the difference between the highest and lowest values in a distribution. It does not describe the "center" or "typical" value of a dataset, but rather how scattered the data points are. **Why the other options are incorrect:** * **Mean (Arithmetic Average):** The most common measure of central tendency. it is calculated by summing all observations and dividing by the total number. It is sensitive to extreme values (outliers). * **Median (Positional Average):** The middle-most value when data is arranged in ascending or descending order. It is the best measure of central tendency for skewed distributions as it is not affected by outliers. * **Mode (Nominal Average):** The value that occurs most frequently in a dataset. A distribution can be unimodal, bimodal, or multimodal. **NEET-PG High-Yield Pearls:** 1. **Relationship in Normal Distribution:** Mean = Median = Mode (Symmetrical Bell Curve). 2. **Skewed Distributions:** * **Positively Skewed:** Mean > Median > Mode (Tail to the right). * **Negatively Skewed:** Mode > Median > Mean (Tail to the left). 3. **Best Measure:** Median is preferred for skewed data (e.g., survival time, incubation periods); Mean is preferred for normally distributed data (e.g., height, BP). 4. **Other Measures of Dispersion:** Standard Deviation (most common), Variance, and Interquartile Range.

Q: When a scale is described as good, satisfactory, and poor, what type of measurement scale is being used?

Ordinal. ### **Explanation** **1. Why Ordinal is Correct:** The measurement scale used here is **Ordinal** because the data categories (Good, Satisfactory, Poor) follow a **natural order or rank**. In an ordinal scale, the relative position of the categories is known (e.g., Good is better than Satisfactory), but the exact numerical difference between the ranks is not defined. In clinical practice, this is commonly seen in grading the severity of a disease or the quality of a patient's recovery. **2. Why Other Options are Incorrect:** * **Nominal:** This scale is used for naming or labeling categories without any inherent order (e.g., Gender: Male/Female; Blood Groups: A, B, AB, O). Since "Good" is clearly superior to "Poor," it cannot be nominal. * **Interval:** This scale has a defined order and equal intervals between values, but **no true zero point** (e.g., Temperature in Celsius). We cannot say that the "distance" between Good and Satisfactory is mathematically equal to the distance between Satisfactory and Poor. * **Ratio:** This is the highest level of measurement. It has all the properties of an interval scale plus a **true absolute zero** (e.g., Height, Weight, Blood Pressure). Qualitative descriptors like "Good" cannot have a mathematical zero. --- ### **High-Yield Clinical Pearls for NEET-PG** * **Mnemonic for Scales (Lowest to Highest Complexity):** **NOIR** (**N**ominal < **O**rdinal < **I**nterval < **R**atio). * **Qualitative Data:** Includes Nominal and Ordinal scales. * **Quantitative Data:** Includes Interval and Ratio scales. * **Common Ordinal Examples in Exams:** * Cancer Staging (Stage I, II, III, IV) * Glasgow Coma Scale (GCS) score * Likert Scales (Strongly Agree to Strongly Disagree) * Pain Scales (Mild, Moderate, Severe) * **Statistical Note:** For Ordinal data, the **Median** is the most appropriate measure of central tendency.

Q: What is the median of the following set of values: 2, 5, 7, 10, 10, 13, 25?

10. **Explanation:** The **Median** is a measure of central tendency that represents the middle-most value in a data set when the observations are arranged in ascending or descending order. **Why Option A is Correct:** To calculate the median, follow these steps: 1. **Arrange the data:** The set is already in ascending order: 2, 5, 7, 10, 10, 13, 25. 2. **Count the observations (n):** Here, $n = 7$ (an odd number). 3. **Apply the formula:** For an odd number of observations, the Median is the $(\frac{n+1}{2})^{th}$ value. * Calculation: $(\frac{7+1}{2}) = 4^{th}$ value. 4. **Identify the value:** The 4th value in the sequence is **10**. **Why Incorrect Options are Wrong:** * **Option B (13):** This is the 6th value in the set. It would only be the median if the data set were much larger or differently distributed. * **Option C (25):** This is the maximum value (range limit), not the central value. * **Option D (5):** This is the 2nd value. Selecting this suggests a calculation error or failing to count to the center of the set. **High-Yield Clinical Pearls for NEET-PG:** * **Robustness:** Unlike the Mean, the Median is **not affected by extreme values (outliers)**. In this set, even if 25 were replaced by 250, the median would remain 10. * **Skewed Data:** The Median is the preferred measure of central tendency for **skewed distributions** (e.g., incubation periods, survival time, or income). * **Even Datasets:** If $n$ is even, the median is the average of the two middle-most values. * **Relationship:** In a perfectly symmetrical (Normal) distribution, **Mean = Median = Mode**.

Q: Which of the following is NOT a characteristic of a case-control study?

Incidence rates may be computed directly.. ### Explanation **Why the correct answer is right:** In a **Case-Control Study**, the investigator starts with the "effect" (disease) and looks backward to find the "cause" (exposure). Because the study begins with a fixed number of diseased individuals rather than a population at risk followed over time, there is no **denominator** to calculate the number of new cases occurring in a population. Therefore, **Incidence Rates cannot be computed directly**. Incidence can only be calculated in longitudinal studies like **Cohort Studies** or Randomized Controlled Trials (RCTs). **Analysis of incorrect options:** * **Option A:** This is the fundamental definition of a case-control study. It is a retrospective design where cases (diseased) are compared with controls (non-diseased) regarding past exposure. * **Option B:** Since the study relies on participants' memories or old records to determine exposure, it is highly susceptible to **Recall Bias**, making this a true characteristic. * **Option C:** Defining a "case" can be challenging. Researchers must establish strict diagnostic criteria (e.g., histological vs. clinical diagnosis) and decide whether to use incident (new) or prevalent (existing) cases to avoid selection bias. **High-Yield Clinical Pearls for NEET-PG:** * **Measure of Association:** The primary measure in a Case-Control study is the **Odds Ratio (OR)**, which is an estimation of Relative Risk. * **Direction:** It is a **Retrospective** study (Proceeds from Effect to Cause). * **Suitability:** It is the best design for studying **rare diseases** or diseases with long latency periods. * **Nesting:** A "Nested Case-Control Study" is one conducted within a large cohort study, which helps minimize selection and information bias.

Q: What is the odds ratio?

ad/bc. ### Explanation The **Odds Ratio (OR)** is a measure of association used primarily in **Case-Control studies** to quantify the relationship between an exposure and an outcome. It represents the ratio of the odds of exposure among the cases to the odds of exposure among the controls. To calculate the OR, we use a standard **2x2 Contingency Table**: | | Disease (+) (Cases) | Disease (-) (Controls) | | :--- | :---: | :---: | | **Exposed (+)** | **a** | **b** | | **Non-Exposed (-)** | **c** | **d** | 1. **Odds of exposure in cases:** $a / c$ 2. **Odds of exposure in controls:** $b / d$ 3. **Odds Ratio:** $(a/c) \div (b/d) = \mathbf{ad/bc}$ #### Analysis of Options: * **D (ad/bc): Correct.** This is the "cross-product ratio," derived by multiplying the diagonal cells (exposed cases × non-exposed controls) and dividing by the product of the other diagonal (exposed controls × non-exposed cases). * **A, B, and C:** These are incorrect mathematical arrangements of the 2x2 table cells that do not represent any standard epidemiological measure of association. #### NEET-PG High-Yield Pearls: * **Study Design:** OR is the hallmark of **Case-Control studies**. It is used because the incidence of disease cannot be calculated in these studies (as the researcher determines the number of cases). * **Interpretation:** * **OR > 1:** Positive association (Risk factor). * **OR = 1:** No association. * **OR < 1:** Negative association (Protective factor). * **Rare Disease Assumption:** When a disease is rare (incidence < 5%), the Odds Ratio becomes a good estimate of the **Relative Risk (RR)**. * **Cross-sectional studies:** OR can also be used here, but it is then termed the "Prevalence Odds Ratio."

Q: Which type of variable best describes the number of family members?

Discrete variable. **Explanation:** In Biostatistics, variables are classified based on the nature of the data they represent. The **number of family members** is a classic example of a **Discrete Variable**. **1. Why Discrete Variable is correct:** A discrete variable is a type of quantitative (numerical) variable that can only take on specific, whole-number values. These values are obtained by **counting**. Since you cannot have 4.5 or 5.2 family members—only 4, 5, or 6—the data exists in distinct, separate units with no possible values in between. **2. Why other options are incorrect:** * **Continuous Variable:** These are numerical variables that can take any value within a range, including decimals and fractions. They are typically obtained by **measuring**. Examples include height (165.5 cm), weight (70.2 kg), or blood pressure. * **Qualitative / Categorical Variable:** These describe a quality or attribute rather than a numerical quantity. They are expressed in words or categories. Examples include gender (Male/Female), blood group (A, B, AB, O), or socioeconomic status. While "number of family members" can be grouped into categories (e.g., Small vs. Large), the number itself is inherently quantitative. **Clinical Pearls for NEET-PG:** * **Memory Aid:** **D**iscrete = **D**isconnected (whole numbers); **C**ontinuous = **C**onnected (decimals possible). * **Scales of Measurement:** Discrete and Continuous variables fall under **Interval** or **Ratio** scales. * **High-Yield Example:** Number of hospital beds or number of cases of a disease are **Discrete**; Hemoglobin levels or Serum Creatinine are **Continuous**.

Q: Perinatal mortality is expressed in terms of?

Deaths/1000 live births. **Explanation:** The **Perinatal Mortality Rate (PMR)** is a key indicator of the quality of antenatal, natal, and postnatal care. It encompasses late fetal deaths (stillbirths) and early neonatal deaths. **1. Why Option B is Correct:** According to the **World Health Organization (WHO)** and the **National Health Mission (NHM)** in India, the Perinatal Mortality Rate is calculated as the number of perinatal deaths (late fetal deaths after 28 weeks of gestation + first-week neonatal deaths) per **1,000 live births**. While some older definitions used "total births" (live births + stillbirths) as the denominator, the standardized reporting for national health statistics in India (SRS) and many international bodies uses **1,000 live births** to ensure consistency with other mortality indicators like IMR and NMR. **2. Why Other Options are Incorrect:** * **Option A:** While "total births" is used in the *theoretical* definition of PMR to include stillbirths in the denominator, most standardized competitive exams and the SRS (Sample Registration System) in India prioritize **live births** as the denominator for ease of calculation and comparison. * **Options C & D:** These are mathematically incorrect. Mortality rates in maternal and child health are typically expressed per 1,000 (IMR, NMR, PMR) or per 100,000 (MMR). A denominator of 10,000 is not standard for these indicators. **High-Yield Clinical Pearls for NEET-PG:** * **Components of PMR:** Late Fetal Deaths (Stillbirths >28 weeks) + Early Neonatal Deaths (0-7 days of life). * **Best Indicator:** PMR is considered the best indicator of **obstetric care** and maternal health status. * **Denominator Rule:** * **IMR, NMR, PMR:** Per 1,000 live births. * **MMR (Maternal Mortality Ratio):** Per 100,000 live births. * **Stillbirth Definition:** Fetal death occurring after 28 weeks of gestation (weight >1000g).

Q: All of the following are non-parametric tests except?

Student t-test. **Explanation:** The core of this question lies in distinguishing between **Parametric** and **Non-parametric** statistical tests. **1. Why Student’s t-test is the correct answer:** The **Student’s t-test** is a **Parametric test**. Parametric tests are used when the data follows a **Normal (Gaussian) Distribution** and the variables are measured on an interval or ratio scale (quantitative data). The t-test specifically compares the means of two groups (e.g., comparing the mean hemoglobin levels between two groups of pregnant women). **2. Why the other options are incorrect (Non-parametric tests):** Non-parametric tests (Distribution-free tests) are used when the data is non-normal, skewed, or qualitative (nominal/ordinal). * **Chi-square test (Option A):** Used to compare proportions and test the association between two categorical variables (e.g., smoking status and lung cancer). * **Sign test (Option B):** A non-parametric alternative to the paired t-test, used to compare paired observations based on the direction of the difference. * **Fisher’s exact test (Option C):** Used instead of the Chi-square test for categorical data when the sample size is very small (expected frequency in any cell is <5). **High-Yield Clinical Pearls for NEET-PG:** * **Memory Aid:** If the test name contains "Mean" or "Standard Deviation," it is likely Parametric (e.g., Z-test, t-test, ANOVA). * **ANOVA (F-test):** Used to compare means of **three or more** groups. * **Wilcoxon Rank Sum / Mann-Whitney U test:** The non-parametric equivalent of the unpaired t-test. * **Kruskal-Wallis test:** The non-parametric equivalent of ANOVA. * **Correlation:** Pearson’s (Parametric) vs. Spearman’s (Non-parametric).

Question 1

What is mean deviation?

Accepted Answer

A measure of dispersion

Answer

A ratio

Answer

A range

Answer

An average

Question 2

Which of the following is NOT a measure of central tendency?

Accepted Answer

Range

Answer

Mean

Answer

Median

Answer

Mode

Question 3

When a scale is described as good, satisfactory, and poor, what type of measurement scale is being used?

Accepted Answer

Ordinal

Answer

Nominal

Answer

Interval

Answer

Ratio

Question 4

What is the median of the following set of values: 2, 5, 7, 10, 10, 13, 25?

Accepted Answer

10

Answer

13

Answer

25

Answer

5

Question 5

All of the following statements regarding frequency distributions are true, except:

Accepted Answer

A bimodal frequency distribution has two tails and one peak.

Answer

Frequency distributions are usually illustrated by histograms.

Answer

Frequencies are commonly illustrated by bar charts in qualitative data.

Answer

Line charts are useful when comparing two or more frequency distributions by drawing them on the same diagram.

Question 6

Which of the following is NOT a characteristic of a case-control study?

Accepted Answer

Incidence rates may be computed directly.

Answer

Cases with the disease are compared to controls without the disease.

Answer

Assessment of past exposure may be biased.

Answer

Definition of cases may be difficult.

Question 7

What is the odds ratio?

Accepted Answer

ad/bc

Answer

ab/cd

Answer

ac/bd

Answer

bd/ac

Question 8

Which type of variable best describes the number of family members?

Accepted Answer

Discrete variable

Answer

Qualitative variable

Answer

Continuous variable

Answer

Categorical variable

Question 9

Perinatal mortality is expressed in terms of?

Accepted Answer

Deaths/1000 live births

Answer

Deaths/1000 total births

Answer

Deaths/10000 total births

Answer

Deaths/10000 live births

Question 10

All of the following are non-parametric tests except?

Accepted Answer

Student t-test

Answer

Chi-square test

Answer

Sign test

Answer

Fisher exact test

Biostatistics — MCQs

Biostatistics — MCQs

On this page

Practice by Chapter

Want unlimited practice?