Biostatistics Practice Questions

Q: All of the following are tests for the degree of closeness of a measured or calculated quantity to its actual or true value, except:

Range chart. In biostatistics and quality control, it is crucial to distinguish between **Accuracy** and **Precision**. **1. Why "Range Chart" is the correct answer:** The question asks for tests that measure the closeness to the "true value," which is the definition of **Accuracy**. A **Range Chart (R-chart)** measures the difference between the highest and lowest values in a sample. It is a measure of **dispersion or variability**, which relates to **Precision** (the consistency of results), not accuracy. It does not indicate how close the results are to the actual target value. **2. Explanation of incorrect options:** * **Mean Chart ($\bar{X}$ chart):** This tracks the average of samples over time. Since the mean is compared against the target/true value, it is a primary tool for monitoring **Accuracy**. * **Levey-Jennings (LJ) Chart:** Widely used in clinical laboratories, this chart plots control data against established mean and standard deviation limits. It is used to detect both systematic errors (loss of accuracy) and random errors. * **Shewhart Control Chart:** This is the umbrella term for quality control charts (including Mean and Range charts). In medical statistics, they are used to ensure a process remains within "statistical control" relative to its true intended value. **Clinical Pearls for NEET-PG:** * **Accuracy:** Closeness to the "True Value." (Measured by Mean, LJ charts). * **Precision:** Closeness of repeated measurements to "Each Other." (Measured by Range, Standard Deviation, Coefficient of Variation). * **Systematic Error:** Affects Accuracy (Shift in Mean). * **Random Error:** Affects Precision (Increased Range/SD). * **Westgard Rules:** These are the specific criteria used to interpret LJ charts to decide if an analytical run is in control.

Q: What value of the correlation coefficient signifies a strong correlation?

1. **Explanation:** The **Correlation Coefficient (Pearson’s ‘r’)** is a statistical measure used to quantify the strength and direction of a linear relationship between two continuous variables (e.g., the relationship between BMI and Blood Pressure). **1. Why Option B is Correct:** The value of 'r' ranges strictly from **-1 to +1**. * A value of **1** (or -1) signifies a **perfect/strongest possible correlation**, where all data points lie exactly on a straight line. * As the value approaches 1, the strength of the relationship increases. In medical research, values >0.7 are generally considered "strong." **2. Why Other Options are Incorrect:** * **Option A (Zero):** A correlation coefficient of 0 indicates **no linear relationship** between the variables. * **Option C (Less than 1):** While values like 0.8 or 0.9 are strong, they are mathematically "weaker" than 1. Additionally, this range includes values near zero (e.g., 0.1), which represent very weak correlations. * **Option D (More than 1):** This is statistically **impossible**. The correlation coefficient cannot exceed +1 or be less than -1. **Clinical Pearls & High-Yield Facts for NEET-PG:** * **Directionality:** A positive sign (+) means both variables move in the same direction; a negative sign (-) means they move in opposite directions (e.g., Exercise vs. Resting Heart Rate). * **Coefficient of Determination ($r^2$):** This represents the proportion of variance in one variable explained by the other. If $r = 0.7$, then $r^2 = 0.49$ (49% of the variance is explained). * **Scatter Diagram:** This is the visual method used to represent correlation. A straight line at 45° indicates $r = 1$. * **Limitation:** Correlation does **not** imply causation.

Q: The response which is graded by an observer on an agree or disagree continuum is based on which type of scale?

Likert Scale. ### Explanation **1. Why Likert Scale is Correct:** The **Likert Scale** is a psychometric scale commonly used in health research to measure attitudes, beliefs, or opinions. It typically presents a statement and asks the respondent to choose from a continuum of fixed responses, most commonly a 5-point or 7-point scale (e.g., *Strongly Disagree, Disagree, Neutral, Agree, Strongly Agree*). It is the gold standard for assessing responses on an "agree-disagree" continuum. **2. Why Other Options are Incorrect:** * **Visual Analog Scale (VAS):** This is a continuous scale (usually a 10 cm line) where the patient marks a point representing their state (e.g., pain intensity from "no pain" to "worst pain"). It does not use discrete "agree/disagree" categories. * **Guttman Scale (Cumulative Scale):** This scale consists of a series of statements arranged in a hierarchical order. If a respondent agrees with a higher-intensity statement, it is assumed they agree with all lower-intensity statements preceding it. * **Adjective Scale:** This uses a list of adjectives (e.g., "happy," "tired," "anxious") to describe a state, rather than a continuum of agreement with a specific statement. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Types of Data:** Likert scales produce **Ordinal Data** (data with a natural order but where the distance between intervals is not necessarily equal). * **Central Tendency:** For Likert scales, the **Median** or **Mode** is the most appropriate measure of central tendency, not the Mean. * **Qualitative vs. Quantitative:** While the responses are qualitative, they are often assigned numerical values (1-5) for statistical coding. * **Memory Aid:** Remember **L**ikert = **L**evels of agreement.

Q: Which statistical test is used to determine the association between two variables?

Chi-square test. ### Explanation **1. Why Chi-square test is correct:** The **Chi-square ($\chi^2$) test** is a non-parametric test used to determine if there is a significant **association** between two **categorical (qualitative)** variables. In medical research, it is frequently used to compare proportions. For example, determining if the incidence of a disease (Yes/No) is associated with a risk factor like smoking (Yes/No). It tests the "null hypothesis" that there is no relationship between the variables. **2. Why other options are incorrect:** * **Correlation (Option B):** While correlation measures the strength and direction of a relationship between two variables, it is specifically used for **quantitative (numerical)** data (e.g., height and weight). It does not determine "association" in the categorical sense used in contingency tables. * **Regression (Option C):** Regression is used to **predict** the value of a dependent variable based on the value of an independent variable. It quantifies the functional relationship rather than just testing for the presence of an association. * **None of the above (Option D):** Incorrect, as Chi-square is the standard test for association between categorical variables. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Type of Data:** Chi-square = Qualitative/Categorical data; T-test/ANOVA = Quantitative data. * **Yates’ Correction:** Applied to a $2 \times 2$ Chi-square table when any cell frequency is less than 5. * **McNemar’s Test:** A variation of the Chi-square test used for **paired** data (e.g., comparing results in the same patient before and after treatment). * **Null Hypothesis ($H_0$):** Chi-square assumes $H_0$ is true (no association). If $p < 0.05$, we reject $H_0$ and conclude a significant association exists.

Q: For the given distribution of weights of a group of students, which measure of central tendency is most appropriate?

Mean. ***Mean*** - For a **normal/symmetric distribution** like student weights, the mean provides the most **accurate representation** of central tendency as it uses all data points. - In a **bell-shaped distribution**, mean, median, and mode are approximately equal, making mean the **preferred choice** due to its mathematical properties. *Median* - While median is **robust to outliers**, it's not the most appropriate for **symmetric distributions** where extreme values are naturally distributed. - Median is preferred for **skewed distributions** or when data contains significant outliers that could distort the mean. *Mode* - Mode identifies the **most frequently occurring value** but may not represent the central tendency well in continuous data like weights. - Most useful for **categorical data** or discrete variables, less meaningful for **continuous measurements** like body weight. *All of the above* - Though all measures can be calculated, the **mean is specifically most appropriate** for normally distributed continuous data. - Different measures serve different purposes: mean for **normal distributions**, median for **skewed data**, and mode for **categorical variables**.

Q: A study provides the following test results for a disease: Present (40 +ve, 10 -ve), Absent (225 +ve, 225 -ve). What is the sensitivity of this study?

80. ### Explanation **1. Why the correct answer is right:** Sensitivity is the ability of a test to correctly identify those with the disease (True Positives). To calculate sensitivity, we use the formula: **Sensitivity = [True Positives (TP) / (True Positives + False Negatives (FN))] × 100** From the data provided: * **Disease Present:** 40 tested positive (TP), 10 tested negative (FN). Total diseased = 50. * **Disease Absent:** 225 tested positive (FP), 225 tested negative (TN). Total healthy = 450. Applying the formula: Sensitivity = [40 / (40 + 10)] × 100 Sensitivity = [40 / 50] × 100 = **80%**. **2. Why the incorrect options are wrong:** * **Option A (40):** This represents the absolute number of True Positives, not the percentage (sensitivity). * **Option B (20):** This is the False Negative Rate (10/50 × 100 = 20%). Sensitivity and False Negative Rate are complementary (Sensitivity + FNR = 100%). * **Option D (50):** This represents the Specificity of the test [TN / (TN + FP)], calculated as [225 / (225 + 225)] × 100 = 50%. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Sensitivity (True Positive Rate):** Crucial for **screening tests** (e.g., ELISA for HIV) because it minimizes False Negatives. A highly sensitive test, when negative, helps **Rule Out** the disease (**SNOUT**). * **Specificity (True Negative Rate):** Crucial for **confirmatory tests** (e.g., Western Blot) because it minimizes False Positives. A highly specific test, when positive, helps **Rule In** the disease (**SPIN**). * **Predictive Values:** Unlike sensitivity/specificity, Positive and Negative Predictive Values are heavily dependent on the **prevalence** of the disease in the population.

Q: The Sample Registration System (SRS) is a:

Dual record system. **Explanation:** The **Sample Registration System (SRS)** is a large-scale demographic survey in India used to provide reliable annual estimates of birth rate, death rate, and other fertility/mortality indicators. **Why Option A is Correct:** The SRS is fundamentally a **Dual Record System**. It employs two independent methods of data collection to ensure accuracy and minimize under-reporting: 1. **Continuous Enumeration:** A resident part-time enumerator (usually a teacher or Anganwadi worker) records births and deaths as they occur in a specific sample unit. 2. **Retrospective Half-yearly Survey:** An independent supervisor visits the same sample unit every six months to conduct a fresh survey. The data from both sources are then **matched**, and any discrepancies are field-verified to arrive at a final unduplicated count. **Why Other Options are Incorrect:** * **Option B:** While SRS provides annual estimates, it is not a simple "survey conducted every year." It is a continuous longitudinal process combined with biannual surveys. * **Option C:** SRS uses **Stratified Multi-stage Random Sampling**, not quota sampling. Quota sampling is a non-probability sampling method, which would not yield representative national data. * **Option D:** The sample size for SRS is primarily determined based on the **Infant Mortality Rate (IMR)** and crude birth rate, not the Maternal Mortality Ratio (MMR). **High-Yield Pearls for NEET-PG:** * **Gold Standard:** SRS is considered the most reliable source of vital statistics in India (more accurate than the Civil Registration System). * **Nodal Agency:** It is conducted by the **Office of the Registrar General of India (RGI)**. * **Initiation:** It was started on a pilot basis in 1964-65 and became fully operational in 1969-70. * **Key Data:** It is the primary source for calculating **IMR, MMR, and Total Fertility Rate (TFR)** in India.

Q: Trends can be represented by?

Line diagram. **Explanation:** In biostatistics, the choice of a graphical representation depends on the type of data and the objective of the study. **Why Line Diagram is Correct:** A **Line Diagram** (or line graph) is the most appropriate tool for representing **trends over time** (time-series data). It connects individual data points with lines, allowing for the visualization of fluctuations, increases, or decreases in a variable (e.g., maternal mortality rate or disease incidence) over a continuous period. **Analysis of Incorrect Options:** * **Bar Chart:** Used for representing **discrete, qualitative, or nominal data** (e.g., number of hospital beds in different cities). It compares categories rather than showing a continuous trend. * **Histogram:** Used for **continuous quantitative data** to show frequency distribution. Unlike a bar chart, there are no gaps between the bars. It represents the distribution of a single variable (e.g., age distribution of a population), not a trend over time. * **Pie Chart:** Used to show the **proportional distribution** of different components of a whole at a single point in time (e.g., causes of death in a specific year). It does not show changes over time. **High-Yield Clinical Pearls for NEET-PG:** * **Trend Analysis:** Always look for "Line Diagram" or "Run Chart" when the question mentions time, years, or trends. * **Frequency Polygon:** Created by joining the midpoints of the tops of the bars in a histogram; also used for frequency distributions. * **Scatter Diagram:** Used to show the **correlation** (relationship) between two continuous variables. * **Ogive:** A graph representing cumulative frequency.

Question 1

All of the following are tests for the degree of closeness of a measured or calculated quantity to its actual or true value, except:

Accepted Answer

Range chart

Answer

Mean chart

Answer

Levey Jennings (LJ) chart

Answer

Shewhart control chart

Question 2

In what interval is the sample registration system conducted?

Accepted Answer

6 months

Answer

1 year

Answer

2 years

Answer

5 years

Question 3

What is true about a confounding factor?

Accepted Answer

It is itself a risk factor for the disease.

Answer

It is found equally between the study and the control groups.

Answer

Confounding can be eliminated by selecting a small group.

Answer

It is associated with either the exposure or the disease.

Question 4

What value of the correlation coefficient signifies a strong correlation?

Accepted Answer

1

Answer

Zero

Answer

Less than 1

Answer

More than 1

Question 5

The response which is graded by an observer on an agree or disagree continuum is based on which type of scale?

Accepted Answer

Likert Scale

Answer

Visual analog scale

Answer

Guttman Scale

Answer

Adjective scale

Question 6

Which statistical test is used to determine the association between two variables?

Accepted Answer

Chi-square test

Answer

Correlation

Answer

Regression

Answer

None of the above

Question 7

For the given distribution of weights of a group of students, which measure of central tendency is most appropriate?

Accepted Answer

Mean

Answer

Median

Answer

Mode

Answer

All of the above

Question 8

A study provides the following test results for a disease: Present (40 +ve, 10 -ve), Absent (225 +ve, 225 -ve). What is the sensitivity of this study?

Accepted Answer

80

Answer

40

Answer

20

Answer

50

Question 9

The Sample Registration System (SRS) is a:

Accepted Answer

Dual record system

Answer

Survey conducted every year

Answer

System using quota sampling

Answer

System where MMR decides the sample size

Question 10

Trends can be represented by?

Accepted Answer

Line diagram

Answer

Bar chart

Answer

Histogram

Answer

Pie chart

Biostatistics — MCQs

Biostatistics — MCQs

On this page

Practice by Chapter

Want unlimited practice?