Biostatistics Practice Questions

Q: Which of the following is the national level system that provides annual national and state-level reliable estimates of fertility and mortality?

Sample Registration System.

**Explanation:** The **Sample Registration System (SRS)** is the correct answer because it is the primary source of continuous, reliable national and state-level estimates of fertility (birth rate) and mortality (death rate, IMR, MMR) in India.

1. **Why SRS is correct:**
   * **Dual Record System:** SRS employs a "Dual Record System": continuous enumeration by a resident part-time enumerator, plus an independent half-yearly survey by a supervisor. This cross-check ensures high data reliability.
   * **Frequency:** It provides **annual** estimates, making it the most up-to-date source of vital statistics between decennial censuses.
2. **Why other options are incorrect:**
   * **Census:** Conducted once every **10 years**. While it provides comprehensive demographic data, it does not provide annual estimates of fertility and mortality.
   * **Civil Registration System (CRS):** This is the continuous registration of births and deaths (a legal requirement). However, because of significant under-reporting in many Indian states, it is currently considered **unreliable** for calculating national rates compared with the SRS.
   * **Adhoc Surveys:** These (like NFHS or DLHS) are periodic and thematic. They provide deep insights into maternal and child health but are not the primary national system for annual vital rate estimation.

**High-Yield Facts for NEET-PG:**

* **SRS** is the gold standard for **IMR (Infant Mortality Rate)** and **MMR (Maternal Mortality Ratio)** data in India.
* **SRS** is under the jurisdiction of the **Registrar General of India (RGI)**, Ministry of Home Affairs.
* **Time limit for CRS registration:** Both births and deaths must be registered within **21 days**.
* **Denominator for MMR:** 100,000 live births. By contrast, IMR is expressed per 1,000 live births, and the crude birth and death rates per 1,000 mid-year population.
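The difference in denominators can be illustrated with a short calculation. This is a minimal sketch: the counts are hypothetical and the helper `rate_per` is an illustrative name, not part of any official SRS methodology.

```python
def rate_per(events: int, denominator: int, per: int) -> float:
    """Return events per `per` units of the denominator."""
    # Multiply before dividing so round numbers stay exact in floating point.
    return events * per / denominator

# Hypothetical counts for one year in one region (not real data).
live_births = 200_000
infant_deaths = 6_000
maternal_deaths = 200

imr = rate_per(infant_deaths, live_births, 1_000)      # per 1,000 live births
mmr = rate_per(maternal_deaths, live_births, 100_000)  # per 100,000 live births

print(f"IMR = {imr} per 1,000 live births")
print(f"MMR = {mmr} per 100,000 live births")
```

Maternal deaths are far rarer than infant deaths, so the larger multiplier keeps MMR in a readable range of whole numbers.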

Q: Which test detects true negatives?

Specificity.

### Explanation

**Correct Option: D. Specificity**

**Why it is correct:** Specificity is the ability of a diagnostic test to correctly identify those **without the disease**. It is defined as the proportion of **true negatives** among all healthy individuals (True Negatives / [True Negatives + False Positives]). A highly specific test has a low false-positive rate, meaning that if the test result is positive, you can be highly confident the patient actually has the disease (SP-P-IN: **Sp**ecificity, **P**ositive result, rules **In**).

**Why the other options are incorrect:**

* **A. Relative Risk (RR):** This is a measure of **association** used in cohort studies. It compares the incidence of disease in an exposed group versus an unexposed group. It does not measure test accuracy.
* **B. Odds Ratio (OR):** This is a measure of **association** used primarily in case-control studies. It represents the odds that an outcome will occur given a particular exposure, compared to the odds of the outcome occurring in the absence of that exposure.
* **C. Sensitivity:** This is the ability of a test to correctly identify those **with the disease**. It detects **true positives**. A highly sensitive test is used for screening because a negative result effectively rules out the disease (SN-N-OUT: **S**e**n**sitivity, **N**egative result, rules **Out**).

**High-Yield Clinical Pearls for NEET-PG:**

* **Sensitivity** = $TP / (TP + FN)$ (True Positive Rate)
* **Specificity** = $TN / (TN + FP)$ (True Negative Rate)
* **Screening Tests:** Require high **Sensitivity** to ensure no cases are missed.
* **Confirmatory Tests:** Require high **Specificity** to ensure healthy people aren't misdiagnosed.
* **Predictive Values:** Unlike sensitivity/specificity, Positive Predictive Value (PPV) and Negative Predictive Value (NPV) are heavily influenced by the **prevalence** of the disease in the population.
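The formulas above, and the dependence of PPV on prevalence, can be sketched in a few lines. The 2x2 counts and prevalence values below are hypothetical, and the names (`ConfusionCounts`, `ppv`) are illustrative only.

```python
from dataclasses import dataclass

@dataclass
class ConfusionCounts:
    tp: int  # diseased, test positive
    fp: int  # healthy, test positive
    fn: int  # diseased, test negative
    tn: int  # healthy, test negative

def sensitivity(c: ConfusionCounts) -> float:
    return c.tp / (c.tp + c.fn)

def specificity(c: ConfusionCounts) -> float:
    return c.tn / (c.tn + c.fp)

def ppv(sens: float, spec: float, prevalence: float) -> float:
    """Positive predictive value from sensitivity, specificity and prevalence."""
    true_pos = sens * prevalence
    false_pos = (1 - spec) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)

# Hypothetical validation study: 100 diseased and 200 healthy subjects.
counts = ConfusionCounts(tp=90, fp=20, fn=10, tn=180)
sens = sensitivity(counts)
spec = specificity(counts)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")

# The same test applied at two assumed prevalences.
ppv_low = ppv(sens, spec, prevalence=0.01)
ppv_high = ppv(sens, spec, prevalence=0.30)
print(f"PPV at 1% prevalence:  {ppv_low:.3f}")
print(f"PPV at 30% prevalence: {ppv_high:.3f}")
```

Sensitivity and specificity stay fixed at 0.90 here, yet the PPV rises sharply as prevalence increases, which is why predictive values cannot be carried over from one population to another.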

Q: In a normal distribution curve, which values are the same?

Mode and Mean.

### Explanation

In biostatistics, a **Normal Distribution** (also known as a Gaussian distribution) is characterized by a perfectly symmetrical, bell-shaped curve.

**Why the correct answer is right:** Because the curve is perfectly symmetrical around its center and has a single peak, that peak is at once the most frequent value (**Mode**), the exact middle value (**Median**), and the average of all values (**Mean**). Therefore, in a true normal distribution:

**Mean = Median = Mode**

**Analysis of Incorrect Options:**

* **Options A, B, and D (Standard Deviation):** The Standard Deviation (SD) is a measure of **dispersion** (how spread out the data is), not a measure of central tendency. While the Mean, Median, and Mode define the *location* of the center of the curve, the SD defines the *width* or flatness of the bell. There is no mathematical requirement for the SD to equal the Mean, Median, or Mode.

**High-Yield Facts for NEET-PG:**

1. **Symmetry:** In a normal distribution, the area to the left of the mean is exactly 50%, and the area to the right is 50%.
2. **The 68-95-99.7 Rule (Empirical Rule):**
   * Mean ± 1 SD covers about **68.3%** of the values.
   * Mean ± 2 SD covers about **95.4%** of the values.
   * Mean ± 3 SD covers about **99.7%** of the values.
3. **Skewness:** If the Mean > Median > Mode, the curve is **Positively Skewed** (tail to the right). If the Mode > Median > Mean, it is **Negatively Skewed** (tail to the left).
4. **Standard Normal Distribution:** A special case where the **Mean is 0** and the **Standard Deviation is 1**.
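The empirical-rule percentages can be reproduced from the error function: for a standard normal variable $Z$, $P(|Z| \le k) = \operatorname{erf}(k/\sqrt{2})$. The sketch below uses only Python's standard `math` module; the function name `coverage_within` is illustrative.

```python
import math

def coverage_within(k: float) -> float:
    """Probability that a normal variable lies within k SDs of its mean."""
    return math.erf(k / math.sqrt(2))

for k in (1, 2, 3):
    print(f"mean +/- {k} SD: {coverage_within(k):.4f}")
```

The three printed values are approximately 0.6827, 0.9545, and 0.9973, which is where the "68-95-99.7" shorthand comes from.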

Q: Which of the following is an example of non-random sampling?

Clinical trial sampling.

### Explanation

In biostatistics, sampling techniques are broadly categorized into **Probability (Random)** and **Non-Probability (Non-random)** sampling.

**Why Clinical Trial Sampling is the Correct Answer:** Clinical trials typically utilize **Convenience Sampling** or **Purposive Sampling**, which are non-random methods. Participants are selected based on specific inclusion and exclusion criteria (e.g., patients attending a specific OPD with a particular disease). While the *assignment* to treatment groups within a trial is often randomized (Randomized Controlled Trial), the initial selection of the study population from the general community is non-random.

**Analysis of Incorrect Options:**

* **A. Probability sampling:** This is the definition of random sampling, where every unit in the population has a known, non-zero chance of being selected.
* **B. Non-purposive sampling:** This is a distractor term. Purposive sampling is non-random; therefore, "non-purposive" would theoretically align closer to random methods.
* **C. Cluster random sampling:** This is a type of probability sampling where the population is divided into clusters (e.g., villages), and entire clusters are selected at random.

**High-Yield Clinical Pearls for NEET-PG:**

* **Simple Random Sampling:** The "Gold Standard"; uses a random number table or computer generator.
* **Systematic Random Sampling:** Selecting every $k^{th}$ unit (Sampling Interval = $N/n$). It is often used in field surveys.
* **Stratified Random Sampling:** Best for heterogeneous populations; ensures representation from all subgroups (strata).
* **Snowball Sampling:** A non-random method used for "hidden" populations (e.g., IV drug users, commercial sex workers).
* **Quota Sampling:** The non-random equivalent of stratified sampling.
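Systematic random sampling with interval $k = N/n$ can be sketched as follows. This is a minimal illustration that assumes $N$ is a multiple of $n$ and uses a random start within the first interval; the function name and the household list are hypothetical.

```python
import random

def systematic_sample(units, n, seed=None):
    """Pick n units by taking every k-th unit after a random start."""
    N = len(units)
    k = N // n  # sampling interval (assumes N is a multiple of n)
    if k < 1:
        raise ValueError("sample size cannot exceed population size")
    rng = random.Random(seed)
    start = rng.randrange(k)  # random start among the first k units
    return [units[start + i * k] for i in range(n)]

# Hypothetical sampling frame: households numbered 1 to 100.
households = list(range(1, 101))
sample = systematic_sample(households, n=10, seed=42)
print(sample)
```

Only the starting point is random; every later unit follows at a fixed gap of $k$, which is what makes the method convenient in field surveys.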

Q: All of the following are examples of a nominal scale, except:

Blood pressure.

### Explanation

The core of this question lies in understanding the **Scales of Measurement** used in biostatistics. Data is categorized into four levels: Nominal, Ordinal, Interval, and Ratio.

**Why Blood Pressure is the Correct Answer:** Blood pressure is a **Ratio Scale** (a type of quantitative/numerical data). It has a true zero point, and the intervals between values are equal and meaningful (e.g., the difference between 120 and 130 mmHg is the same as between 140 and 150 mmHg). Because it represents a measured quantity rather than a descriptive category, it is not a nominal scale.

**Analysis of Incorrect Options (Nominal Scales):** Nominal scales are used for qualitative data where items are assigned to distinct groups or "names" without any inherent quantitative value or natural order.

* **Race (Option A):** Categorical data based on ethnic origin. There is no mathematical "rank" between different races.
* **Sex (Option B):** A classic example of a **Dichotomous Nominal Scale** (Male/Female).
* **Iris Color (Option C):** Qualitative data (Blue, Brown, Green). These are labels used for identification with no numerical hierarchy.

**High-Yield Clinical Pearls for NEET-PG:**

* **NOIR Mnemonic:** Remember the hierarchy from simplest to most complex: **N**ominal < **O**rdinal < **I**nterval < **R**atio.
* **Ordinal Scale:** Data with a natural rank/order but unequal intervals (e.g., Cancer Staging, Socio-economic status, Likert scales).
* **Discrete vs. Continuous:** Blood pressure is **continuous** data (can have decimals), whereas the number of patients in a ward is **discrete** data.
* **Statistical Tests:** Nominal data is usually analyzed using the **Chi-square test**, while Ratio data (like BP) is analyzed using **t-tests** or **ANOVA**.

Q: What is the best graphic representation for the frequency distribution of data gathered from a continuous variable?

Histogram.

### Explanation

**Why Histogram is the Correct Answer:** A **Histogram** is the most appropriate graphical representation for a **continuous variable** (e.g., height, weight, hemoglobin levels, or blood pressure). In a histogram, the data is divided into continuous class intervals (bins) represented on the X-axis, while the frequency is shown on the Y-axis. Because the data is continuous, the bars are drawn touching each other without any gaps, signifying that there is no break between the classes. The area of each bar is proportional to the frequency of that interval.

**Why Other Options are Incorrect:**

* **A & B. Simple and Multiple Bar Graphs:** These are used for **discrete (categorical) or qualitative data** (e.g., number of hospital beds, gender, or types of blood groups). In bar graphs, there are distinct gaps between the bars because the categories are independent and not continuous.
* **C. Line Diagram:** These are primarily used to show **trends over time** (time-series data), such as the incidence of malaria over a decade or maternal mortality rates over several years.

**High-Yield Clinical Pearls for NEET-PG:**

* **Frequency Polygon:** Created by joining the midpoints of the tops of the bars in a histogram. It is also used for continuous data and is better for comparing two or more distributions on the same graph.
* **Ogive (Cumulative Frequency Curve):** Used to determine the **median** of a distribution.
* **Scatter Diagram:** Used to show the **correlation** (relationship) between two continuous variables.
* **Pie Chart:** Used to show the relative proportion of various components of a whole (qualitative data).
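The binning step behind a histogram, grouping a continuous variable into adjoining class intervals, can be sketched without any plotting library. The hemoglobin values, bin width, and function name below are hypothetical choices for illustration.

```python
def histogram_counts(values, start, width, bins):
    """Count values falling in adjoining intervals [start + i*width, start + (i+1)*width)."""
    counts = [0] * bins
    for v in values:
        idx = int((v - start) // width)
        if 0 <= idx < bins:  # values outside the covered range are ignored
            counts[idx] += 1
    return counts

# Hypothetical hemoglobin readings in g/dL.
hb = [9.5, 10.2, 11.8, 12.0, 12.4, 13.1, 13.9, 14.5, 15.2]
counts = histogram_counts(hb, start=9.0, width=2.0, bins=4)

# Text rendering: one row per class interval, with no gaps between intervals.
for i, c in enumerate(counts):
    lo = 9.0 + i * 2.0
    print(f"{lo:4.1f}-{lo + 2.0:4.1f} | {'#' * c}")
```

Each interval begins exactly where the previous one ends, which is the property that lets histogram bars touch; categorical data has no such shared boundaries, hence the gaps in a bar graph.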

Q: What does repeatability of a test refer to?

All of the above.

**Explanation:** In biostatistics, **Repeatability** (also known as reliability or reproducibility) refers to the ability of a diagnostic test or measurement to produce consistent results when performed multiple times under the same conditions on the same subject.

**Why "All of the above" is correct:**

* **Obtaining the same results on repeated trials:** This is the literal definition of repeatability. If a blood pressure cuff gives a reading of 120/80 mmHg three times in a row on the same stable patient, it is repeatable.
* **Precision of the test:** Precision is the statistical synonym for repeatability. It describes how close the measurements are to *each other*, regardless of whether they are close to the "true" value.
* **Consistency of results:** This refers to the lack of variation (random error) in the test results over time or between different observers.

**Analysis of Options:** Since repeatability encompasses the concepts of consistency, precision, and the replication of results, all three individual options (A, B, and C) are fundamentally describing the same attribute of a diagnostic tool.

**High-Yield Clinical Pearls for NEET-PG:**

* **Reliability vs. Validity:** Reliability (Repeatability/Precision) is about **consistency**. Validity (Accuracy) is about **truth** (how close the result is to the gold standard).
* **The "Bullseye" Analogy:**
  * Tight cluster away from the center = Precise but not Accurate.
  * Scattered around the center = Accurate but not Precise.
  * Tight cluster in the center = Both Precise and Accurate.
* **Evaluation:** Repeatability is measured using the **Kappa statistic** (for qualitative data) or the **Intraclass Correlation Coefficient** (for quantitative data).
* **Source of Error:** Reliability is affected by **random error**, whereas Validity is affected by **systematic error (bias)**.
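As an illustration of the Kappa statistic mentioned above, Cohen's kappa for two raters can be computed as $\kappa = (p_o - p_e)/(1 - p_e)$, where $p_o$ is the observed agreement and $p_e$ the agreement expected by chance. The agreement table below is hypothetical, and the function is a minimal sketch rather than a validated implementation.

```python
def cohen_kappa(table):
    """Cohen's kappa from a square table: table[i][j] counts subjects
    placed in category i by rater A and category j by rater B."""
    k = len(table)
    total = sum(sum(row) for row in table)
    observed = sum(table[i][i] for i in range(k)) / total
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(k)) for j in range(k)]
    expected = sum(row_totals[i] * col_totals[i] for i in range(k)) / total**2
    return (observed - expected) / (1 - expected)

# Hypothetical: two observers read the same 100 slides as positive/negative.
#                 B: pos  B: neg
agreement = [[40, 10],   # A: pos
             [5, 45]]    # A: neg
kappa = cohen_kappa(agreement)
print(f"kappa = {kappa:.2f}")
```

Here the observers agree on 85% of slides, but half of that agreement would be expected by chance alone, so kappa comes out at 0.70 rather than 0.85.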

Question 1

Which of the following statements about data diagrams is FALSE?

Accepted Answer

A bimodal frequency distribution has two peaks.

Answer

Frequency distributions are usually illustrated by histograms.

Answer

Frequencies are commonly illustrated by bar charts.

Answer

Frequency polygons are useful for comparing multiple frequency distributions on the same diagram.

Question 2

Which of the following is the national level system that provides annual national and state-level reliable estimates of fertility and mortality?

Accepted Answer

Sample Registration System

Answer

Census

Answer

Adhoc Survey

Answer

Civil Registration System

Question 3

A study is planned to check stool occult blood positivity using hemoccult test among participants aged 50-75 years. The test is repeated if the result is positive, but not repeated if the test is negative. What is the effect on sensitivity and specificity?

Accepted Answer

Sensitivity decreases, specificity increases

Answer

Sensitivity increases, specificity decreases

Answer

Sensitivity unchanged, specificity increases

Answer

Sensitivity increases, specificity unchanged
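
The accepted answer can be checked with a short calculation. The sketch below assumes that a participant counts as positive overall only if both the first test and the repeat are positive, and that the two runs are independent given true disease status; the single-test sensitivity and specificity are hypothetical values, not hemoccult figures.

```python
def serial_testing(sens: float, spec: float):
    """Net sensitivity and specificity when positives are retested once
    and only double positives count as positive."""
    # A diseased person must test positive twice in a row.
    net_sens = sens * sens
    # A healthy person is cleared on the first run, or on the repeat
    # after a false-positive first run.
    net_spec = spec + (1 - spec) * spec
    return net_sens, net_spec

single_sens, single_spec = 0.70, 0.90  # hypothetical single-test values
net_sens, net_spec = serial_testing(single_sens, single_spec)
print(f"sensitivity: {single_sens:.2f} -> {net_sens:.2f}")
print(f"specificity: {single_spec:.2f} -> {net_spec:.2f}")
```

Under these assumptions, retesting positives gives diseased participants a second chance to be missed and healthy participants a second chance to be cleared, so sensitivity falls while specificity rises.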

Question 4

Which test detects true negatives?

Accepted Answer

Specificity

Answer

Relative risk

Answer

Odds ratio

Answer

Sensitivity

Question 5

In a normal distribution curve, which values are the same?

Accepted Answer

Mode and Mean

Answer

Standard Deviation and Mean

Answer

Standard Deviation and Mode

Answer

Standard Deviation and Median

Question 6

Which of the following is an example of non-random sampling?

Accepted Answer

Clinical trial sampling

Answer

Probability sampling

Answer

Non-purposive sampling

Answer

Cluster random sampling

Question 7

All of the following are examples of a nominal scale, except:

Accepted Answer

Blood pressure

Answer

Race

Answer

Sex

Answer

Iris color

Question 8

What is the best graphic representation for the frequency distribution of data gathered from a continuous variable?

Accepted Answer

Histogram

Answer

Simple bar graph

Answer

Multiple bar graph

Answer

Line diagram

Question 9

In a study comparing a common drug (NSAID) and a rare drug (Dipyrone) causing a disease, the relative risk (RR) and attributable risk (AR) were calculated. Which of the following statements is true regarding the risks associated with these drugs?

Accepted Answer

NSAID has higher RR and AR than Dipyrone

Answer

NSAID has lower RR and AR than Dipyrone

Answer

NSAID has lower RR and higher AR than Dipyrone

Answer

AR and RR of both drugs are the same.

Question 10

What does repeatability of a test refer to?

Accepted Answer

All of the above

Answer

Obtaining the same results on repeated trials

Answer

Precision of the test

Answer

Consistency of results
