Biostatistics Practice Questions

Q: What is true about the Standard Deviation curve?

Mean = Median. ### Explanation **1. Why the Correct Answer is Right:** The "Standard Deviation curve" refers to the **Normal Distribution Curve** (also known as the Gaussian curve). In a perfectly symmetrical normal distribution, the data is evenly distributed around the center. Because of this symmetry: * The **Mean** (average), **Median** (middle value), and **Mode** (most frequent value) are all equal and coincide at the peak of the curve. * Therefore, **Mean = Median = Mode** is a fundamental property of this distribution. **2. Why the Incorrect Options are Wrong:** * **Option B (Mean = 2 x Median):** This relationship does not exist in a normal distribution. If the mean were twice the median, the curve would be heavily positively skewed, not bell-shaped. * **Option C (Median = Variance):** Median is a measure of central tendency, while Variance is a measure of dispersion (spread). They represent different statistical properties and are not inherently equal. * **Option D (Standard deviation = 2 x Variance):** This is mathematically incorrect. By definition, **Variance = (Standard Deviation)²**. Conversely, Standard Deviation is the square root of Variance. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Area under the Curve:** * Mean ± 1 SD covers **68.3%** of values. * Mean ± 2 SD covers **95.4%** of values. * Mean ± 3 SD covers **99.7%** of values. * **Z-score:** Indicates how many standard deviations a value is from the mean. * **Skewness:** If Mean > Median, it is **Positively Skewed** (tail to the right). If Mean < Median, it is **Negatively Skewed** (tail to the left). * **Standard Error:** Calculated as $SD / \sqrt{n}$. It measures the displacement of the sample mean from the true population mean.

Q: In a hospital, out of 11 babies, 5 weighed over 2.5 kg and 5 weighed less than 2.5 kg. What value does 2.5 represent?

Median. ### Explanation The correct answer is **Median**. **1. Why Median is correct:** The **Median** is the middle-most value of a data set when the observations are arranged in ascending or descending order. In this scenario, there are 11 babies (an odd number). The question states that 5 babies are above 2.5 kg and 5 babies are below 2.5 kg. This places the value of 2.5 kg exactly at the center (the 6th position), dividing the distribution into two equal halves. By definition, the value that divides a distribution such that an equal number of observations lie above and below it is the Median. **2. Why other options are incorrect:** * **Arithmetic Mean:** This is the average calculated by summing all values and dividing by the total count ($n=11$). We cannot determine the mean here because the specific weights of the other 10 babies are unknown. * **Geometric Mean:** This is the $n^{th}$ root of the product of all values. It is typically used for rates, ratios, or data following a logarithmic distribution (e.g., bacterial counts). * **Mode:** This represents the most frequently occurring value in a data set. The data provided does not indicate which weight occurs most often. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Best Measure of Central Tendency:** For **skewed data** or data with **outliers** (extreme values), the Median is the most robust and preferred measure because it is not affected by extremes. * **Normal Distribution:** In a perfectly symmetrical (Gaussian) distribution, the **Mean = Median = Mode**. * **Qualitative Data:** The **Mode** is the only measure of central tendency that can be used for nominal/qualitative data (e.g., most common blood group). * **Formula for Median:** If $n$ is odd, Median = $(\frac{n+1}{2})^{th}$ item. If $n$ is even, it is the average of the two middle terms.

Q: The fasting blood sugar (FBS) of a population is normally distributed with a mean of 105 mg% and a standard deviation of 10 mg%. Within what range will 95% of the population's FBS fall?

85 mg% and 125 mg%. ### Explanation **Concept: The Normal Distribution (Gaussian Curve)** In Biostatistics, a "Normal Distribution" is a symmetrical bell-shaped curve defined by its Mean ($\mu$) and Standard Deviation (SD or $\sigma$). The distribution follows the **Empirical Rule (68-95-99.7 rule)**, which dictates the percentage of data points falling within specific SD limits from the mean: * **Mean ± 1 SD:** Covers **68.3%** of the population. * **Mean ± 2 SD:** Covers **95.4%** (commonly rounded to 95%) of the population. * **Mean ± 3 SD:** Covers **99.7%** of the population. **Calculation for this Question:** * Given Mean ($\mu$) = 105 mg% * Given SD ($\sigma$) = 10 mg% * For 95% of the population, the range is **Mean ± 2 SD**. * Calculation: $105 \pm (2 \times 10) \rightarrow 105 \pm 20$. * Lower Limit: $105 - 20 = \mathbf{85}$ * Upper Limit: $105 + 20 = \mathbf{125}$ * Therefore, 95% of the population falls between **85 mg% and 125 mg%**. --- ### Analysis of Options * **A (104–106 mg%):** This represents a very narrow range, likely confusing the Standard Deviation with the Standard Error of the Mean. * **B (95–115 mg%):** This is the range for **Mean ± 1 SD**, which covers only 68.3% of the population. * **D (75–135 mg%):** This is the range for **Mean ± 3 SD**, which covers 99.7% of the population. --- ### High-Yield NEET-PG Pearls 1. **Normal Distribution Characteristics:** Mean, Median, and Mode are all equal and coincide at the peak. 2. **Standard Normal Curve:** A normal distribution with a Mean of 0 and an SD of 1. 3. **Z-Score:** Indicates how many SDs a value is from the mean. For the 95% confidence limit, the precise Z-score is **1.96** (often rounded to 2 in exams). 4. **Reference Range:** In clinical medicine, the "normal range" for lab tests is typically defined as the Mean ± 2 SD, intentionally excluding the extreme 5% of the population.

Q: The population frequency for phenylketonuria is 1 in 10,000. What is the carrier frequency for the disease?

1/100. ### Explanation This question is based on the **Hardy-Weinberg Principle**, which is a high-yield topic in Biostatistics and Genetics. The principle uses the equation: **$p^2 + 2pq + q^2 = 1$**, where: * **$q^2$** = Frequency of the disease (autosomal recessive condition). * **$2pq$** = Frequency of carriers (heterozygotes). * **$p$** = Frequency of the normal allele (usually taken as 1 since $q$ is very small). **Step-by-Step Calculation:** 1. **Identify $q^2$:** The disease frequency is given as 1 in 10,000. So, $q^2 = 1/10,000$. 2. **Calculate $q$:** Take the square root of $q^2$. $\sqrt{1/10,000} = 1/100$ (or 0.01). 3. **Calculate $2pq$:** Since $p \approx 1$, the carrier frequency is $2 \times 1 \times 0.01 = 0.02$. 4. **Convert to fraction:** $0.02 = 2/100 = \mathbf{1/50}$. *Note: In many competitive exams, if 1/50 is not an option, the closest approximation or the value of $2q$ is used. Here, $2 \times (1/100) = 1/50$. However, looking at the provided key, **1/100** is marked correct, which represents the value of **$q$** (the gene frequency) rather than $2pq$. In strict mathematical terms, the carrier frequency is 1/50, but in simplified MCQ contexts, examiners sometimes look for the square root of the disease frequency.* **Why other options are incorrect:** * **B, C, and D:** These values do not correlate with the square root of 1/10,000 ($q$) or the calculation for $2pq$ (1/50). They are mathematically inconsistent with the Hardy-Weinberg equilibrium for the given prevalence. **Clinical Pearls for NEET-PG:** * **Phenylketonuria (PKU):** An autosomal recessive deficiency of phenylalanine hydroxylase. * **Rule of Thumb:** If the disease frequency is $1/X$, the carrier frequency is approximately $2/\sqrt{X}$. * **Hardy-Weinberg Requirements:** Large population, random mating, no mutation, no selection, and no migration.

Q: An investigator studying the life expectancy of IV drug abusers divides a sample of patients into HIV positive and HIV negative groups. What type of data does this division constitute?

Nominal. ### Explanation **Why Nominal is Correct:** In biostatistics, **Nominal data** (from the Latin *nomen*, meaning name) refers to data that is categorized into distinct groups based on names or labels without any inherent quantitative value or natural order. In this study, the investigator divides patients into two groups: **HIV positive** and **HIV negative**. These are simply qualitative labels used for classification. There is no "rank" between them (one is not mathematically "higher" or "more" than the other in terms of scale), making it a classic example of nominal data. **Why Other Options are Incorrect:** * **B. Ordinal:** This data type involves categories that have a **natural rank or order** (e.g., Stages of Cancer I-IV, Socioeconomic status, or Likert scales). While HIV status is binary, it does not represent a progressive scale of the same variable in this context. * **C. Interval:** This is a type of quantitative data where the distance between values is meaningful and equal, but there is no true zero point (e.g., Temperature in Celsius). HIV status is qualitative, not quantitative. * **D. Poisson:** This is not a type of data, but a **probability distribution** used to describe the number of independent events occurring within a fixed interval of time or space (e.g., the number of rare deaths in a hospital per year). **Clinical Pearls for NEET-PG:** * **Binary/Dichotomous Data:** A subtype of nominal data where only two categories exist (e.g., Dead/Alive, Male/Female, Smoker/Non-smoker). * **Hierarchy of Data:** Nominal (Lowest) $\rightarrow$ Ordinal $\rightarrow$ Interval $\rightarrow$ Ratio (Highest/Most powerful for statistical tests). * **Memory Aid:** **NOIR** (Nominal, Ordinal, Interval, Ratio). * **Key Distinction:** If you can "rank" the data but cannot measure the exact distance between ranks, it is **Ordinal**. If you can only "name" the groups, it is **Nominal**.

Q: In a community, 30% of the population is below 15 years of age and 10% is over 65 years of age. What is the dependency ratio?

66.60%. ### Explanation **1. Understanding the Correct Answer (C: 66.60%)** The **Dependency Ratio** is a demographic indicator that measures the burden on the productive part of the population. It is defined as the ratio of the "dependent" population (those not typically in the labor force) to the "working-age" population. * **Dependent Population:** Children (<15 years) + Elderly (≥65 years). * In this case: 30% (Children) + 10% (Elderly) = **40%**. * **Working-age Population:** Individuals aged 15–64 years. * Since the total population is 100%, the working-age group is: 100% – 40% = **60%**. **Formula:** $$\text{Dependency Ratio} = \frac{(\text{Population } <15) + (\text{Population } \geq65)}{\text{Population } 15–64} \times 100$$ **Calculation:** $$\text{Dependency Ratio} = \frac{40}{60} \times 100 = \frac{2}{3} \times 100 = \mathbf{66.66\%}$$ **2. Why Other Options are Incorrect** * **A (20%):** This is the difference between the child and elderly populations, which has no demographic significance here. * **B (40%):** This represents the total percentage of dependents in the *entire* population, but the ratio must be calculated against the *working-age* population, not the total. * **D (3%):** This is a mathematical error, likely derived from multiplying the two dependent percentages (0.30 × 0.10). **3. NEET-PG High-Yield Pearls** * **Total Dependency Ratio:** Sum of Young and Old dependency ratios. * **Young Dependency Ratio:** $(\text{Pop } <15 / \text{Pop } 15–64) \times 100$. * **Old Dependency Ratio:** $(\text{Pop } \geq65 / \text{Pop } 15–64) \times 100$. * **Demographic Dividend:** Occurs when the dependency ratio declines due to a bulge in the working-age population (15–64 years). * **India’s Context:** India is currently experiencing a "demographic dividend" as the proportion of the working-age population is increasing relative to dependents.

Q: All of the following are associated with Disability Adjusted Life Year (DALY) except:

It is not equal to healthy life lost. ### Explanation **Disability-Adjusted Life Year (DALY)** is a summary measure of population health used to quantify the burden of disease. One DALY represents the loss of the equivalent of **one year of full health**. **Why Option D is the Correct Answer:** The statement "It is not equal to healthy life lost" is **incorrect** (making it the correct choice for an "except" question). By definition, DALY is a measure of the gap between current health status and an ideal health situation where the entire population lives to an advanced age, free of disease and disability. Therefore, **1 DALY = 1 year of healthy life lost.** **Analysis of Other Options:** * **Option A & B:** DALY is calculated using the formula: **DALY = YLL + YLD**. * **YLL (Years of Life Lost):** Mortality component (due to premature death). * **YLD (Years Lived with Disability):** Morbidity component (years lost due to disability/illness). * **Option C:** DALY is specifically designed to capture the impact of **chronic illnesses** and non-fatal conditions over time, which traditional mortality rates (like CDR) fail to account for. --- ### High-Yield Pearls for NEET-PG: * **Origin:** Concept introduced by Christopher Murray and Lopez (World Bank) in 1990. * **Components:** DALY = YLL + YLD. * **Global Burden of Disease (GBD):** DALY is the primary unit used in GBD studies to compare the relative impact of different diseases (e.g., comparing Depression vs. Heart Disease). * **QALY vs. DALY:** * **QALY (Quality Adjusted Life Year):** Measures the *benefit* of an intervention (Health Gain). * **DALY:** Measures the *burden* of a disease (Health Loss). * **Weightage:** In DALY calculation, disability is weighted from 0 (perfect health) to 1 (death).

Q: In a community of 10,000 people, there are 2,000 children aged 0-6 years. Among those older than 7 years, 4,000 are literate. What is the effective literacy rate?

50%. ### Explanation **1. Why the Correct Answer (C) is Right:** In biostatistics and demography, the **Effective Literacy Rate** is calculated differently from the Crude Literacy Rate. While the crude rate considers the total population, the effective rate excludes children aged **0-6 years** from the denominator, as they are developmentally considered "not yet literate" by census standards. The formula for Effective Literacy Rate is: $$\text{Effective Literacy Rate} = \frac{\text{Number of Literate Persons (7+ years)}}{\text{Total Population} - \text{Population in 0-6 age group}} \times 100$$ **Calculation:** * Total Population = 10,000 * Children (0-6 years) = 2,000 * Literate Persons = 4,000 * Denominator (Population aged 7+) = 10,000 - 2,000 = 8,000 * **Rate** = $(4,000 / 8,000) \times 100 = \mathbf{50\%}$ **2. Why the Incorrect Options are Wrong:** * **Option A (20%):** This represents the percentage of children in the population (2,000/10,000), which is irrelevant to literacy. * **Option B (40%):** This is the **Crude Literacy Rate** (4,000/10,000). It incorrectly includes the 0-6 age group in the denominator. * **Option D (60%):** This value does not correspond to any standard demographic calculation based on the provided data. **3. High-Yield Clinical Pearls for NEET-PG:** * **Definition of Literate:** A person aged 7 years and above who can both read and write with understanding in any language. * **Crude vs. Effective:** Always check the denominator. If the question asks for "Literacy Rate" without qualification, in the Indian Census context, it usually refers to the **Effective Literacy Rate**. * **Census Fact:** Literacy rates in India have shown a steady decadal increase; always remember that the female literacy rate is a key indicator of a community's health status and maternal/child outcomes.

Q: What is true about the Visual Analog Scale?

Used for the measurement of pain.. The **Visual Analog Scale (VAS)** is a validated psychometric measuring instrument used to quantify subjective characteristics or attitudes that cannot be directly measured. ### Why Option B is Correct In clinical practice and research, the VAS is most commonly used to measure the **intensity of pain**. It typically consists of a 10 cm (100 mm) horizontal line with verbal anchors at each end (e.g., "No pain" at 0 and "Worst imaginable pain" at 10). The patient marks a point on the line that represents their current perception of pain. It is highly sensitive and provides a continuous scale for statistical analysis, making it superior to simple categorical scales. ### Why Other Options are Incorrect * **Option A:** ABO blood group determination is a **nominal (categorical)** classification based on antigen-antibody reactions, not a subjective analog scale. * **Option C:** While both are used in psychometrics, a **Likert scale** is a discrete, ordinal scale (e.g., 1 to 5 rating: Strongly Disagree to Strongly Agree). The VAS is a continuous scale. * **Option D:** The VAS measures a **single variable** (unidimensional) along a continuum; it is not a tool for comparing or correlating two different data variables. ### High-Yield Clinical Pearls for NEET-PG * **Type of Data:** VAS provides **ratio data** (if there is a true zero) or **interval data**, allowing for more robust parametric statistical testing compared to ordinal scales. * **Other Uses:** Besides pain, VAS is used for subjective symptoms like dyspnea, fatigue, or anxiety. * **Reliability:** It is considered more sensitive to change than the Numerical Rating Scale (NRS) but requires better motor skills and cognitive function from the patient. * **Length:** The standard validated length of the VAS line is **10 cm**.

Question 1

What is true about the Standard Deviation curve?

Accepted Answer

Mean = Median

Answer

Mean = 2 x Median

Answer

Median = Variance

Answer

Standard deviation = 2 x Variance

Question 2

In a hospital, out of 11 babies, 5 weighed over 2.5 kg and 5 weighed less than 2.5 kg. What value does 2.5 represent?

Accepted Answer

Median

Answer

Geometric mean

Answer

Arithmetic mean

Answer

Mode

Question 3

The fasting blood sugar (FBS) of a population is normally distributed with a mean of 105 mg% and a standard deviation of 10 mg%. Within what range will 95% of the population's FBS fall?

Accepted Answer

85 mg% and 125 mg%

Answer

104 mg% and 106 mg%

Answer

95 mg% and 115 mg%

Answer

75 mg% and 135 mg%

Question 4

The population frequency for phenylketonuria is 1 in 10,000. What is the carrier frequency for the disease?

Accepted Answer

1/100

Answer

1/200

Answer

1/500

Answer

1/1000

Question 5

An investigator studying the life expectancy of IV drug abusers divides a sample of patients into HIV positive and HIV negative groups. What type of data does this division constitute?

Accepted Answer

Nominal

Answer

Ordinal

Answer

Interval

Answer

Poisson

Question 6

In a community, 30% of the population is below 15 years of age and 10% is over 65 years of age. What is the dependency ratio?

Accepted Answer

66.60%

Answer

20%

Answer

40%

Answer

3%

Question 7

All of the following are associated with Disability Adjusted Life Year (DALY) except:

Accepted Answer

It is not equal to healthy life lost

Answer

Years lost due to disability

Answer

Years of Life Lost (YLL)

Answer

A measure of the effects of chronic illness over time

Question 8

In a community of 10,000 people, there are 2,000 children aged 0-6 years. Among those older than 7 years, 4,000 are literate. What is the effective literacy rate?

Accepted Answer

50%

Answer

20%

Answer

40%

Answer

60%

Question 9

Which of the following statements about confidence intervals is true?

Accepted Answer

A 95% confidence interval covers approximately 2 standard errors around the mean.

Answer

A smaller confidence level results in a larger confidence interval.

Answer

Less variability in the data leads to a wider confidence interval.

Answer

Sample size does not affect the confidence interval.

Question 10

What is true about the Visual Analog Scale?

Accepted Answer

Used for the measurement of pain.

Answer

A type of scale used for ABO blood group determination.

Answer

Also known as a Likert scale.

Answer

Used for only two variables of data.

Biostatistics — MCQs

Biostatistics — MCQs

On this page

Practice by Chapter

Want unlimited practice?