Biostatistics Practice Questions

Q: What is the term for the initial number of individuals in a life table cohort?

Cohort size. ### Explanation **Correct Answer: D. Cohort size** In biostatistics and demography, a **Life Table** is a statistical tool used to track the mortality experience of a specific group of individuals. The **Cohort size** refers to the total number of individuals present at the beginning of the study (Age 0). This group is followed until the last member dies to calculate life expectancy and survival rates. **Analysis of Options:** * **A. Radix:** While often used interchangeably in general discussion, the "Radix" specifically refers to the **arbitrary fixed number** (usually 100,000 or 1,000) assigned as the starting population in a hypothetical life table to simplify calculations. While "Cohort size" is the literal term for the initial number of individuals, "Radix" is the mathematical constant used for standardization. * **B. Radius:** This is a geometric term and has no relevance to life tables or demographic statistics. * **C. Origin:** In statistics, the origin refers to the starting point $(0,0)$ on a graph. While the life table starts at "Age 0," the term does not describe the number of individuals. **High-Yield Facts for NEET-PG:** * **Types of Life Tables:** 1. **Cohort (Current) Life Table:** Based on the actual mortality experience of a birth cohort. 2. **Period (Abridged) Life Table:** Based on the mortality rates of a population at a specific point in time (more commonly used). * **Expectation of Life ($e_x$):** The average number of additional years a person is expected to live if current mortality patterns continue. * **$l_x$:** Represents the number of persons living at the beginning of a specific age interval. * **$q_x$:** Represents the probability of dying between age $x$ and $x+1$.

Q: Mean hemoglobin values are compared between two independent population groups. What is the most appropriate statistical test to use?

Unpaired T-test. To select the correct statistical test, we must evaluate the **type of data** and the **number of groups** being compared. ### 1. Why Unpaired T-test is Correct The question involves comparing **Mean Hemoglobin values**, which is **quantitative (numerical/continuous) data**. These values are being compared between **two independent population groups** (e.g., males vs. females or Group A vs. Group B). * The **Unpaired T-test** (also known as the Independent Samples T-test) is specifically designed to compare the means of two independent groups when the data follows a normal distribution. ### 2. Why Other Options are Incorrect * **Paired T-test:** Used for quantitative data when the two sets of observations are related (e.g., "Before and After" treatment measurements in the same individual). * **Chi-square test:** Used for **qualitative (categorical) data** to compare proportions or associations (e.g., comparing the percentage of anemic vs. non-anemic patients between two groups). * **Fisher's exact test:** Also used for qualitative data, specifically when the sample size is small (e.g., any cell value in a 2x2 table is less than 5). ### 3. NEET-PG High-Yield Pearls * **Rule of Thumb:** If the data is **Quantitative**, use a **T-test** (2 groups) or **ANOVA** (>2 groups). If the data is **Qualitative**, use **Chi-square**. * **Z-test:** Used instead of a T-test if the sample size is large (**n > 30**). * **Non-parametric equivalent:** If the data is not normally distributed, the **Mann-Whitney U test** is the non-parametric alternative to the Unpaired T-test.

Q: Two groups are tested for anemia. Which statistical test should be used?

Chi-square test. ### Explanation The key to selecting the correct statistical test lies in identifying the **type of data** being analyzed. **Why Chi-square test is correct:** In this scenario, "anemia" is a **qualitative (categorical) variable**. When testing for anemia, individuals are classified into categories: "Anemic" or "Non-anemic." When comparing two independent groups (e.g., Group A vs. Group B) based on a categorical outcome, we use a **contingency table** (2x2 table) and apply the **Chi-square test**. It assesses the significance of the difference between observed and expected frequencies. **Why the other options are incorrect:** * **Paired T-test:** Used for **quantitative (numerical)** data when comparing means of the *same* group at two different times (e.g., hemoglobin levels before and after treatment). * **Unpaired (Independent) T-test:** Used for **quantitative** data when comparing the means of *two* different groups (e.g., comparing the mean Hemoglobin value in g/dL between Group A and Group B). * **ANOVA (Analysis of Variance):** Used for **quantitative** data when comparing the means of *three or more* groups. **Clinical Pearls for NEET-PG:** * **Categorical Data (Proportions/Percentages) →** Chi-square test. * **Numerical Data (Means/SD) →** T-test (for 2 groups) or ANOVA (for >2 groups). * If the sample size is very small (any cell value in a 2x2 table is <5), use **Fisher’s Exact Test** instead of Chi-square. * **Correlation coefficient (r)** measures the strength of a linear relationship between two numerical variables, while **Regression** predicts the value of one variable based on another.

Q: What is the current neonatal mortality rate per 1000 live births?

25/1000 live births. ### Explanation **Correct Answer: A. 25/1000 live births** The **Neonatal Mortality Rate (NMR)** is defined as the number of deaths of infants under 28 days of age per 1,000 live births. According to the latest **Sample Registration System (SRS) Bulletin (2020)**, the NMR in India has declined to **20 per 1,000 live births**. However, in the context of standard NEET-PG questions based on the **NFHS-5 (National Family Health Survey 2019-21)** data, the national average is recorded as **24.9 per 1,000 live births**, which is rounded to **25**. **Analysis of Incorrect Options:** * **Option B (34/1000):** This figure is closer to the current **Infant Mortality Rate (IMR)** in India, which is 35 per 1,000 live births (NFHS-5) or 28 per 1,000 live births (SRS 2020). * **Option C (33/1000):** This was the NMR recorded in earlier surveys (around 2011-2013). It represents an outdated statistic and does not reflect the progress made under the National Health Mission (NHM). **High-Yield Clinical Pearls for NEET-PG:** * **NMR Components:** It is divided into Early Neonatal Mortality (0-7 days) and Late Neonatal Mortality (7-28 days). Early neonatal deaths contribute to nearly 75% of total neonatal deaths. * **Leading Causes:** The primary causes of neonatal mortality in India are **Prematurity/Low Birth Weight (35%)**, followed by Neonatal Infections (Sepsis) and Birth Asphyxia. * **SDG Target:** The Sustainable Development Goal (SDG 3.2) aims to reduce NMR to at least **12 per 1,000 live births** by 2030. * **Indicator of Care:** NMR is a sensitive indicator of the quality of antenatal and intrapartum care.

Q: Which of the following is an example of a nominal variable?

Roll number. **Explanation:** In biostatistics, variables are classified based on their level of measurement. A **Nominal variable** is a type of qualitative (categorical) data where numbers or names are used solely as labels to identify or categorize items. There is no inherent numerical value, order, or rank associated with these labels. * **Why Roll Number is correct:** Although a roll number consists of digits, it is a **Nominal variable**. It serves only as a unique identifier for a student. You cannot perform meaningful mathematical operations on it (e.g., adding two roll numbers is meaningless), nor does a higher roll number imply "more" of a specific attribute. Other examples include Gender, Blood Group, and Marital Status. **Analysis of Incorrect Options:** * **Temperature (Option A):** This is a **Scale/Interval variable**. It is quantitative data where the distance between points is equal, but there is no "absolute zero" (0°C does not mean absence of heat). * **TNM Staging (Option C):** This is an **Ordinal variable**. While it is categorical, there is a clear, inherent rank or progression (Stage II is more advanced than Stage I). * **Mid-arm circumference (Option D):** This is a **Ratio variable** (Continuous quantitative data). It has a defined absolute zero and can be measured in decimals. **High-Yield Clinical Pearls for NEET-PG:** * **NOIR Hierarchy:** Remember the hierarchy from simplest to most complex: **N**ominal < **O**rdinal < **I**nterval < **R**atio. * **Qualitative Data:** Includes Nominal and Ordinal. * **Quantitative Data:** Includes Interval and Ratio (Discrete or Continuous). * **Visual Aid:** If you can rank the data, it's **Ordinal**; if it's just a name/label, it's **Nominal**.

Q: The Kaplan-Meier method is used for estimating:

Survival. **Explanation:** The **Kaplan-Meier method** (also known as the product-limit method) is a non-parametric statistic used to estimate the **survival function** from time-to-event data. In medical research, it is the gold standard for analyzing "time to death" or "time to a specific clinical event" (like relapse or recovery). **Why Survival is Correct:** The method calculates the probability of an event occurring at specific time intervals. Its unique strength is handling **censored data**—cases where the event hasn't happened yet by the end of the study or the patient is lost to follow-up. The results are typically visualized using a **Kaplan-Meier Curve**, which displays a characteristic "step-ladder" pattern. **Why Other Options are Incorrect:** * **Incidence:** Refers to the number of new cases in a population over a period. It is calculated using simple proportions or person-time rates, not survival analysis. * **Prevalence:** Refers to the total number of existing cases (old + new) at a specific point in time. It is a cross-sectional measure. * **Frequency:** A general term for the count or occurrence of a variable; it does not account for the "time-to-event" dimension required for Kaplan-Meier analysis. **High-Yield Clinical Pearls for NEET-PG:** * **Log-Rank Test:** This is the statistical test used to compare two different Kaplan-Meier survival curves (e.g., Treatment vs. Placebo). * **Hazard Ratio:** Often reported alongside survival curves to indicate the relative risk of the event occurring in one group versus another. * **Median Survival Time:** The time at which 50% of the study subjects are still alive; it is easily identified on a Kaplan-Meier plot.

Q: Calculate the mean and mode of the following set of values: 2, 2, 3, 4, 4, 4, 4, 5, 5, 7, 8, 8, 9.

Mean: 5, Mode: 4. **Explanation:** In Biostatistics, measures of central tendency are essential for summarizing medical data. This question tests the ability to calculate the **Mean** (arithmetic average) and identify the **Mode** (most frequent value). **1. Calculation of Mean:** The mean is calculated by the formula: $\text{Mean} = \frac{\sum X}{n}$ (Sum of all observations / Total number of observations). * Sum ($\sum X$): $2+2+3+4+4+4+4+5+5+7+8+8+9 = 65$ * Total count ($n$): $13$ * Mean: $65 / 13 = \mathbf{5}$ **2. Identification of Mode:** The mode is the value that appears most frequently in a data set. * Frequency of 2: (2 times) * Frequency of 3: (1 time) * **Frequency of 4: (4 times)** * Frequency of 5: (2 times) * Frequency of 7: (1 time) * Frequency of 8: (2 times) * Frequency of 9: (1 time) * Since 4 appears most often, the **Mode is 4**. **Analysis of Options:** * **Option B (Correct):** Correctly identifies Mean as 5 and Mode as 4. * **Option A:** Incorrect; 5 is the mean, not the mode. * **Option C:** Incorrect; 9 is the maximum value, not the mode. * **Option D:** Incorrect; 9 is the maximum value, not the mean. **High-Yield Clinical Pearls for NEET-PG:** * **Mean:** Most sensitive measure of central tendency but highly influenced by **outliers** (extreme values). * **Median:** The best measure of central tendency for **skewed data**. * **Mode:** The only measure that can be used for **nominal (qualitative) data** (e.g., most common blood group). * **Relationship:** In a perfectly symmetrical (Normal) distribution: **Mean = Median = Mode**.

Q: In a village of 1 lakh population, among 20,000 exposed to smoking, 200 developed cancer, and among 40,000 unexposed individuals, 40 developed cancer. What is the relative risk of smoking in the development of cancer?

10. ### Explanation **1. Understanding the Correct Answer (B: 10)** Relative Risk (RR) is the ratio of the incidence of a disease among the exposed group to the incidence of the disease among the unexposed group. It is the primary measure of association in **Cohort Studies**. * **Incidence among exposed ($I_e$):** $\frac{\text{Number of cases in exposed}}{\text{Total exposed}} = \frac{200}{20,000} = 0.01$ (or 10 per 1000) * **Incidence among unexposed ($I_u$):** $\frac{\text{Number of cases in unexposed}}{\text{Total unexposed}} = \frac{40}{40,000} = 0.001$ (or 1 per 1000) * **Relative Risk (RR):** $\frac{I_e}{I_u} = \frac{0.01}{0.001} = \mathbf{10}$ This means smokers are 10 times more likely to develop cancer compared to non-smokers. **2. Why Other Options are Incorrect** * **Option A (20):** This would occur if the incidence in the exposed group was doubled or the unexposed incidence was halved. It overestimates the association. * **Option C (5):** This would be the result if the number of cases in the exposed group was only 100 instead of 200. * **Option D (15):** This is a mathematical error, likely arising from subtracting incidences rather than dividing them (Attributable Risk calculation error). **3. NEET-PG High-Yield Pearls** * **Relative Risk (RR):** Measures the **strength of association**. RR > 1 indicates a positive association (risk factor); RR = 1 indicates no association. * **Attributable Risk (AR):** $(I_e - I_u) / I_e \times 100$. It indicates the amount of disease that can be prevented if the exposure is eliminated. * **Odds Ratio (OR):** Used in **Case-Control studies** as an estimate of RR. * **Incidence** can only be calculated in Cohort studies, not Case-Control studies.

Question 1

What is the term for the initial number of individuals in a life table cohort?

Accepted Answer

Cohort size

Answer

Radix

Answer

Radius

Answer

Origin

Question 2

Mean hemoglobin values are compared between two independent population groups. What is the most appropriate statistical test to use?

Accepted Answer

Unpaired T-test

Answer

Paired T-test

Answer

Chi-square test

Answer

Fisher's exact test

Question 3

Two groups are tested for anemia. Which statistical test should be used?

Accepted Answer

Chi-square test

Answer

Paired T-test

Answer

Unpaired T-test

Answer

ANOVA

Question 4

A test to measure blood pressure has successive readings for the same person as follows: 110/70 mmHg, 128/80 mmHg, 132/70 mmHg, 160/90 mmHg. The given test has a mean blood pressure of 120/80 mmHg. What can be concluded about the test's validity and reliability?

Accepted Answer

Low validity, Low reliability

Answer

Low validity, high reliability

Answer

High validity, low reliability

Answer

High validity, high reliability

Question 5

What is the current neonatal mortality rate per 1000 live births?

Accepted Answer

25/1000 live births

Answer

34/1000 live births

Answer

33/1000 live births

Answer

None of the above

Question 6

General fertility rate is a better measure of fertility than the crude birth rate because the denominator includes which of the following?

Accepted Answer

Females aged 15-45 years

Answer

Midyear population

Answer

Total female population

Answer

Married female population

Question 7

Which of the following is an example of a nominal variable?

Accepted Answer

Roll number

Answer

Temperature

Answer

TNM staging

Answer

Mid-arm circumference

Question 8

The Kaplan-Meier method is used for estimating:

Accepted Answer

Survival

Answer

Incidence

Answer

Frequency

Answer

Prevalence

Question 9

Calculate the mean and mode of the following set of values: 2, 2, 3, 4, 4, 4, 4, 5, 5, 7, 8, 8, 9.

Accepted Answer

Mean: 5, Mode: 4

Answer

Mean: 4, Mode: 5

Answer

Mean: 5, Mode: 9

Answer

Mean: 9, Mode: 5

Question 10

In a village of 1 lakh population, among 20,000 exposed to smoking, 200 developed cancer, and among 40,000 unexposed individuals, 40 developed cancer. What is the relative risk of smoking in the development of cancer?

Accepted Answer

10

Answer

20

Answer

5

Answer

15

Biostatistics — MCQs

Biostatistics — MCQs

On this page

Practice by Chapter

Want unlimited practice?