What is the term for the initial number of individuals in a life table cohort?
Mean hemoglobin values are compared between two independent population groups. What is the most appropriate statistical test to use?
Two groups are tested for anemia. Which statistical test should be used?
A test to measure blood pressure has successive readings for the same person as follows: 110/70 mmHg, 128/80 mmHg, 132/70 mmHg, 160/90 mmHg. The given test has a mean blood pressure of 120/80 mmHg. What can be concluded about the test's validity and reliability?
What is the current neonatal mortality rate per 1000 live births?
General fertility rate is a better measure of fertility than the crude birth rate because the denominator includes which of the following?
Which of the following is an example of a nominal variable?
The Kaplan-Meier method is used for estimating:
Calculate the mean and mode of the following set of values: 2, 2, 3, 4, 4, 4, 4, 5, 5, 7, 8, 8, 9.
In a village of 1 lakh population, among 20,000 exposed to smoking, 200 developed cancer, and among 40,000 unexposed individuals, 40 developed cancer. What is the relative risk of smoking in the development of cancer?
Explanation: ### Explanation **Correct Answer: D. Cohort size** In biostatistics and demography, a **Life Table** is a statistical tool used to track the mortality experience of a specific group of individuals. The **Cohort size** refers to the total number of individuals present at the beginning of the study (Age 0). This group is followed until the last member dies to calculate life expectancy and survival rates. **Analysis of Options:** * **A. Radix:** While often used interchangeably in general discussion, the "Radix" specifically refers to the **arbitrary fixed number** (usually 100,000 or 1,000) assigned as the starting population in a hypothetical life table to simplify calculations. While "Cohort size" is the literal term for the initial number of individuals, "Radix" is the mathematical constant used for standardization. * **B. Radius:** This is a geometric term and has no relevance to life tables or demographic statistics. * **C. Origin:** In statistics, the origin refers to the starting point $(0,0)$ on a graph. While the life table starts at "Age 0," the term does not describe the number of individuals. **High-Yield Facts for NEET-PG:** * **Types of Life Tables:** 1. **Cohort (Current) Life Table:** Based on the actual mortality experience of a birth cohort. 2. **Period (Abridged) Life Table:** Based on the mortality rates of a population at a specific point in time (more commonly used). * **Expectation of Life ($e_x$):** The average number of additional years a person is expected to live if current mortality patterns continue. * **$l_x$:** Represents the number of persons living at the beginning of a specific age interval. * **$q_x$:** Represents the probability of dying between age $x$ and $x+1$.
Explanation: To select the correct statistical test, we must evaluate the **type of data** and the **number of groups** being compared. ### 1. Why Unpaired T-test is Correct The question involves comparing **Mean Hemoglobin values**, which is **quantitative (numerical/continuous) data**. These values are being compared between **two independent population groups** (e.g., males vs. females or Group A vs. Group B). * The **Unpaired T-test** (also known as the Independent Samples T-test) is specifically designed to compare the means of two independent groups when the data follows a normal distribution. ### 2. Why Other Options are Incorrect * **Paired T-test:** Used for quantitative data when the two sets of observations are related (e.g., "Before and After" treatment measurements in the same individual). * **Chi-square test:** Used for **qualitative (categorical) data** to compare proportions or associations (e.g., comparing the percentage of anemic vs. non-anemic patients between two groups). * **Fisher's exact test:** Also used for qualitative data, specifically when the sample size is small (e.g., any cell value in a 2x2 table is less than 5). ### 3. NEET-PG High-Yield Pearls * **Rule of Thumb:** If the data is **Quantitative**, use a **T-test** (2 groups) or **ANOVA** (>2 groups). If the data is **Qualitative**, use **Chi-square**. * **Z-test:** Used instead of a T-test if the sample size is large (**n > 30**). * **Non-parametric equivalent:** If the data is not normally distributed, the **Mann-Whitney U test** is the non-parametric alternative to the Unpaired T-test.
Explanation: ### Explanation The key to selecting the correct statistical test lies in identifying the **type of data** being analyzed. **Why Chi-square test is correct:** In this scenario, "anemia" is a **qualitative (categorical) variable**. When testing for anemia, individuals are classified into categories: "Anemic" or "Non-anemic." When comparing two independent groups (e.g., Group A vs. Group B) based on a categorical outcome, we use a **contingency table** (2x2 table) and apply the **Chi-square test**. It assesses the significance of the difference between observed and expected frequencies. **Why the other options are incorrect:** * **Paired T-test:** Used for **quantitative (numerical)** data when comparing means of the *same* group at two different times (e.g., hemoglobin levels before and after treatment). * **Unpaired (Independent) T-test:** Used for **quantitative** data when comparing the means of *two* different groups (e.g., comparing the mean Hemoglobin value in g/dL between Group A and Group B). * **ANOVA (Analysis of Variance):** Used for **quantitative** data when comparing the means of *three or more* groups. **Clinical Pearls for NEET-PG:** * **Categorical Data (Proportions/Percentages) →** Chi-square test. * **Numerical Data (Means/SD) →** T-test (for 2 groups) or ANOVA (for >2 groups). * If the sample size is very small (any cell value in a 2x2 table is <5), use **Fisher’s Exact Test** instead of Chi-square. * **Correlation coefficient (r)** measures the strength of a linear relationship between two numerical variables, while **Regression** predicts the value of one variable based on another.
Explanation: ### Explanation **1. Why Option A is Correct (Understanding the Concepts)** * **Reliability (Precision/Consistency):** Reliability refers to the ability of a test to yield consistent results when repeated under the same conditions. In this case, the readings (110/70, 128/80, 132/70, 160/90) show a **wide variation**. Because the results are scattered and inconsistent for the same individual, the test has **low reliability**. * **Validity (Accuracy):** Validity refers to how close the test results are to the "true value" or the gold standard. The question states the true mean blood pressure is 120/80 mmHg. The provided readings are significantly higher or lower than this mean (e.g., 160/90 vs. 120/80). Since the test fails to accurately reflect the true value, it has **low validity**. **2. Why Other Options are Incorrect** * **Option B (Low validity, high reliability):** This would occur if the readings were consistent but wrong (e.g., all readings were exactly 150/90). Here, the readings are neither consistent nor accurate. * **Option C (High validity, low reliability):** While the *average* of scattered readings might sometimes hit the true mean, high validity generally requires the test to be consistently close to the truth. In clinical practice, a test with such high variance cannot be considered valid. * **Option D (High validity, high reliability):** This would require the readings to be both tightly clustered together and very close to 120/80 mmHg. **3. Clinical Pearls & High-Yield Facts for NEET-PG** * **Reliability vs. Validity:** Think of a dartboard. * **Reliable but not Valid:** Darts are clustered together but far from the bullseye (Systematic Error). * **Valid and Reliable:** Darts are clustered in the bullseye. * **Neither:** Darts are scattered everywhere (Random Error). * **Key Relationship:** A test can be reliable without being valid, but a test **cannot be valid if it is not reliable**. * **Evaluation Metrics:** Reliability is measured by the **Kappa Coefficient** (for qualitative data) or **Intraclass Correlation** (for quantitative data). Validity is measured by **Sensitivity and Specificity**. * **Bias:** Systematic error affects validity; Random error affects reliability.
Explanation: ### Explanation **Correct Answer: A. 25/1000 live births** The **Neonatal Mortality Rate (NMR)** is defined as the number of deaths of infants under 28 days of age per 1,000 live births. According to the latest **Sample Registration System (SRS) Bulletin (2020)**, the NMR in India has declined to **20 per 1,000 live births**. However, in the context of standard NEET-PG questions based on the **NFHS-5 (National Family Health Survey 2019-21)** data, the national average is recorded as **24.9 per 1,000 live births**, which is rounded to **25**. **Analysis of Incorrect Options:** * **Option B (34/1000):** This figure is closer to the current **Infant Mortality Rate (IMR)** in India, which is 35 per 1,000 live births (NFHS-5) or 28 per 1,000 live births (SRS 2020). * **Option C (33/1000):** This was the NMR recorded in earlier surveys (around 2011-2013). It represents an outdated statistic and does not reflect the progress made under the National Health Mission (NHM). **High-Yield Clinical Pearls for NEET-PG:** * **NMR Components:** It is divided into Early Neonatal Mortality (0-7 days) and Late Neonatal Mortality (7-28 days). Early neonatal deaths contribute to nearly 75% of total neonatal deaths. * **Leading Causes:** The primary causes of neonatal mortality in India are **Prematurity/Low Birth Weight (35%)**, followed by Neonatal Infections (Sepsis) and Birth Asphyxia. * **SDG Target:** The Sustainable Development Goal (SDG 3.2) aims to reduce NMR to at least **12 per 1,000 live births** by 2030. * **Indicator of Care:** NMR is a sensitive indicator of the quality of antenatal and intrapartum care.
Explanation: ### Explanation **1. Why Option A is Correct:** The **General Fertility Rate (GFR)** is considered a superior measure to the Crude Birth Rate (CBR) because it restricts the denominator to the population actually "at risk" of childbirth. While the CBR uses the entire population, the GFR uses the **total number of females in the reproductive age group (15–44 or 15–49 years)**. By excluding children, the elderly, and males—who cannot biologically conceive—the GFR provides a more accurate reflection of a community's fertility potential. **2. Why Other Options are Incorrect:** * **Option B (Midyear population):** This is the denominator for the **Crude Birth Rate (CBR)**. It is a weak indicator because it includes individuals who are not physiologically capable of bearing children (males, children, and post-menopausal women). * **Option C (Total female population):** This is rarely used as a standalone denominator in fertility because it includes age groups (infants and the elderly) that do not contribute to births, thus diluting the rate. * **Option D (Married female population):** This is the denominator for the **General Marital Fertility Rate (GMFR)**. While specific, it excludes births occurring outside of wedlock, making it a measure of marital fertility rather than overall biological fertility. **3. High-Yield NEET-PG Pearls:** * **Hierarchy of Fertility Indicators:** Total Fertility Rate (TFR) > General Fertility Rate (GFR) > Crude Birth Rate (CBR). * **Total Fertility Rate (TFR):** The average number of children a woman would have if she experiences current age-specific fertility rates through her reproductive years. It is the best indicator of overall fertility. * **Replacement Level Fertility:** A TFR of **2.1** is considered the replacement level (where a population exactly replaces itself). * **Net Reproduction Rate (NRR):** The number of *daughters* a newborn girl will bear. An **NRR of 1** is the demographic goal for population stabilization.
Explanation: **Explanation:** In biostatistics, variables are classified based on their level of measurement. A **Nominal variable** is a type of qualitative (categorical) data where numbers or names are used solely as labels to identify or categorize items. There is no inherent numerical value, order, or rank associated with these labels. * **Why Roll Number is correct:** Although a roll number consists of digits, it is a **Nominal variable**. It serves only as a unique identifier for a student. You cannot perform meaningful mathematical operations on it (e.g., adding two roll numbers is meaningless), nor does a higher roll number imply "more" of a specific attribute. Other examples include Gender, Blood Group, and Marital Status. **Analysis of Incorrect Options:** * **Temperature (Option A):** This is a **Scale/Interval variable**. It is quantitative data where the distance between points is equal, but there is no "absolute zero" (0°C does not mean absence of heat). * **TNM Staging (Option C):** This is an **Ordinal variable**. While it is categorical, there is a clear, inherent rank or progression (Stage II is more advanced than Stage I). * **Mid-arm circumference (Option D):** This is a **Ratio variable** (Continuous quantitative data). It has a defined absolute zero and can be measured in decimals. **High-Yield Clinical Pearls for NEET-PG:** * **NOIR Hierarchy:** Remember the hierarchy from simplest to most complex: **N**ominal < **O**rdinal < **I**nterval < **R**atio. * **Qualitative Data:** Includes Nominal and Ordinal. * **Quantitative Data:** Includes Interval and Ratio (Discrete or Continuous). * **Visual Aid:** If you can rank the data, it's **Ordinal**; if it's just a name/label, it's **Nominal**.
Explanation: **Explanation:** The **Kaplan-Meier method** (also known as the product-limit method) is a non-parametric statistic used to estimate the **survival function** from time-to-event data. In medical research, it is the gold standard for analyzing "time to death" or "time to a specific clinical event" (like relapse or recovery). **Why Survival is Correct:** The method calculates the probability of an event occurring at specific time intervals. Its unique strength is handling **censored data**—cases where the event hasn't happened yet by the end of the study or the patient is lost to follow-up. The results are typically visualized using a **Kaplan-Meier Curve**, which displays a characteristic "step-ladder" pattern. **Why Other Options are Incorrect:** * **Incidence:** Refers to the number of new cases in a population over a period. It is calculated using simple proportions or person-time rates, not survival analysis. * **Prevalence:** Refers to the total number of existing cases (old + new) at a specific point in time. It is a cross-sectional measure. * **Frequency:** A general term for the count or occurrence of a variable; it does not account for the "time-to-event" dimension required for Kaplan-Meier analysis. **High-Yield Clinical Pearls for NEET-PG:** * **Log-Rank Test:** This is the statistical test used to compare two different Kaplan-Meier survival curves (e.g., Treatment vs. Placebo). * **Hazard Ratio:** Often reported alongside survival curves to indicate the relative risk of the event occurring in one group versus another. * **Median Survival Time:** The time at which 50% of the study subjects are still alive; it is easily identified on a Kaplan-Meier plot.
Explanation: **Explanation:** In Biostatistics, measures of central tendency are essential for summarizing medical data. This question tests the ability to calculate the **Mean** (arithmetic average) and identify the **Mode** (most frequent value). **1. Calculation of Mean:** The mean is calculated by the formula: $\text{Mean} = \frac{\sum X}{n}$ (Sum of all observations / Total number of observations). * Sum ($\sum X$): $2+2+3+4+4+4+4+5+5+7+8+8+9 = 65$ * Total count ($n$): $13$ * Mean: $65 / 13 = \mathbf{5}$ **2. Identification of Mode:** The mode is the value that appears most frequently in a data set. * Frequency of 2: (2 times) * Frequency of 3: (1 time) * **Frequency of 4: (4 times)** * Frequency of 5: (2 times) * Frequency of 7: (1 time) * Frequency of 8: (2 times) * Frequency of 9: (1 time) * Since 4 appears most often, the **Mode is 4**. **Analysis of Options:** * **Option B (Correct):** Correctly identifies Mean as 5 and Mode as 4. * **Option A:** Incorrect; 5 is the mean, not the mode. * **Option C:** Incorrect; 9 is the maximum value, not the mode. * **Option D:** Incorrect; 9 is the maximum value, not the mean. **High-Yield Clinical Pearls for NEET-PG:** * **Mean:** Most sensitive measure of central tendency but highly influenced by **outliers** (extreme values). * **Median:** The best measure of central tendency for **skewed data**. * **Mode:** The only measure that can be used for **nominal (qualitative) data** (e.g., most common blood group). * **Relationship:** In a perfectly symmetrical (Normal) distribution: **Mean = Median = Mode**.
Explanation: ### Explanation **1. Understanding the Correct Answer (B: 10)** Relative Risk (RR) is the ratio of the incidence of a disease among the exposed group to the incidence of the disease among the unexposed group. It is the primary measure of association in **Cohort Studies**. * **Incidence among exposed ($I_e$):** $\frac{\text{Number of cases in exposed}}{\text{Total exposed}} = \frac{200}{20,000} = 0.01$ (or 10 per 1000) * **Incidence among unexposed ($I_u$):** $\frac{\text{Number of cases in unexposed}}{\text{Total unexposed}} = \frac{40}{40,000} = 0.001$ (or 1 per 1000) * **Relative Risk (RR):** $\frac{I_e}{I_u} = \frac{0.01}{0.001} = \mathbf{10}$ This means smokers are 10 times more likely to develop cancer compared to non-smokers. **2. Why Other Options are Incorrect** * **Option A (20):** This would occur if the incidence in the exposed group was doubled or the unexposed incidence was halved. It overestimates the association. * **Option C (5):** This would be the result if the number of cases in the exposed group was only 100 instead of 200. * **Option D (15):** This is a mathematical error, likely arising from subtracting incidences rather than dividing them (Attributable Risk calculation error). **3. NEET-PG High-Yield Pearls** * **Relative Risk (RR):** Measures the **strength of association**. RR > 1 indicates a positive association (risk factor); RR = 1 indicates no association. * **Attributable Risk (AR):** $(I_e - I_u) / I_e \times 100$. It indicates the amount of disease that can be prevented if the exposure is eliminated. * **Odds Ratio (OR):** Used in **Case-Control studies** as an estimate of RR. * **Incidence** can only be calculated in Cohort studies, not Case-Control studies.
Collection and Presentation of Data
Practice Questions
Measures of Central Tendency
Practice Questions
Measures of Dispersion
Practice Questions
Normal Distribution
Practice Questions
Sampling Methods
Practice Questions
Sample Size Calculation
Practice Questions
Hypothesis Testing
Practice Questions
Tests of Significance
Practice Questions
Correlation and Regression
Practice Questions
Survival Analysis
Practice Questions
Multivariate Analysis
Practice Questions
Statistical Software in Research
Practice Questions
Get full access to all questions, explanations, and performance tracking.
Start For Free