If the Total Fertility Rate (TFR) in a population is 4, what would be the approximate Gross Reproduction Rate (GRR)?
All of the following are true about cluster sampling except:
Disability Adjusted Life Year (DALY) is a measure of?
The estimated mean Hemoglobin (Hb) of 100 women is 10 g/dL, with a standard deviation of 1 g/dL. What is the standard error of the estimate?
What is the most common reason for a screening test to yield a high number of false positives?
A village is divided into 5 lanes, and then each lane is sampled randomly. This is an example of which type of sampling?
In a study comparing women with breast cancer to those without, 75 out of 100 cases used calcium supplements, while 25 out of 100 non-cases used calcium supplements. Calculate the cross-product ratio.
In the calculation of Years of Potential Life Lost (YPLL), what is the denominator used?
What is true about the reliability of a test?
A drug company is developing a new pregnancy-test kit for use on an outpatient basis. The company used the pregnancy test on 100 women who are known to be pregnant. Out of 100 women, 99 showed positive test. Upon using the same test on 100 non-pregnant women, 90 showed negative result. What is the sensitivity of the test?
Explanation: ### Explanation **1. Understanding the Correct Answer (A):** The **Gross Reproduction Rate (GRR)** is a specific subset of the **Total Fertility Rate (TFR)**. While TFR represents the average number of children (both male and female) a woman would have during her reproductive years, GRR represents only the average number of **female** children. Biologically, the secondary sex ratio at birth is approximately **105 males for every 100 females**. This means that roughly **48.8%** (approximately half) of all births are female. Therefore, the mathematical relationship is: $$\text{GRR} \approx \text{TFR} \times 0.488 \text{ (or roughly TFR} \div 2)$$ Given a TFR of 4, the GRR is $4 \times 0.488 \approx 1.95$, which rounds to **2**. **2. Analysis of Incorrect Options:** * **Option B (4):** This equals the TFR. GRR cannot equal TFR unless a population produces only female children, which is biologically impossible. * **Option C (8) & D (16):** These values are mathematically incorrect as GRR is always a fraction of the TFR, never a multiple of it. **3. High-Yield Clinical Pearls for NEET-PG:** * **Net Reproduction Rate (NRR):** Unlike GRR, NRR accounts for **maternal mortality**. It is the number of daughters a newborn girl will bear, assuming she is subject to current fertility and mortality rates. * **NRR = 1:** This is the demographic goal for population stabilization (Replacement Level Fertility). In India, this usually corresponds to a **TFR of 2.1**. * **Relationship:** $\text{TFR} > \text{GRR} > \text{NRR}$. * If NRR is 1, the population will eventually stop growing (Zero Population Growth).
Explanation: **Explanation** In Biostatistics, **Cluster Sampling** is a probability sampling technique where the population is divided into naturally occurring groups called "clusters" (e.g., villages, schools, or wards). **Why Option A is the correct answer (The False Statement):** In **Simple Random Sampling (SRS)**, the sampling unit is the individual, and every individual has an equal chance of being selected. In **Cluster Sampling**, the sampling unit is the cluster itself. The fundamental difference lies in homogeneity: * In Cluster Sampling, we want **heterogeneity within** the cluster (it should represent a "mini-population") and **homogeneity between** clusters. * In SRS, individuals are selected independently. Therefore, clusters are fundamentally different from the units in SRS in terms of selection logic and variance. **Analysis of Incorrect Options (True Statements):** * **Option B:** It is considered **rapid and simple** because it eliminates the need for a complete sampling frame (a list of every individual in the population), which is often impossible in large-scale field surveys. * **Option C:** The **sample size varies** because clusters (like villages) often have unequal numbers of people. Additionally, the "Design Effect" must be accounted for, often requiring a larger sample size than SRS to achieve the same statistical power. * **Option D:** It is a **probability sampling** method because clusters are selected using random techniques, ensuring every cluster has a known chance of selection. **High-Yield Pearls for NEET-PG:** * **WHO 30 x 7 Cluster Technique:** Originally used for the Expanded Programme on Immunization (EPI) to estimate vaccination coverage. It involves 30 clusters and 7 children per cluster (Total N=210). * **Primary Sampling Unit (PSU):** In cluster sampling, the PSU is the cluster (e.g., the village), not the individual. * **Design Effect:** Cluster sampling has more "sampling error" than SRS. To compensate, the sample size is usually multiplied by a factor (often 2).
Explanation: ### Explanation **Disability-Adjusted Life Year (DALY)** is a summary measure of population health used to quantify the **Global Burden of Disease**. It was developed by the World Bank and the WHO to move beyond simple mortality rates and account for the impact of non-fatal health conditions. **1. Why Option B is Correct:** DALY measures the gap between current health status and an ideal health situation where the entire population lives to an advanced age, free of disease and disability. It is calculated as the sum of two components: * **YLL (Years of Life Lost):** Due to premature mortality. * **YLD (Years Lived with Disability):** The time spent in states of less than full health. Thus, **1 DALY = 1 lost year of "healthy" life.** **2. Analysis of Incorrect Options:** * **Option A (Life Expectancy):** This is a measure of longevity (the average number of years a person is expected to live) and does not account for the quality of those years or the burden of disability. * **Option C (Quality of Life):** While related, Quality of Life is a subjective perception. The specific metric for this is **QALY (Quality-Adjusted Life Year)**, which measures the *benefit* of a medical intervention rather than the *burden* of a disease. * **Option D (Human Development):** This is measured by the **Human Development Index (HDI)**, which includes life expectancy, education (mean/expected years of schooling), and per capita income (GNI). **3. High-Yield Clinical Pearls for NEET-PG:** * **Formula:** $DALY = YLL + YLD$. * **YLL Calculation:** Number of deaths $\times$ standard life expectancy at age of death. * **YLD Calculation:** Number of incident cases $\times$ disability weight $\times$ average duration of the case until remission or death. * **Disability Weight:** Ranges from **0 (perfect health)** to **1 (death)**. * DALY is the primary indicator used in the **Global Burden of Disease (GBD) Study**.
Explanation: ### Explanation **Concept and Calculation:** The **Standard Error (SE)**, specifically the Standard Error of the Mean (SEM), measures the dispersion of sample means around the true population mean. It quantifies the precision of the estimate; a smaller SE indicates a more reliable estimate. The formula for Standard Error is: $$\text{SE} = \frac{\text{Standard Deviation (SD)}}{\sqrt{\text{Sample Size (n)}}}$$ Applying the values from the question: * Standard Deviation (SD) = 1 g/dL * Sample Size (n) = 100 * $\text{SE} = \frac{1}{\sqrt{100}} = \frac{1}{10} = \mathbf{0.1}$ **Analysis of Options:** * **Option D (0.1):** Correct. This is the result of dividing the SD by the square root of the sample size. * **Option A (0.001):** Incorrect. This would occur if the denominator was $n$ instead of $\sqrt{n}$ and then squared, or a simple decimal placement error. * **Option B (1):** Incorrect. This is the value of the Standard Deviation itself. SE is always smaller than SD (unless $n=1$). * **Option C (10):** Incorrect. This is the value of the Mean, which is irrelevant to the calculation of the Standard Error. **High-Yield Clinical Pearls for NEET-PG:** 1. **SD vs. SE:** Standard Deviation describes the **variability** within a single sample. Standard Error describes the **uncertainty** or precision of the sample mean compared to the population mean. 2. **Sample Size Relationship:** SE is inversely proportional to the square root of the sample size. To halve the SE, you must quadruple the sample size. 3. **Confidence Intervals (CI):** SE is used to calculate CI. For a 95% CI, the formula is $\text{Mean} \pm (1.96 \times \text{SE})$. In this case, the 95% CI would be $10 \pm 0.196$ g/dL. 4. **Standard Error of Proportion:** If the data is qualitative (e.g., prevalence), the formula changes to $\sqrt{pq/n}$.
Explanation: ### Explanation The correct answer is **D. Low prevalence of the disease in the population.** **Why the correct answer is right:** The number of false positives is directly related to the **Positive Predictive Value (PPV)** of a test. PPV is the probability that a person with a positive test result actually has the disease. PPV is heavily dependent on the **prevalence** of the disease in the population. When prevalence is low (e.g., screening for a rare cancer in the general population), the vast majority of people tested are healthy. Even a test with high specificity will inevitably misclassify a small percentage of these many healthy individuals as "positive." Because the actual number of diseased people is so small, these "false positives" will far outnumber the "true positives," leading to a low PPV. **Why the other options are wrong:** * **A. High specificity:** Specificity is the ability of a test to correctly identify those *without* the disease. High specificity actually **decreases** the number of false positives. * **B. High sensitivity:** Sensitivity is the ability to correctly identify those *with* the disease. While high sensitivity reduces false negatives, it does not inherently cause high false positives (that depends on specificity). * **C. High prevalence:** In a high-prevalence population, a positive test is much more likely to be a "true positive," thereby **increasing** the PPV and reducing the relative impact of false positives. **High-Yield NEET-PG Pearls:** * **Prevalence vs. Predictive Value:** As prevalence increases, PPV increases and NPV (Negative Predictive Value) decreases. * **Screening Strategy:** To minimize false positives in low-prevalence settings, clinicians often use a **sequential (two-stage) testing** strategy, where a highly sensitive test is followed by a highly specific confirmatory test. * **Fixed vs. Variable:** Sensitivity and Specificity are inherent properties of the test (they don't change with prevalence), whereas PPV and NPV are extrinsic properties (they change based on the population tested).
Explanation: **Explanation:** The correct answer is **Stratified Random Sampling**. In this scenario, the village population is divided into non-overlapping subgroups (lanes) called **strata**. The key characteristic of stratified sampling is that the population is divided into groups based on a specific characteristic (in this case, geographical location/lanes), and then a **random sample is drawn from each and every stratum**. This ensures that every lane is represented in the final sample, reducing sampling error and ensuring better representativeness of the entire village. **Why other options are incorrect:** * **Simple Random Sampling:** Here, every individual in the entire village would have an equal chance of being selected directly (e.g., using a random number table for the whole population) without first dividing them into lanes. * **Systematic Random Sampling:** This involves selecting individuals at fixed intervals (the $k^{th}$ unit) from a list, such as picking every $5^{th}$ house in the village after a random start. * **Cluster Sampling (Distinction):** Often confused with stratified sampling, in cluster sampling, the village would be divided into lanes, but only a *few* lanes would be randomly selected, and everyone within those selected lanes would be studied. In this question, *each* lane is sampled, which defines stratification. **High-Yield Pearls for NEET-PG:** * **Stratified Sampling:** Best when the population is **heterogeneous**; it ensures sub-group representation. * **Cluster Sampling:** Best when the population is large and widely dispersed; the unit of randomization is a "cluster" (e.g., a village or a school) rather than an individual. * **Multistage Sampling:** Used in large-scale national surveys (like NFHS), involving multiple levels of sampling (e.g., State → District → Village → Household).
Explanation: ### Explanation **1. Why Option A is Correct** The **Cross-Product Ratio** is another name for the **Odds Ratio (OR)**, which is the standard measure of association in a Case-Control study. It represents the ratio of the odds of exposure among cases to the odds of exposure among controls. To calculate this, we first arrange the data into a **2x2 Contingency Table**: | | Cases (Cancer) | Controls (No Cancer) | | :--- | :---: | :---: | | **Exposed (Calcium)** | 75 (a) | 25 (b) | | **Non-Exposed** | 25 (c) | 75 (d) | *Note: If 75/100 cases used supplements, 25 did not. If 25/100 controls used supplements, 75 did not.* **Formula:** $$\text{Odds Ratio} = \frac{a \times d}{b \times c}$$ $$\text{Calculation} = \frac{75 \times 75}{25 \times 25} = \frac{5625}{625} = \mathbf{9}$$ An OR of 9 indicates that the odds of exposure to calcium supplements were 9 times higher in women with breast cancer compared to those without. **2. Why Other Options are Incorrect** * **Options B (6), C (3), and D (12):** These are mathematical errors. They result from incorrectly setting up the 2x2 table (e.g., using the total 100 as a denominator instead of the non-exposed count) or failing to use the cross-multiplication method. **3. NEET-PG High-Yield Clinical Pearls** * **Study Design:** The Odds Ratio is used for **Case-Control studies**, while Relative Risk (RR) is used for **Cohort studies**. * **Interpretation:** * OR > 1: Positive association (Risk factor). * OR = 1: No association. * OR < 1: Negative association (Protective factor). * **Key Tip:** In the exam, always ensure you calculate the "non-exposed" cells ($c$ and $d$) by subtracting the exposed from the total before applying the formula. Do not use the total (100) in the cross-product.
Explanation: ### Explanation **Years of Potential Life Lost (YPLL)** is a measure of premature mortality that prioritizes deaths occurring at younger ages. It is calculated by subtracting the age at death from a predetermined "standard" age. **1. Why Option A is Correct:** In modern public health practice (specifically by the CDC and WHO), the standard age limit for calculating YPLL is typically set at **75 years**. Therefore, the denominator represents the **population under 75 years of age**, as this is the group "at risk" of dying before reaching the threshold. YPLL focuses on the social and economic burden of early death rather than just the number of deaths. **2. Why the Other Options are Incorrect:** * **Option B (Midyear Population):** This is the standard denominator for Crude Death Rate (CDR) and Specific Death Rates. It includes the entire population, whereas YPLL only considers those below the threshold age. * **Option C (15 to 65 years):** While 65 was historically used as the threshold for "retirement age," it is no longer the standard for YPLL. This range is more relevant to calculating the dependency ratio or economic productivity. * **Option D (Above 15 years):** This excludes children and infants. Since YPLL aims to highlight premature mortality, infant and child deaths contribute the most to the YPLL value; excluding them would defeat the purpose of the metric. **3. High-Yield Clinical Pearls for NEET-PG:** * **YPLL vs. DALY:** While YPLL measures **premature mortality** only, DALY (Disability-Adjusted Life Years) measures the total burden of disease (Mortality + Morbidity). * **Formula:** YPLL = $\sum$ (Standard Age - Age at Death before that age). * **Key Utility:** YPLL is highly sensitive to causes of death that affect younger populations, such as accidents, injuries, and congenital anomalies, whereas Crude Death Rates are dominated by chronic diseases of the elderly. * **Standard Age:** If "75" is not in the options, look for "65" or "70," as these were older standards, but **75** is the current preferred benchmark.
Explanation: **Explanation:** **Reliability** (also known as precision, consistency, or reproducibility) refers to the ability of a test to yield the same results when repeated under similar conditions. In biostatistics, it measures how free a test is from random error. If a test is reliable, repeated measurements on the same stable subject will produce consistent values. **Analysis of Options:** * **Option C (Correct):** This is the literal definition of reliability. It ensures that the results are stable over time and across different observers (inter-rater reliability). * **Option A:** Incorrect. Consistency and reproducibility are the *core* components of reliability; they are the primary focus, not a "non-problem." * **Option B:** Incorrect. While an investigator's skill can reduce observer bias, reliability is a property of the test/instrument itself. High reliability should ideally minimize the impact of an individual investigator's subjective knowledge. * **Option D:** Incorrect. This describes "Validity." Validity (accuracy) is the extent to which a test measures what it is actually intended to measure. **High-Yield NEET-PG Pearls:** 1. **Reliability vs. Validity:** A test can be reliable but not valid (e.g., a weighing scale that consistently shows 5kg extra is reliable but inaccurate). However, for a test to be highly valid, it must be reliable. 2. **Evaluation:** Reliability is measured using the **Kappa Coefficient** (for qualitative data) or **Intraclass Correlation Coefficient** (for quantitative data). 3. **Factors affecting Reliability:** * **Observer Variation:** Intra-observer (same person) vs. Inter-observer (different people). * **Biological Variation:** Changes in the parameter being measured (e.g., BP fluctuations). * **Instrument Error:** Faulty equipment.
Explanation: ### Explanation **1. Why the Correct Answer (B) is Right:** Sensitivity is defined as the ability of a test to correctly identify those **with the disease** (True Positives). It is calculated using the formula: $$\text{Sensitivity} = \frac{\text{True Positives (TP)}}{\text{Total Diseased (TP + FN)}} \times 100$$ In this scenario: * **Total pregnant women (Diseased):** 100 * **Positive results (True Positives):** 99 * **Sensitivity:** $(99 / 100) \times 100 = 99\%$ The test correctly identified 99% of the women who were actually pregnant. **2. Why the Incorrect Options are Wrong:** * **Option A (90%):** This represents the **Specificity** of the test. Specificity is the ability to correctly identify those **without the disease** (True Negatives). Here, 90 out of 100 non-pregnant women tested negative ($90/100 = 90\%$). * **Option C (Average):** Sensitivity and Specificity are independent properties of a diagnostic test. Averaging them has no statistical significance in evaluating test performance. * **Option D:** The data provided is sufficient. We have the "Gold Standard" status (known pregnant vs. non-pregnant) and the test results, which are the only requirements to build a 2x2 contingency table. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **SNOUT:** **S**ensitivity rules **OUT** a disease when the result is negative (useful for screening). * **SPIN:** **S**pecificity rules **IN** a disease when the result is positive (useful for confirmation). * **Sensitivity** is also known as the **True Positive Rate**. * **False Negative Rate** = $100 - \text{Sensitivity}$. In this case, it is 1%. * **False Positive Rate** = $100 - \text{Specificity}$. In this case, it is 10%.
Collection and Presentation of Data
Practice Questions
Measures of Central Tendency
Practice Questions
Measures of Dispersion
Practice Questions
Normal Distribution
Practice Questions
Sampling Methods
Practice Questions
Sample Size Calculation
Practice Questions
Hypothesis Testing
Practice Questions
Tests of Significance
Practice Questions
Correlation and Regression
Practice Questions
Survival Analysis
Practice Questions
Multivariate Analysis
Practice Questions
Statistical Software in Research
Practice Questions
Get full access to all questions, explanations, and performance tracking.
Start For Free