All of the following are tests for the degree of closeness of a measured or calculated quantity to its actual or true value, except:
In what interval is the sample registration system conducted?
What is true about a confounding factor?
What value of the correlation coefficient signifies a strong correlation?
The response which is graded by an observer on an agree or disagree continuum is based on which type of scale?
Which statistical test is used to determine the association between two variables?
For the given distribution of weights of a group of students, which measure of central tendency is most appropriate?

A study provides the following test results for a disease: Present (40 +ve, 10 -ve), Absent (225 +ve, 225 -ve). What is the sensitivity of this study?
The Sample Registration System (SRS) is a:
Trends can be represented by?
Explanation: In biostatistics and quality control, it is crucial to distinguish between **Accuracy** and **Precision**. **1. Why "Range Chart" is the correct answer:** The question asks for tests that measure the closeness to the "true value," which is the definition of **Accuracy**. A **Range Chart (R-chart)** measures the difference between the highest and lowest values in a sample. It is a measure of **dispersion or variability**, which relates to **Precision** (the consistency of results), not accuracy. It does not indicate how close the results are to the actual target value. **2. Explanation of incorrect options:** * **Mean Chart ($\bar{X}$ chart):** This tracks the average of samples over time. Since the mean is compared against the target/true value, it is a primary tool for monitoring **Accuracy**. * **Levey-Jennings (LJ) Chart:** Widely used in clinical laboratories, this chart plots control data against established mean and standard deviation limits. It is used to detect both systematic errors (loss of accuracy) and random errors. * **Shewhart Control Chart:** This is the umbrella term for quality control charts (including Mean and Range charts). In medical statistics, they are used to ensure a process remains within "statistical control" relative to its true intended value. **Clinical Pearls for NEET-PG:** * **Accuracy:** Closeness to the "True Value." (Measured by Mean, LJ charts). * **Precision:** Closeness of repeated measurements to "Each Other." (Measured by Range, Standard Deviation, Coefficient of Variation). * **Systematic Error:** Affects Accuracy (Shift in Mean). * **Random Error:** Affects Precision (Increased Range/SD). * **Westgard Rules:** These are the specific criteria used to interpret LJ charts to decide if an analytical run is in control.
Explanation: The **Sample Registration System (SRS)** is a large-scale demographic survey in India that provides reliable annual estimates of birth rates, death rates, and other fertility/mortality indicators. ### **Explanation of the Correct Answer** The SRS utilizes a **Dual Record System** to ensure data accuracy. This involves: 1. **Continuous Enumeration:** A local resident enumerator (usually a teacher or Anganwadi worker) records births and deaths as they occur. 2. **Retrospective Survey:** Every **6 months**, a full-time supervisor conducts an independent retrospective survey of the households. The data from both sources are matched, and any discrepancies are field-verified. This **half-yearly (6-month) interval** is the hallmark of the SRS, designed to minimize recall bias while maintaining a continuous flow of vital statistics between decennial censuses. ### **Why Other Options are Incorrect** * **B (1 year):** While the SRS publishes *annual* reports, the actual field verification and data collection cycle occur every six months. * **C (2 years):** This interval is not used in any major Indian vital statistic system; it would lead to significant recall bias. * **D (5 years):** This is the interval for the **National Family Health Survey (NFHS)**, which provides more detailed health and nutrition data but is not a continuous registration system like the SRS. ### **High-Yield NEET-PG Pearls** * **Gold Standard:** SRS is considered the most reliable source of **Infant Mortality Rate (IMR)** and **Maternal Mortality Ratio (MMR)** in India. * **Initiation:** It was started on a pilot basis in 1964-65 and became fully operational in **1969-70**. * **Authority:** It is conducted by the **Office of the Registrar General of India (RGI)**, Ministry of Home Affairs. * **Civil Registration System (CRS):** Unlike SRS (which is sample-based), CRS is the legal, continuous recording of all births and deaths, but it suffers from under-reporting in many states.
Explanation: ### Explanation A **confounding factor** is an "extraneous" variable that distorts the true relationship between an exposure and an outcome. To be a confounder, a variable must meet three criteria: it must be associated with the exposure, be a risk factor for the disease independently, and not be an intermediate step in the causal pathway. **Why Option B is Correct:** A confounder must be an **independent risk factor** for the disease. For example, in a study looking at the link between coffee consumption (exposure) and pancreatic cancer (outcome), smoking is a confounder because smoking itself is a known risk factor for pancreatic cancer, regardless of coffee intake. **Analysis of Incorrect Options:** * **Option A:** If a factor is distributed equally between the study and control groups, its effect is neutralized, and it ceases to act as a confounder. Confounding occurs specifically because the factor is *unequally* distributed. * **Option C:** Selecting a small group does not eliminate confounding; in fact, small sample sizes increase the risk of "random confounding." Confounding is eliminated through design (Randomization, Restriction, Matching) or analysis (Stratification, Multivariate analysis). * **Option D:** A confounder must be associated with **both** the exposure and the disease, not just one of them. If it is only associated with the exposure but doesn't cause the disease, it cannot distort the outcome. **High-Yield Clinical Pearls for NEET-PG:** * **Randomization** is the best method to control for both known and unknown confounders. * **Matching** is commonly used in Case-Control studies to eliminate confounding. * **Simpson’s Paradox:** A phenomenon where a trend appears in different groups of data but disappears or reverses when these groups are combined, often due to a confounding variable. * **Distinction:** Unlike a confounder, an **Effect Modifier** (Interaction) is not a nuisance but a biological phenomenon that should be described, not eliminated.
Explanation: **Explanation:** The **Correlation Coefficient (Pearson’s ‘r’)** is a statistical measure used to quantify the strength and direction of a linear relationship between two continuous variables (e.g., the relationship between BMI and Blood Pressure). **1. Why Option B is Correct:** The value of 'r' ranges strictly from **-1 to +1**. * A value of **1** (or -1) signifies a **perfect/strongest possible correlation**, where all data points lie exactly on a straight line. * As the value approaches 1, the strength of the relationship increases. In medical research, values >0.7 are generally considered "strong." **2. Why Other Options are Incorrect:** * **Option A (Zero):** A correlation coefficient of 0 indicates **no linear relationship** between the variables. * **Option C (Less than 1):** While values like 0.8 or 0.9 are strong, they are mathematically "weaker" than 1. Additionally, this range includes values near zero (e.g., 0.1), which represent very weak correlations. * **Option D (More than 1):** This is statistically **impossible**. The correlation coefficient cannot exceed +1 or be less than -1. **Clinical Pearls & High-Yield Facts for NEET-PG:** * **Directionality:** A positive sign (+) means both variables move in the same direction; a negative sign (-) means they move in opposite directions (e.g., Exercise vs. Resting Heart Rate). * **Coefficient of Determination ($r^2$):** This represents the proportion of variance in one variable explained by the other. If $r = 0.7$, then $r^2 = 0.49$ (49% of the variance is explained). * **Scatter Diagram:** This is the visual method used to represent correlation. A straight line at 45° indicates $r = 1$. * **Limitation:** Correlation does **not** imply causation.
Explanation: ### Explanation **1. Why Likert Scale is Correct:** The **Likert Scale** is a psychometric scale commonly used in health research to measure attitudes, beliefs, or opinions. It typically presents a statement and asks the respondent to choose from a continuum of fixed responses, most commonly a 5-point or 7-point scale (e.g., *Strongly Disagree, Disagree, Neutral, Agree, Strongly Agree*). It is the gold standard for assessing responses on an "agree-disagree" continuum. **2. Why Other Options are Incorrect:** * **Visual Analog Scale (VAS):** This is a continuous scale (usually a 10 cm line) where the patient marks a point representing their state (e.g., pain intensity from "no pain" to "worst pain"). It does not use discrete "agree/disagree" categories. * **Guttman Scale (Cumulative Scale):** This scale consists of a series of statements arranged in a hierarchical order. If a respondent agrees with a higher-intensity statement, it is assumed they agree with all lower-intensity statements preceding it. * **Adjective Scale:** This uses a list of adjectives (e.g., "happy," "tired," "anxious") to describe a state, rather than a continuum of agreement with a specific statement. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Types of Data:** Likert scales produce **Ordinal Data** (data with a natural order but where the distance between intervals is not necessarily equal). * **Central Tendency:** For Likert scales, the **Median** or **Mode** is the most appropriate measure of central tendency, not the Mean. * **Qualitative vs. Quantitative:** While the responses are qualitative, they are often assigned numerical values (1-5) for statistical coding. * **Memory Aid:** Remember **L**ikert = **L**evels of agreement.
Explanation: ### Explanation **1. Why Chi-square test is correct:** The **Chi-square ($\chi^2$) test** is a non-parametric test used to determine if there is a significant **association** between two **categorical (qualitative)** variables. In medical research, it is frequently used to compare proportions. For example, determining if the incidence of a disease (Yes/No) is associated with a risk factor like smoking (Yes/No). It tests the "null hypothesis" that there is no relationship between the variables. **2. Why other options are incorrect:** * **Correlation (Option B):** While correlation measures the strength and direction of a relationship between two variables, it is specifically used for **quantitative (numerical)** data (e.g., height and weight). It does not determine "association" in the categorical sense used in contingency tables. * **Regression (Option C):** Regression is used to **predict** the value of a dependent variable based on the value of an independent variable. It quantifies the functional relationship rather than just testing for the presence of an association. * **None of the above (Option D):** Incorrect, as Chi-square is the standard test for association between categorical variables. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Type of Data:** Chi-square = Qualitative/Categorical data; T-test/ANOVA = Quantitative data. * **Yates’ Correction:** Applied to a $2 \times 2$ Chi-square table when any cell frequency is less than 5. * **McNemar’s Test:** A variation of the Chi-square test used for **paired** data (e.g., comparing results in the same patient before and after treatment). * **Null Hypothesis ($H_0$):** Chi-square assumes $H_0$ is true (no association). If $p < 0.05$, we reject $H_0$ and conclude a significant association exists.
Explanation: ***Mean*** - For a **normal/symmetric distribution** like student weights, the mean provides the most **accurate representation** of central tendency as it uses all data points. - In a **bell-shaped distribution**, mean, median, and mode are approximately equal, making mean the **preferred choice** due to its mathematical properties. *Median* - While median is **robust to outliers**, it's not the most appropriate for **symmetric distributions** where extreme values are naturally distributed. - Median is preferred for **skewed distributions** or when data contains significant outliers that could distort the mean. *Mode* - Mode identifies the **most frequently occurring value** but may not represent the central tendency well in continuous data like weights. - Most useful for **categorical data** or discrete variables, less meaningful for **continuous measurements** like body weight. *All of the above* - Though all measures can be calculated, the **mean is specifically most appropriate** for normally distributed continuous data. - Different measures serve different purposes: mean for **normal distributions**, median for **skewed data**, and mode for **categorical variables**.
Explanation: ### Explanation **1. Why the correct answer is right:** Sensitivity is the ability of a test to correctly identify those with the disease (True Positives). To calculate sensitivity, we use the formula: **Sensitivity = [True Positives (TP) / (True Positives + False Negatives (FN))] × 100** From the data provided: * **Disease Present:** 40 tested positive (TP), 10 tested negative (FN). Total diseased = 50. * **Disease Absent:** 225 tested positive (FP), 225 tested negative (TN). Total healthy = 450. Applying the formula: Sensitivity = [40 / (40 + 10)] × 100 Sensitivity = [40 / 50] × 100 = **80%**. **2. Why the incorrect options are wrong:** * **Option A (40):** This represents the absolute number of True Positives, not the percentage (sensitivity). * **Option B (20):** This is the False Negative Rate (10/50 × 100 = 20%). Sensitivity and False Negative Rate are complementary (Sensitivity + FNR = 100%). * **Option D (50):** This represents the Specificity of the test [TN / (TN + FP)], calculated as [225 / (225 + 225)] × 100 = 50%. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Sensitivity (True Positive Rate):** Crucial for **screening tests** (e.g., ELISA for HIV) because it minimizes False Negatives. A highly sensitive test, when negative, helps **Rule Out** the disease (**SNOUT**). * **Specificity (True Negative Rate):** Crucial for **confirmatory tests** (e.g., Western Blot) because it minimizes False Positives. A highly specific test, when positive, helps **Rule In** the disease (**SPIN**). * **Predictive Values:** Unlike sensitivity/specificity, Positive and Negative Predictive Values are heavily dependent on the **prevalence** of the disease in the population.
Explanation: **Explanation:** The **Sample Registration System (SRS)** is a large-scale demographic survey in India used to provide reliable annual estimates of birth rate, death rate, and other fertility/mortality indicators. **Why Option A is Correct:** The SRS is fundamentally a **Dual Record System**. It employs two independent methods of data collection to ensure accuracy and minimize under-reporting: 1. **Continuous Enumeration:** A resident part-time enumerator (usually a teacher or Anganwadi worker) records births and deaths as they occur in a specific sample unit. 2. **Retrospective Half-yearly Survey:** An independent supervisor visits the same sample unit every six months to conduct a fresh survey. The data from both sources are then **matched**, and any discrepancies are field-verified to arrive at a final unduplicated count. **Why Other Options are Incorrect:** * **Option B:** While SRS provides annual estimates, it is not a simple "survey conducted every year." It is a continuous longitudinal process combined with biannual surveys. * **Option C:** SRS uses **Stratified Multi-stage Random Sampling**, not quota sampling. Quota sampling is a non-probability sampling method, which would not yield representative national data. * **Option D:** The sample size for SRS is primarily determined based on the **Infant Mortality Rate (IMR)** and crude birth rate, not the Maternal Mortality Ratio (MMR). **High-Yield Pearls for NEET-PG:** * **Gold Standard:** SRS is considered the most reliable source of vital statistics in India (more accurate than the Civil Registration System). * **Nodal Agency:** It is conducted by the **Office of the Registrar General of India (RGI)**. * **Initiation:** It was started on a pilot basis in 1964-65 and became fully operational in 1969-70. * **Key Data:** It is the primary source for calculating **IMR, MMR, and Total Fertility Rate (TFR)** in India.
Explanation: **Explanation:** In biostatistics, the choice of a graphical representation depends on the type of data and the objective of the study. **Why Line Diagram is Correct:** A **Line Diagram** (or line graph) is the most appropriate tool for representing **trends over time** (time-series data). It connects individual data points with lines, allowing for the visualization of fluctuations, increases, or decreases in a variable (e.g., maternal mortality rate or disease incidence) over a continuous period. **Analysis of Incorrect Options:** * **Bar Chart:** Used for representing **discrete, qualitative, or nominal data** (e.g., number of hospital beds in different cities). It compares categories rather than showing a continuous trend. * **Histogram:** Used for **continuous quantitative data** to show frequency distribution. Unlike a bar chart, there are no gaps between the bars. It represents the distribution of a single variable (e.g., age distribution of a population), not a trend over time. * **Pie Chart:** Used to show the **proportional distribution** of different components of a whole at a single point in time (e.g., causes of death in a specific year). It does not show changes over time. **High-Yield Clinical Pearls for NEET-PG:** * **Trend Analysis:** Always look for "Line Diagram" or "Run Chart" when the question mentions time, years, or trends. * **Frequency Polygon:** Created by joining the midpoints of the tops of the bars in a histogram; also used for frequency distributions. * **Scatter Diagram:** Used to show the **correlation** (relationship) between two continuous variables. * **Ogive:** A graph representing cumulative frequency.
Collection and Presentation of Data
Practice Questions
Measures of Central Tendency
Practice Questions
Measures of Dispersion
Practice Questions
Normal Distribution
Practice Questions
Sampling Methods
Practice Questions
Sample Size Calculation
Practice Questions
Hypothesis Testing
Practice Questions
Tests of Significance
Practice Questions
Correlation and Regression
Practice Questions
Survival Analysis
Practice Questions
Multivariate Analysis
Practice Questions
Statistical Software in Research
Practice Questions
Get full access to all questions, explanations, and performance tracking.
Start For Free