What is mean deviation?
Which of the following is NOT a measure of central tendency?
When a scale is described as good, satisfactory, and poor, what type of measurement scale is being used?
What is the median of the following set of values: 2, 5, 7, 10, 10, 13, 25?
All of the following statements regarding frequency distributions are true, except:
Which of the following is NOT a characteristic of a case-control study?
What is the odds ratio?
Which type of variable best describes the number of family members?
Perinatal mortality is expressed in terms of?
All of the following are non-parametric tests except?
Explanation: **Explanation:** **Mean Deviation** is a statistical tool used to quantify the spread or variability of data points around a central value (usually the mean). In Biostatistics, it is defined as the arithmetic average of the absolute differences (ignoring plus or minus signs) between each observation and the mean. **Why Option A is correct:** In medical research, we need to know how much individual values (e.g., blood pressure readings in a population) vary from the average. **Measures of dispersion** describe this "scatter." Mean deviation, along with Range, Variance, and Standard Deviation, falls into this category because it measures the extent to which data points are dispersed around the center. **Why other options are incorrect:** * **B. A ratio:** A ratio expresses the relationship between two independent quantities (e.g., Maternal Mortality Ratio). Mean deviation is an absolute measure of spread, not a comparison of two distinct groups. * **C. A range:** Range is the simplest measure of dispersion, calculated as the difference between the maximum and minimum values. Mean deviation is more complex as it involves all observations in the dataset. * **D. An average:** While the *calculation* involves taking an average of deviations, the term "average" usually refers to measures of central tendency (Mean, Median, Mode). Mean deviation is a measure of *variability*, not the center. **High-Yield Clinical Pearls for NEET-PG:** * **Standard Deviation (SD):** The most commonly used measure of dispersion in medical literature. It is the square root of the variance. * **Coefficient of Variation (CV):** Used to compare the relative dispersion of two sets of data with different units (e.g., comparing height in cm vs. weight in kg). * **Normal Distribution:** In a normal curve, Mean ± 1 SD covers **68%** of values, Mean ± 2 SD covers **95%**, and Mean ± 3 SD covers **99.7%**.
Explanation: **Explanation:** In biostatistics, data is summarized using two primary types of descriptive statistics: **Measures of Central Tendency** and **Measures of Dispersion**. **Why "Range" is the correct answer:** The **Range** is a **Measure of Dispersion** (variability). It represents the simplest way to measure the spread of data by calculating the difference between the highest and lowest values in a distribution. It does not describe the "center" or "typical" value of a dataset, but rather how scattered the data points are. **Why the other options are incorrect:** * **Mean (Arithmetic Average):** The most common measure of central tendency. it is calculated by summing all observations and dividing by the total number. It is sensitive to extreme values (outliers). * **Median (Positional Average):** The middle-most value when data is arranged in ascending or descending order. It is the best measure of central tendency for skewed distributions as it is not affected by outliers. * **Mode (Nominal Average):** The value that occurs most frequently in a dataset. A distribution can be unimodal, bimodal, or multimodal. **NEET-PG High-Yield Pearls:** 1. **Relationship in Normal Distribution:** Mean = Median = Mode (Symmetrical Bell Curve). 2. **Skewed Distributions:** * **Positively Skewed:** Mean > Median > Mode (Tail to the right). * **Negatively Skewed:** Mode > Median > Mean (Tail to the left). 3. **Best Measure:** Median is preferred for skewed data (e.g., survival time, incubation periods); Mean is preferred for normally distributed data (e.g., height, BP). 4. **Other Measures of Dispersion:** Standard Deviation (most common), Variance, and Interquartile Range.
Explanation: ### **Explanation** **1. Why Ordinal is Correct:** The measurement scale used here is **Ordinal** because the data categories (Good, Satisfactory, Poor) follow a **natural order or rank**. In an ordinal scale, the relative position of the categories is known (e.g., Good is better than Satisfactory), but the exact numerical difference between the ranks is not defined. In clinical practice, this is commonly seen in grading the severity of a disease or the quality of a patient's recovery. **2. Why Other Options are Incorrect:** * **Nominal:** This scale is used for naming or labeling categories without any inherent order (e.g., Gender: Male/Female; Blood Groups: A, B, AB, O). Since "Good" is clearly superior to "Poor," it cannot be nominal. * **Interval:** This scale has a defined order and equal intervals between values, but **no true zero point** (e.g., Temperature in Celsius). We cannot say that the "distance" between Good and Satisfactory is mathematically equal to the distance between Satisfactory and Poor. * **Ratio:** This is the highest level of measurement. It has all the properties of an interval scale plus a **true absolute zero** (e.g., Height, Weight, Blood Pressure). Qualitative descriptors like "Good" cannot have a mathematical zero. --- ### **High-Yield Clinical Pearls for NEET-PG** * **Mnemonic for Scales (Lowest to Highest Complexity):** **NOIR** (**N**ominal < **O**rdinal < **I**nterval < **R**atio). * **Qualitative Data:** Includes Nominal and Ordinal scales. * **Quantitative Data:** Includes Interval and Ratio scales. * **Common Ordinal Examples in Exams:** * Cancer Staging (Stage I, II, III, IV) * Glasgow Coma Scale (GCS) score * Likert Scales (Strongly Agree to Strongly Disagree) * Pain Scales (Mild, Moderate, Severe) * **Statistical Note:** For Ordinal data, the **Median** is the most appropriate measure of central tendency.
Explanation: **Explanation:** The **Median** is a measure of central tendency that represents the middle-most value in a data set when the observations are arranged in ascending or descending order. **Why Option A is Correct:** To calculate the median, follow these steps: 1. **Arrange the data:** The set is already in ascending order: 2, 5, 7, 10, 10, 13, 25. 2. **Count the observations (n):** Here, $n = 7$ (an odd number). 3. **Apply the formula:** For an odd number of observations, the Median is the $(\frac{n+1}{2})^{th}$ value. * Calculation: $(\frac{7+1}{2}) = 4^{th}$ value. 4. **Identify the value:** The 4th value in the sequence is **10**. **Why Incorrect Options are Wrong:** * **Option B (13):** This is the 6th value in the set. It would only be the median if the data set were much larger or differently distributed. * **Option C (25):** This is the maximum value (range limit), not the central value. * **Option D (5):** This is the 2nd value. Selecting this suggests a calculation error or failing to count to the center of the set. **High-Yield Clinical Pearls for NEET-PG:** * **Robustness:** Unlike the Mean, the Median is **not affected by extreme values (outliers)**. In this set, even if 25 were replaced by 250, the median would remain 10. * **Skewed Data:** The Median is the preferred measure of central tendency for **skewed distributions** (e.g., incubation periods, survival time, or income). * **Even Datasets:** If $n$ is even, the median is the average of the two middle-most values. * **Relationship:** In a perfectly symmetrical (Normal) distribution, **Mean = Median = Mode**.
Explanation: **Explanation** The correct answer is **D** because the statement is factually incorrect. In biostatistics, the "mode" refers to the value that occurs most frequently. A **bimodal distribution** is characterized by having **two distinct peaks** (two modes) rather than one. While most biological data follows a unimodal (one peak) normal distribution, a bimodal curve suggests that the sample actually contains two different populations (e.g., a distribution of hemoglobin levels showing two peaks might indicate a healthy group and an anemic group). **Analysis of other options:** * **Option A:** Histograms are indeed the standard method for illustrating frequency distributions of **continuous quantitative data**. The area of each bar represents the frequency. * **Option B:** Bar charts are the preferred tool for **discrete or qualitative (categorical) data**. Unlike histograms, there are spaces between the bars to indicate that the data is not continuous. * **Option C:** Frequency polygons (a type of line chart) are excellent for comparing multiple distributions on the same axes, as overlapping histograms would become visually cluttered and unreadable. **High-Yield Clinical Pearls for NEET-PG:** * **Normal Distribution (Gaussian):** Mean = Median = Mode. It is bell-shaped and symmetrical. * **Skewed Distributions:** If the tail is to the right, it is **Positively Skewed** (Mean > Median > Mode). If the tail is to the left, it is **Negatively Skewed** (Mode > Median > Mean). * **Frequency Polygon:** Created by joining the midpoints of the tops of the bars of a histogram. * **Cumulative Frequency Curve (Ogive):** Used to directly determine the **Median** of a dataset.
Explanation: ### Explanation **Why the correct answer is right:** In a **Case-Control Study**, the investigator starts with the "effect" (disease) and looks backward to find the "cause" (exposure). Because the study begins with a fixed number of diseased individuals rather than a population at risk followed over time, there is no **denominator** to calculate the number of new cases occurring in a population. Therefore, **Incidence Rates cannot be computed directly**. Incidence can only be calculated in longitudinal studies like **Cohort Studies** or Randomized Controlled Trials (RCTs). **Analysis of incorrect options:** * **Option A:** This is the fundamental definition of a case-control study. It is a retrospective design where cases (diseased) are compared with controls (non-diseased) regarding past exposure. * **Option B:** Since the study relies on participants' memories or old records to determine exposure, it is highly susceptible to **Recall Bias**, making this a true characteristic. * **Option C:** Defining a "case" can be challenging. Researchers must establish strict diagnostic criteria (e.g., histological vs. clinical diagnosis) and decide whether to use incident (new) or prevalent (existing) cases to avoid selection bias. **High-Yield Clinical Pearls for NEET-PG:** * **Measure of Association:** The primary measure in a Case-Control study is the **Odds Ratio (OR)**, which is an estimation of Relative Risk. * **Direction:** It is a **Retrospective** study (Proceeds from Effect to Cause). * **Suitability:** It is the best design for studying **rare diseases** or diseases with long latency periods. * **Nesting:** A "Nested Case-Control Study" is one conducted within a large cohort study, which helps minimize selection and information bias.
Explanation: ### Explanation The **Odds Ratio (OR)** is a measure of association used primarily in **Case-Control studies** to quantify the relationship between an exposure and an outcome. It represents the ratio of the odds of exposure among the cases to the odds of exposure among the controls. To calculate the OR, we use a standard **2x2 Contingency Table**: | | Disease (+) (Cases) | Disease (-) (Controls) | | :--- | :---: | :---: | | **Exposed (+)** | **a** | **b** | | **Non-Exposed (-)** | **c** | **d** | 1. **Odds of exposure in cases:** $a / c$ 2. **Odds of exposure in controls:** $b / d$ 3. **Odds Ratio:** $(a/c) \div (b/d) = \mathbf{ad/bc}$ #### Analysis of Options: * **D (ad/bc): Correct.** This is the "cross-product ratio," derived by multiplying the diagonal cells (exposed cases × non-exposed controls) and dividing by the product of the other diagonal (exposed controls × non-exposed cases). * **A, B, and C:** These are incorrect mathematical arrangements of the 2x2 table cells that do not represent any standard epidemiological measure of association. #### NEET-PG High-Yield Pearls: * **Study Design:** OR is the hallmark of **Case-Control studies**. It is used because the incidence of disease cannot be calculated in these studies (as the researcher determines the number of cases). * **Interpretation:** * **OR > 1:** Positive association (Risk factor). * **OR = 1:** No association. * **OR < 1:** Negative association (Protective factor). * **Rare Disease Assumption:** When a disease is rare (incidence < 5%), the Odds Ratio becomes a good estimate of the **Relative Risk (RR)**. * **Cross-sectional studies:** OR can also be used here, but it is then termed the "Prevalence Odds Ratio."
Explanation: **Explanation:** In Biostatistics, variables are classified based on the nature of the data they represent. The **number of family members** is a classic example of a **Discrete Variable**. **1. Why Discrete Variable is correct:** A discrete variable is a type of quantitative (numerical) variable that can only take on specific, whole-number values. These values are obtained by **counting**. Since you cannot have 4.5 or 5.2 family members—only 4, 5, or 6—the data exists in distinct, separate units with no possible values in between. **2. Why other options are incorrect:** * **Continuous Variable:** These are numerical variables that can take any value within a range, including decimals and fractions. They are typically obtained by **measuring**. Examples include height (165.5 cm), weight (70.2 kg), or blood pressure. * **Qualitative / Categorical Variable:** These describe a quality or attribute rather than a numerical quantity. They are expressed in words or categories. Examples include gender (Male/Female), blood group (A, B, AB, O), or socioeconomic status. While "number of family members" can be grouped into categories (e.g., Small vs. Large), the number itself is inherently quantitative. **Clinical Pearls for NEET-PG:** * **Memory Aid:** **D**iscrete = **D**isconnected (whole numbers); **C**ontinuous = **C**onnected (decimals possible). * **Scales of Measurement:** Discrete and Continuous variables fall under **Interval** or **Ratio** scales. * **High-Yield Example:** Number of hospital beds or number of cases of a disease are **Discrete**; Hemoglobin levels or Serum Creatinine are **Continuous**.
Explanation: **Explanation:** The **Perinatal Mortality Rate (PMR)** is a key indicator of the quality of antenatal, natal, and postnatal care. It encompasses late fetal deaths (stillbirths) and early neonatal deaths. **1. Why Option B is Correct:** According to the **World Health Organization (WHO)** and the **National Health Mission (NHM)** in India, the Perinatal Mortality Rate is calculated as the number of perinatal deaths (late fetal deaths after 28 weeks of gestation + first-week neonatal deaths) per **1,000 live births**. While some older definitions used "total births" (live births + stillbirths) as the denominator, the standardized reporting for national health statistics in India (SRS) and many international bodies uses **1,000 live births** to ensure consistency with other mortality indicators like IMR and NMR. **2. Why Other Options are Incorrect:** * **Option A:** While "total births" is used in the *theoretical* definition of PMR to include stillbirths in the denominator, most standardized competitive exams and the SRS (Sample Registration System) in India prioritize **live births** as the denominator for ease of calculation and comparison. * **Options C & D:** These are mathematically incorrect. Mortality rates in maternal and child health are typically expressed per 1,000 (IMR, NMR, PMR) or per 100,000 (MMR). A denominator of 10,000 is not standard for these indicators. **High-Yield Clinical Pearls for NEET-PG:** * **Components of PMR:** Late Fetal Deaths (Stillbirths >28 weeks) + Early Neonatal Deaths (0-7 days of life). * **Best Indicator:** PMR is considered the best indicator of **obstetric care** and maternal health status. * **Denominator Rule:** * **IMR, NMR, PMR:** Per 1,000 live births. * **MMR (Maternal Mortality Ratio):** Per 100,000 live births. * **Stillbirth Definition:** Fetal death occurring after 28 weeks of gestation (weight >1000g).
Explanation: **Explanation:** The core of this question lies in distinguishing between **Parametric** and **Non-parametric** statistical tests. **1. Why Student’s t-test is the correct answer:** The **Student’s t-test** is a **Parametric test**. Parametric tests are used when the data follows a **Normal (Gaussian) Distribution** and the variables are measured on an interval or ratio scale (quantitative data). The t-test specifically compares the means of two groups (e.g., comparing the mean hemoglobin levels between two groups of pregnant women). **2. Why the other options are incorrect (Non-parametric tests):** Non-parametric tests (Distribution-free tests) are used when the data is non-normal, skewed, or qualitative (nominal/ordinal). * **Chi-square test (Option A):** Used to compare proportions and test the association between two categorical variables (e.g., smoking status and lung cancer). * **Sign test (Option B):** A non-parametric alternative to the paired t-test, used to compare paired observations based on the direction of the difference. * **Fisher’s exact test (Option C):** Used instead of the Chi-square test for categorical data when the sample size is very small (expected frequency in any cell is <5). **High-Yield Clinical Pearls for NEET-PG:** * **Memory Aid:** If the test name contains "Mean" or "Standard Deviation," it is likely Parametric (e.g., Z-test, t-test, ANOVA). * **ANOVA (F-test):** Used to compare means of **three or more** groups. * **Wilcoxon Rank Sum / Mann-Whitney U test:** The non-parametric equivalent of the unpaired t-test. * **Kruskal-Wallis test:** The non-parametric equivalent of ANOVA. * **Correlation:** Pearson’s (Parametric) vs. Spearman’s (Non-parametric).
Collection and Presentation of Data
Practice Questions
Measures of Central Tendency
Practice Questions
Measures of Dispersion
Practice Questions
Normal Distribution
Practice Questions
Sampling Methods
Practice Questions
Sample Size Calculation
Practice Questions
Hypothesis Testing
Practice Questions
Tests of Significance
Practice Questions
Correlation and Regression
Practice Questions
Survival Analysis
Practice Questions
Multivariate Analysis
Practice Questions
Statistical Software in Research
Practice Questions
Get full access to all questions, explanations, and performance tracking.
Start For Free