Biostatistics Practice Questions

Q: What does it indicate when the highest mean and lowest mode are observed?

Positive skewing. ### Explanation In biostatistics, the relationship between the measures of central tendency (Mean, Median, and Mode) determines the shape of the distribution curve. **1. Why Positive Skewing is Correct:** In a **Positively Skewed Distribution** (also known as Right-skewed), the tail of the distribution extends toward the higher values on the right side. * The **Mode** represents the peak of the curve (the most frequent value) and remains at the lowest numerical value. * The **Mean** is highly sensitive to extreme values (outliers) in the tail, which pulls it toward the right. * Therefore, the relationship is: **Mean > Median > Mode**. Since the Mean is the highest and the Mode is the lowest, this perfectly describes positive skewing. **2. Why the Other Options are Incorrect:** * **Negative Skewing (Left-skewed):** The tail extends toward the lower values. Here, the Mean is pulled down by outliers, resulting in **Mode > Median > Mean**. The Mean is the lowest value, not the highest. * **Normal/Symmetrical Distribution:** In a perfectly symmetrical bell-shaped curve, the Mean, Median, and Mode are all equal (**Mean = Median = Mode**). There is no "highest" or "lowest" value among them. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Memory Aid:** In **P**ositive skew, the Mean is more **P**ositive (greater). In **N**egative skew, the Mean is more **N**egative (smaller). * **Median:** Always stays in the middle in both types of skewed distributions. It is the best measure of central tendency for skewed data. * **Sensitivity:** The Mean is the most affected by outliers; the Mode is the least affected. * **Example:** Income distribution in a population or incubation periods of most infectious diseases usually follow a positive skew.

Q: What is the best method to show the trend of events with the passage of time?

Line diagram. **Explanation:** **Why Line Diagram is Correct:** A **Line diagram** (or line graph) is the most effective method for representing **time-series data**. It is specifically designed to show trends, fluctuations, or changes in a variable over a continuous period (e.g., years, months, or weeks). By connecting discrete data points with a line, it allows for easy visualization of whether a trend is increasing, decreasing, or remaining stable. In epidemiology, it is frequently used to plot the incidence of diseases over several years. **Analysis of Incorrect Options:** * **Bar Diagram:** Used for comparing discrete, qualitative categories (e.g., number of hospital beds in different cities). It represents data in bars of equal width with gaps in between; it is not ideal for showing continuous temporal trends. * **Histogram:** Used to represent the frequency distribution of **continuous quantitative data** (e.g., age groups, height). Unlike bar charts, there are no gaps between bars. While it shows distribution, it does not track a single variable's trend over time. * **Pie Chart:** Used to show the **proportional distribution** of a whole at a single point in time (e.g., causes of maternal mortality). It cannot depict changes over time. **High-Yield Clinical Pearls for NEET-PG:** * **Frequency Polygon:** Created by joining the midpoints of the tops of a histogram; it is used to compare two or more frequency distributions. * **Scatter Diagram:** Used to show the **correlation** (relationship) between two quantitative variables. * **Ogive:** A graph representing cumulative frequency. * **Pictogram:** The best method for conveying information to a non-literate or general population.

Q: What is the definition of statistical power?

The ability to reject a false null hypothesis.. **Statistical Power** is a fundamental concept in clinical research that measures a study’s ability to detect an effect (or difference) when one truly exists. ### Why Option B is Correct Statistical Power (represented as **1 – β**) is the probability that a test will correctly **reject a false null hypothesis**. In clinical terms, if a new drug is truly effective (the null hypothesis is false), power is the likelihood that the study will yield a statistically significant result confirming that effectiveness. A power of 0.80 (80%) is generally considered the minimum acceptable level for clinical trials. ### Explanation of Incorrect Options * **Option A:** Rejecting a *true* null hypothesis is a **Type I Error (α)**. This is a "false positive," where you claim a difference exists when it actually does not. * **Option C:** Correctly accepting (or failing to reject) a *true* null hypothesis is known as **Confidence Level (1 – α)**. * **Option D:** Accepting (failing to reject) a *false* null hypothesis is a **Type II Error (β)**. This is a "false negative," where the study fails to detect a real effect. ### NEET-PG High-Yield Pearls * **The Relationship:** Power is inversely related to Type II error (**Power = 1 – β**). As the risk of a false negative decreases, the power of the study increases. * **Factors Increasing Power:** 1. **Increased Sample Size (N):** The most common way to boost power. 2. **Increased Effect Size:** Larger differences are easier to detect. 3. **Decreased Standard Deviation:** Less "noise" in the data makes the "signal" clearer. 4. **Increased Alpha (α):** Though rarely done, increasing the significance level (e.g., from 0.05 to 0.10) increases power.

Q: If the coefficient of correlation between height and weight is 2.6, what is true?

The calculation of the coefficient is wrong. ### Explanation **1. Why the Correct Answer is Right:** The **Pearson Correlation Coefficient (r)** is a statistical measure that quantifies the strength and direction of a linear relationship between two continuous variables. A fundamental mathematical property of the correlation coefficient is that its value **must always lie between -1 and +1** ($-1 \leq r \leq +1$). * **r = +1:** Perfect positive correlation. * **r = -1:** Perfect negative correlation. * **r = 0:** No linear correlation. In this question, the value provided is **2.6**, which exceeds the maximum possible limit of +1. Therefore, such a value is mathematically impossible, indicating that the calculation is incorrect. **2. Why the Other Options are Wrong:** * **Option A (Positive correlation):** While a positive number usually indicates a positive correlation, the value must be $\leq 1$. Since 2.6 is invalid, we cannot conclude the nature of the correlation. * **Option B (No association):** No association is represented by $r = 0$. * **Option C (Negative correlation):** A negative correlation is represented by a value between 0 and -1. **3. Clinical Pearls & High-Yield Facts for NEET-PG:** * **Coefficient of Determination ($r^2$):** This is the square of the correlation coefficient. it represents the proportion of variance in one variable that is predictable from the other. (e.g., if $r = 0.6$, then $r^2 = 0.36$ or 36%). * **Independence of Units:** The value of 'r' is independent of the units of measurement (e.g., whether height is in cm or inches, 'r' remains the same). * **Scatter Diagram:** The best visual method to represent correlation. * **Regression vs. Correlation:** Correlation measures the *strength* of association, while Regression measures the *nature* of the relationship to predict the value of a dependent variable.

Q: What is the definition of standard deviation?

The dispersion of values about the mean.. **Explanation:** **Standard Deviation (SD)** is the most commonly used measure of **dispersion** in biostatistics. It quantifies how much the individual observations in a data set spread out or "deviate" from the arithmetic mean. A low SD indicates that the data points are clustered closely around the mean, while a high SD indicates that the data are spread over a wider range. In a Normal (Gaussian) distribution, SD is used to define confidence intervals (e.g., Mean ± 1 SD covers 68% of values). **Analysis of Options:** * **Option A (Middle observation):** This defines the **Median**. It is a measure of central tendency used primarily for skewed data. * **Option B (Arithmetic mean):** This is the **Average** of all observations. It is a measure of central tendency, not dispersion. * **Option D (Most frequent value):** This defines the **Mode**. It is the only measure of central tendency that can be used for nominal (qualitative) data. **NEET-PG High-Yield Pearls:** 1. **Variance:** It is the square of the Standard Deviation ($SD^2$). 2. **Standard Error (SE):** It measures the dispersion of *sample means* around the *population mean* ($SE = SD / \sqrt{n}$). 3. **Coefficient of Variation:** Used to compare variability between two different units (e.g., height in cm vs. weight in kg). Formula: $(SD / Mean) \times 100$. 4. **Normal Distribution:** * Mean ± 1 SD = 68.3% * Mean ± 2 SD = 95.4% * Mean ± 3 SD = 99.7%

Q: What is the standard error of the mean called?

Standard deviation. ### Explanation The **Standard Error of the Mean (SEM)** is a measure of the dispersion of sample means around the true population mean. In biostatistics, the SEM is mathematically defined as the **Standard Deviation** of the sampling distribution of the mean. It represents how much the sample mean is likely to fluctuate from the actual population mean. The formula is: **SEM = σ / √n** *(where σ = Standard Deviation of the population and n = sample size)*. Essentially, SEM is the "Standard Deviation" of the error when estimating a population mean from a sample. #### Why other options are incorrect: * **B. Mode:** This is a measure of central tendency representing the most frequently occurring value in a data set. It does not describe variability or sampling error. * **C. Median:** This is the middle-most value of a distribution when arranged in order. Like the mode, it is a measure of central tendency, not dispersion. * **D. Variable:** This is a general term for any characteristic, number, or quantity that can be measured or counted (e.g., age, blood pressure). It is not a statistical measure of error. #### High-Yield NEET-PG Pearls: * **SD vs. SEM:** Standard Deviation (SD) describes the variability **within a single sample**, while Standard Error (SEM) describes the **reliability of the sample mean** compared to the population. * **Relationship with 'n':** As the sample size ($n$) increases, the SEM decreases (the estimate becomes more precise). * **Confidence Intervals:** SEM is used to calculate Confidence Intervals (CI). For a 95% CI, the range is approximately **Mean ± 2 SEM**.

Q: The parameters of sensitivity and specificity are used for assessing which type of validity?

Criterion validity. ### Explanation **1. Why Criterion Validity is Correct:** Criterion validity refers to the extent to which a new test (the index test) correlates with a "Gold Standard" (the criterion). In clinical medicine, sensitivity and specificity are the primary metrics used to measure this relationship. * **Sensitivity** measures the test's ability to correctly identify those with the disease (compared to the gold standard). * **Specificity** measures the test's ability to correctly identify those without the disease. Since these parameters evaluate the performance of a screening or diagnostic tool against a definitive reference, they are the hallmarks of **Criterion Validity**. **2. Why Other Options are Incorrect:** * **Construct Validity:** This assesses how well a test measures a theoretical concept or trait (e.g., intelligence, depression, or pain). It is used when no single gold standard exists. * **Discriminant Validity:** A subtype of construct validity, it ensures that a test does *not* correlate with variables it is theoretically supposed to be different from. * **Content Validity:** This evaluates whether the test covers the entire range of the subject matter it is intended to measure (e.g., does a final exam cover all chapters of the syllabus?). It is usually judged by a panel of experts rather than statistical formulas. **3. High-Yield Clinical Pearls for NEET-PG:** * **Sensitivity (True Positive Rate):** Essential for **Screening** tests to rule out disease (SNOUT). * **Specificity (True Negative Rate):** Essential for **Confirmatory** tests to rule in disease (SPIN). * **Predictive Values:** Unlike sensitivity/specificity, Positive and Negative Predictive Values are highly dependent on the **prevalence** of the disease in the population. * **Likelihood Ratio:** Considered the best way to measure diagnostic accuracy as it is independent of prevalence.

Q: What is the main purpose of randomization in research?

To eliminate selection bias. **Explanation:** **Randomization** is the "heart" of a Randomized Controlled Trial (RCT). Its primary purpose is to ensure that every participant has an equal chance of being assigned to any study group (intervention or control). 1. **Why Option A is Correct:** Randomization primarily **eliminates selection bias**. By using a random sequence (like computer-generated numbers) rather than the investigator's choice, it prevents the researcher from consciously or unconsciously picking specific patients for specific groups. This ensures that the study groups are comparable at the start of the trial. 2. **Why Other Options are Incorrect:** * **Option B:** Randomization does not "remove" confounding factors; it **distributes** them equally between groups. While it is the best method to balance both known and unknown confounders, the term "eliminate selection bias" is the more precise definition of its *direct* purpose. * **Option C:** Randomization ensures the **validity** of the study, not necessarily "good" (positive) results. A well-randomized study may still show that a drug is ineffective. * **Option D:** Analysis bias is prevented by "Blinding" and "Intention-to-treat analysis," not by the initial act of randomization. **High-Yield Pearls for NEET-PG:** * **Randomization vs. Random Sampling:** Randomization ensures **internal validity** (comparability of groups), whereas random sampling ensures **external validity** (generalizability). * **Confounding:** Randomization is the only method that can control for **unknown confounders**. * **Blinding:** While randomization eliminates selection bias, **blinding** is used to eliminate **ascertainment (observer) bias**. * **Allocation Concealment:** This is the process used to *implement* randomization (e.g., opaque envelopes) to prevent the researcher from knowing the next assignment.

Question 1

What does it indicate when the highest mean and lowest mode are observed?

Accepted Answer

Positive skewing

Answer

Negative skewing

Answer

Normal distribution

Answer

Symmetrical distribution

Question 2

What is the best method to show the trend of events with the passage of time?

Accepted Answer

Line diagram

Answer

Bar diagram

Answer

Histogram

Answer

Pie chart

Question 3

What is the definition of statistical power?

Accepted Answer

The ability to reject a false null hypothesis.

Answer

The ability to reject a true null hypothesis.

Answer

The ability to accept a true null hypothesis.

Answer

The ability to accept a false null hypothesis.

Question 4

If the coefficient of correlation between height and weight is 2.6, what is true?

Accepted Answer

The calculation of the coefficient is wrong

Answer

Positive correlation

Answer

No association

Answer

Negative correlation

Question 5

Lead time is defined as:

Accepted Answer

Time between the point where the diagnosis is made by a screening test to the point of usual diagnosis

Answer

Time between diagnosis and treatment

Answer

Time between disease onset and outcome

Answer

Time between the usual time of diagnosis and outcome

Question 6

What is the definition of standard deviation?

Accepted Answer

The dispersion of values about the mean.

Answer

The value of the middle observation when data is arranged in ascending order.

Answer

The arithmetic mean.

Answer

The most frequently occurring value.

Question 7

What is the standard error of the mean called?

Accepted Answer

Standard deviation

Answer

Mode

Answer

Median

Answer

Variable

Question 8

The parameters of sensitivity and specificity are used for assessing which type of validity?

Accepted Answer

Criterion validity

Answer

Construct validity

Answer

Discriminant validity

Answer

Content validity

Question 9

Design Effect is associated with which of the following sampling techniques?

Accepted Answer

Systemic sampling

Answer

Stratified sampling

Answer

Cluster sampling

Answer

Simple Random sampling

Question 10

What is the main purpose of randomization in research?

Accepted Answer

To eliminate selection bias

Answer

To remove confounding factors

Answer

To ensure good results

Answer

To eliminate analysis bias

Biostatistics — MCQs

Biostatistics — MCQs

On this page

Practice by Chapter

Want unlimited practice?