P-values and confidence intervals US Medical PG Practice Questions and MCQs
Practice US Medical PG questions for P-values and confidence intervals. These multiple choice questions (MCQs) cover important concepts and help you prepare for your exams.
P-values and confidence intervals US Medical PG Question 1: A scientist in Chicago is studying a new blood test to detect Ab to EBV with increased sensitivity and specificity. So far, her best attempt at creating such an exam reached 82% sensitivity and 88% specificity. She is hoping to increase these numbers by at least 2 percent for each value. After several years of work, she believes that she has actually managed to reach a sensitivity and specificity much greater than what she had originally hoped for. She travels to China to begin testing her newest blood test. She finds 2,000 patients who are willing to participate in her study. Of the 2,000 patients, 1,200 of them are known to be infected with EBV. The scientist tests these 1,200 patients' blood and finds that only 120 of them tested negative with her new exam. Of the patients who are known to be EBV-free, only 20 of them tested positive. Given these results, which of the following correlates with the exam's specificity?
- A. 82%
- B. 90%
- C. 84%
- D. 86%
- E. 98% (Correct Answer)
P-values and confidence intervals Explanation: ***98%***
- **Specificity** measures the proportion of **true negatives** among all actual negatives.
- In this case, 800 patients are known to be EBV-free (actual negatives), and 20 of them tested positive (false positives). This means 800 - 20 = 780 tested negative (true negatives). Specificity = (780 / 800) * 100% = **98%**.
*82%*
- This value represents the *original sensitivity* before the scientist’s new attempts to improve the test.
- It does not reflect the *newly calculated specificity* based on the provided data.
*90%*
- This value represents the *newly calculated sensitivity* of the test, not the specificity.
- Out of 1200 EBV-infected patients, 120 tested negative (false negatives), meaning 1080 tested positive (true positives). Sensitivity = (1080 / 1200) * 100% = 90%.
*84%*
- This percentage is not directly derived from the information given for either sensitivity or specificity after the new test results.
- It does not correspond to any of the calculated values for the new test's performance.
*86%*
- This percentage is not directly derived from the information given for either sensitivity or specificity after the new test results.
- It does not correspond to any of the calculated values for the new test's performance.
P-values and confidence intervals US Medical PG Question 2: A randomized control double-blind study is conducted on the efficacy of 2 sulfonylureas. The study concluded that medication 1 was more efficacious in lowering fasting blood glucose than medication 2 (p ≤ 0.05; 95% CI: 14 [10-21]). Which of the following is true regarding a 95% confidence interval (CI)?
- A. If the same study were repeated multiple times, approximately 95% of the calculated confidence intervals would contain the true population parameter. (Correct Answer)
- B. The 95% confidence interval is the probability chosen by the researcher to be the threshold of statistical significance.
- C. When a 95% CI for the estimated difference between groups contains the value ‘0’, the results are significant.
- D. It represents the probability that chance would not produce the difference shown, 95% of the time.
- E. The study is adequately powered at the 95% confidence interval.
P-values and confidence intervals Explanation: ***If the same study were repeated multiple times, approximately 95% of the calculated confidence intervals would contain the true population parameter.***
- This statement accurately defines the **frequentist interpretation** of a confidence interval (CI). It reflects the long-run behavior of the CI over hypothetical repetitions of the study.
- A 95% CI means that if you were to repeat the experiment many times, 95% of the CIs calculated from those experiments would capture the **true underlying population parameter**.
*The 95% confidence interval is the probability chosen by the researcher to be the threshold of statistical significance.*
- The **alpha level (α)**, typically set at 0.05 (or 5%), is the threshold for statistical significance (p ≤ 0.05), representing the probability of a Type I error.
- The 95% confidence level (1-α) is related to statistical significance, but it is not the *threshold* itself; rather, it indicates the **reliability** of the interval estimate.
*When a 95% CI for the estimated difference between groups contains the value ‘0’, the results are significant.*
- If a 95% CI for the difference between groups **contains 0**, it implies that there is **no statistically significant difference** between the groups at the 0.05 alpha level.
- A statistically significant difference (p ≤ 0.05) would be indicated if the 95% CI **does NOT contain 0**, suggesting that the intervention had a real effect.
*It represents the probability that chance would not produce the difference shown, 95% of the time.*
- This statement misinterprets the meaning of a CI and probability. The chance of not producing the observed difference is typically addressed by the **p-value**, not directly by the CI in this manner.
- A CI provides a **range of plausible values** for the population parameter, not a probability about the role of chance in producing the observed difference.
*The study is adequately powered at the 95% confidence interval.*
- **Statistical power** is the probability of correctly rejecting a false null hypothesis, typically set at 80% or 90%. It is primarily determined by sample size, effect size, and alpha level.
- A 95% CI is a measure of the **precision** of an estimate, while power refers to the **ability of a study to detect an effect** if one exists. They are related but distinct concepts.
P-values and confidence intervals US Medical PG Question 3: A surgeon is interested in studying how different surgical techniques impact the healing of tendon injuries. In particular, he will compare 3 different types of suture repairs biomechanically in order to determine the maximum load before failure of the tendon 2 weeks after repair. He collects data on maximum load for 90 different repaired tendons from an animal model. Thirty tendons were repaired using each of the different suture techniques. Which of the following statistical measures is most appropriate for analyzing the results of this study?
- A. Chi-squared
- B. Wilcoxon rank sum
- C. Pearson r coefficient
- D. Student t-test
- E. ANOVA (Correct Answer)
P-values and confidence intervals Explanation: ***ANOVA***
- **ANOVA (Analysis of Variance)** is appropriate here because it compares the means of **three or more independent groups** (the three different suture techniques) on a continuous dependent variable (maximum load before failure).
- The study has three distinct repair techniques, each with 30 tendons, making ANOVA suitable for determining if there are statistically significant differences among their mean failure loads.
*Chi-squared*
- The **Chi-squared test** is used for analyzing **categorical data** (frequencies or proportions) to determine if there is an association between two nominal variables.
- This study involves quantitative measurement (maximum load), not categorical data, making Chi-squared inappropriate.
*Wilcoxon rank sum*
- The **Wilcoxon rank sum test** (also known as Mann-Whitney U test) is a **non-parametric test** used to compare two independent groups when the data is not normally distributed or is ordinal.
- While the study has independent groups, it involves three groups, and the dependent variable is continuous, making ANOVA a more powerful and appropriate choice assuming normal distribution.
*Pearson r coefficient*
- The **Pearson r coefficient** measures the **strength and direction of a linear relationship between two continuous variables**.
- This study aims to compare means across different groups, not to determine the correlation between two continuous variables.
*Student t-test*
- The **Student t-test** is used to compare the means of **exactly two groups** (either independent or paired) on a continuous dependent variable.
- This study involves comparing three different suture techniques, not just two, making the t-test unsuitable.
P-values and confidence intervals US Medical PG Question 4: You are reading through a recent article that reports significant decreases in all-cause mortality for patients with malignant melanoma following treatment with a novel biological infusion. Which of the following choices refers to the probability that a study will find a statistically significant difference when one truly does exist?
- A. Type II error
- B. Type I error
- C. Confidence interval
- D. p-value
- E. Power (Correct Answer)
P-values and confidence intervals Explanation: ***Power***
- **Power** is the probability that a study will correctly reject the null hypothesis when it is, in fact, false (i.e., will find a statistically significant difference when one truly exists).
- A study with high power minimizes the risk of a **Type II error** (failing to detect a real effect).
*Type II error*
- A **Type II error** (or **beta error**) occurs when a study fails to reject a false null hypothesis, meaning it concludes there is no significant difference when one actually exists.
- This is the **opposite** of what the question describes, which asks for the probability of *finding* a difference.
*Type I error*
- A **Type I error** (or **alpha error**) occurs when a study incorrectly rejects a true null hypothesis, concluding there is a significant difference when one does not actually exist.
- This relates to the **p-value** and the level of statistical significance (e.g., p < 0.05).
*Confidence interval*
- A **confidence interval** provides a range of values within which the true population parameter is likely to lie with a certain degree of confidence (e.g., 95%).
- It does not directly represent the probability of finding a statistically significant difference when one truly exists.
*p-value*
- The **p-value** is the probability of observing data as extreme as, or more extreme than, that obtained in the study, assuming the null hypothesis is true.
- It is used to determine statistical significance, but it is not the probability of detecting a true effect.
P-values and confidence intervals US Medical PG Question 5: A medical research study is beginning to evaluate the positive predictive value of a novel blood test for non-Hodgkin’s lymphoma. The diagnostic arm contains 700 patients with NHL, of which 400 tested positive for the novel blood test. In the control arm, 700 age-matched control patients are enrolled and 0 are found positive for the novel test. What is the PPV of this test?
- A. 400 / (400 + 0) (Correct Answer)
- B. 700 / (700 + 300)
- C. 400 / (400 + 300)
- D. 700 / (700 + 0)
- E. 700 / (400 + 400)
P-values and confidence intervals Explanation: ***400 / (400 + 0) = 1.0 or 100%***
- The **positive predictive value (PPV)** is calculated as **True Positives / (True Positives + False Positives)**.
- In this scenario, **True Positives (TP)** are the 400 patients with NHL who tested positive, and **False Positives (FP)** are 0, as no control patients tested positive.
- This gives a PPV of 400/400 = **1.0 or 100%**, indicating that all patients who tested positive actually had the disease.
*700 / (700 + 300)*
- This calculation does not align with the formula for PPV based on the given data.
- The denominator `(700+300)` suggests an incorrect combination of various patient groups.
*400 / (400 + 300)*
- The denominator `(400+300)` incorrectly includes 300, which is the number of **False Negatives** (patients with NHL who tested negative), not False Positives.
- PPV focuses on the proportion of true positives among all positive tests, not all diseased individuals.
*700 / (700 + 0)*
- This calculation incorrectly uses the total number of patients with NHL (700) as the numerator, rather than the number of positive test results in that group.
- The numerator should be the **True Positives** (400), not the total number of diseased individuals.
*700 / (400 + 400)*
- This calculation uses incorrect values for both the numerator and denominator, not corresponding to the PPV formula.
- The numerator 700 represents the total number of patients with the disease, not those who tested positive, and the denominator incorrectly sums up values that don't represent the proper PPV calculation.
More P-values and confidence intervals US Medical PG questions available in the OnCourse app. Practice MCQs, flashcards, and get detailed explanations.