One-sided vs two-sided tests US Medical PG Practice Questions and MCQs
Practice US Medical PG questions for One-sided vs two-sided tests. These multiple choice questions (MCQs) cover important concepts and help you prepare for your exams.
One-sided vs two-sided tests US Medical PG Question 1: Group of 100 medical students took an end of the year exam. The mean score on the exam was 70%, with a standard deviation of 25%. The professor states that a student's score must be within the 95% confidence interval of the mean to pass the exam. Which of the following is the minimum score a student can have to pass the exam?
- A. 45%
- B. 63.75%
- C. 67.5%
- D. 20%
- E. 65% (Correct Answer)
One-sided vs two-sided tests Explanation: ***65%***
- To find the **95% confidence interval (CI) of the mean**, we use the formula: Mean ± (Z-score × Standard Error). For a 95% CI, the Z-score is approximately **1.96**.
- The **Standard Error (SE)** is calculated as SD/√n, where n is the sample size (100 students). So, SE = 25%/√100 = 25%/10 = **2.5%**.
- The 95% CI is 70% ± (1.96 × 2.5%) = 70% ± 4.9%. The lower bound is 70% - 4.9% = **65.1%**, which rounds to **65%** as the minimum passing score.
*45%*
- This value is significantly lower than the calculated lower bound of the 95% confidence interval (approximately 65.1%).
- It would represent a score far outside the defined passing range.
*63.75%*
- This value falls below the calculated lower bound of the 95% confidence interval (approximately 65.1%).
- While close, this score would not meet the professor's criterion for passing.
*67.5%*
- This value is within the 95% confidence interval (65.1% to 74.9%) but is **not the minimum score**.
- Lower scores within the interval would still qualify as passing.
*20%*
- This score is extremely low and falls significantly outside the 95% confidence interval for a mean of 70%.
- It would indicate performance far below the defined passing threshold.
One-sided vs two-sided tests US Medical PG Question 2: You are conducting a study comparing the efficacy of two different statin medications. Two groups are placed on different statin medications, statin A and statin B. Baseline LDL levels are drawn for each group and are subsequently measured every 3 months for 1 year. Average baseline LDL levels for each group were identical. The group receiving statin A exhibited an 11 mg/dL greater reduction in LDL in comparison to the statin B group. Your statistical analysis reports a p-value of 0.052. Which of the following best describes the meaning of this p-value?
- A. There is a 95% chance that the difference in reduction of LDL observed reflects a real difference between the two groups
- B. Though A is more effective than B, there is a 5% chance the difference in reduction of LDL between the two groups is due to chance
- C. If 100 permutations of this experiment were conducted, 5 of them would show similar results to those described above
- D. This is a statistically significant result
- E. There is a 5.2% chance of observing a difference in reduction of LDL of 11 mg/dL or greater even if the two medications have identical effects (Correct Answer)
One-sided vs two-sided tests Explanation: **There is a 5.2% chance of observing a difference in reduction of LDL of 11 mg/dL or greater even if the two medications have identical effects**
- The **p-value** represents the probability of observing results as extreme as, or more extreme than, the observed data, assuming the **null hypothesis** is true (i.e., there is no true difference between the groups).
- A p-value of 0.052 means there's approximately a **5.2% chance** that the observed 11 mg/dL difference (or a more substantial difference) occurred due to **random variation**, even if both statins were equally effective.
*There is a 95% chance that the difference in reduction of LDL observed reflects a real difference between the two groups*
- This statement is an incorrect interpretation of the p-value; it confuses the p-value with the **probability that the alternative hypothesis is true**.
- A p-value does not directly tell us the probability that the observed difference is "real" or due to the intervention being studied.
*Though A is more effective than B, there is a 5% chance the difference in reduction of LDL between the two groups is due to chance*
- This statement implies that Statin A is more effective, which cannot be concluded with a p-value of 0.052 if the significance level (alpha) was set at 0.05.
- While it's true there's a chance the difference is due to chance, claiming A is "more effective" based on this p-value before statistical significance is usually declared is misleading.
*If 100 permutations of this experiment were conducted, 5 of them would show similar results to those described above*
- This is an incorrect interpretation. The p-value does not predict the outcome of repeated experiments in this manner.
- It refers to the **probability under the null hypothesis in a single experiment**, not the frequency of results across multiple hypothetical repetitions.
*This is a statistically significant result*
- A p-value of 0.052 is generally considered **not statistically significant** if the conventional alpha level (significance level) is set at 0.05 (or 5%).
- For a result to be statistically significant at alpha = 0.05, the p-value must be **less than 0.05**.
One-sided vs two-sided tests US Medical PG Question 3: A randomized control double-blind study is conducted on the efficacy of 2 sulfonylureas. The study concluded that medication 1 was more efficacious in lowering fasting blood glucose than medication 2 (p ≤ 0.05; 95% CI: 14 [10-21]). Which of the following is true regarding a 95% confidence interval (CI)?
- A. If the same study were repeated multiple times, approximately 95% of the calculated confidence intervals would contain the true population parameter. (Correct Answer)
- B. The 95% confidence interval is the probability chosen by the researcher to be the threshold of statistical significance.
- C. When a 95% CI for the estimated difference between groups contains the value ‘0’, the results are significant.
- D. It represents the probability that chance would not produce the difference shown, 95% of the time.
- E. The study is adequately powered at the 95% confidence interval.
One-sided vs two-sided tests Explanation: ***If the same study were repeated multiple times, approximately 95% of the calculated confidence intervals would contain the true population parameter.***
- This statement accurately defines the **frequentist interpretation** of a confidence interval (CI). It reflects the long-run behavior of the CI over hypothetical repetitions of the study.
- A 95% CI means that if you were to repeat the experiment many times, 95% of the CIs calculated from those experiments would capture the **true underlying population parameter**.
*The 95% confidence interval is the probability chosen by the researcher to be the threshold of statistical significance.*
- The **alpha level (α)**, typically set at 0.05 (or 5%), is the threshold for statistical significance (p ≤ 0.05), representing the probability of a Type I error.
- The 95% confidence level (1-α) is related to statistical significance, but it is not the *threshold* itself; rather, it indicates the **reliability** of the interval estimate.
*When a 95% CI for the estimated difference between groups contains the value ‘0’, the results are significant.*
- If a 95% CI for the difference between groups **contains 0**, it implies that there is **no statistically significant difference** between the groups at the 0.05 alpha level.
- A statistically significant difference (p ≤ 0.05) would be indicated if the 95% CI **does NOT contain 0**, suggesting that the intervention had a real effect.
*It represents the probability that chance would not produce the difference shown, 95% of the time.*
- This statement misinterprets the meaning of a CI and probability. The chance of not producing the observed difference is typically addressed by the **p-value**, not directly by the CI in this manner.
- A CI provides a **range of plausible values** for the population parameter, not a probability about the role of chance in producing the observed difference.
*The study is adequately powered at the 95% confidence interval.*
- **Statistical power** is the probability of correctly rejecting a false null hypothesis, typically set at 80% or 90%. It is primarily determined by sample size, effect size, and alpha level.
- A 95% CI is a measure of the **precision** of an estimate, while power refers to the **ability of a study to detect an effect** if one exists. They are related but distinct concepts.
One-sided vs two-sided tests US Medical PG Question 4: A biostatistician is processing data for a large clinical trial she is working on. The study is analyzing the use of a novel pharmaceutical compound for the treatment of anorexia after chemotherapy with the outcome of interest being the change in weight while taking the drug. While most participants remained about the same weight or continued to lose weight while on chemotherapy, there were smaller groups of individuals who responded very positively to the orexic agent. As a result, the data had a strong positive skew. The biostatistician wishes to report the measures of central tendency for this project. Just by understanding the skew in the data, which of the following can be expected for this data set?
- A. Mean = median = mode
- B. Mean < median < mode
- C. Mean > median > mode (Correct Answer)
- D. Mean > median = mode
- E. Mean < median = mode
One-sided vs two-sided tests Explanation: ***Mean > median > mode***
- In a dataset with a **strong positive skew**, the tail of the distribution is on the right, pulled by a few **unusually large values**.
- These extreme high values disproportionately influence the **mean**, pulling it to the right (higher value), while the **median** (middle value) is less affected, and the **mode** (most frequent value) is often located at the peak of the distribution towards the left.
*Mean = median = mode*
- This relationship between the measures of central tendency is characteristic of a **perfectly symmetrical distribution**, such as a **normal distribution**, where there is no skew.
- In a symmetrical distribution, the mean, median, and mode are all located at the exact center of the data.
*Mean < median < mode*
- This order is typical for a dataset with a **negative skew**, where the tail is on the left due to a few **unusually small values**.
- In a negatively skewed distribution, the mean is pulled to the left (lower value) by the small values, making it less than the median and mode.
*Mean > median = mode*
- This configuration is generally not characteristic of standard skewed distributions and would imply a specific, less common bimodal or complex distribution shape where the mode coincides with the median, but the mean is pulled higher.
- While theoretically possible, it doesn't describe a typical positively skewed distribution where the mode is usually the lowest of the three.
*Mean < median = mode*
- This relationship would suggest a negatively skewed distribution where the median and mode are equal, but the mean is pulled to the left (lower value) by a leftward tail.
- Again, this is a less typical representation of a standard negatively skewed distribution, which often follows the Mean < Median < Mode pattern.
One-sided vs two-sided tests US Medical PG Question 5: A research group wants to assess the safety and toxicity profile of a new drug. A clinical trial is conducted with 20 volunteers to estimate the maximum tolerated dose and monitor the apparent toxicity of the drug. The study design is best described as which of the following phases of a clinical trial?
- A. Phase 0
- B. Phase III
- C. Phase V
- D. Phase II
- E. Phase I (Correct Answer)
One-sided vs two-sided tests Explanation: ***Phase I***
- **Phase I clinical trials** involve a small group of healthy volunteers (typically 20-100) to primarily assess **drug safety**, determine a safe dosage range, and identify side effects.
- The main goal is to establish the **maximum tolerated dose (MTD)** and evaluate the drug's pharmacokinetic and pharmacodynamic profiles.
*Phase 0*
- **Phase 0 trials** are exploratory studies conducted in a very small number of subjects (10-15) to gather preliminary data on a drug's **pharmacodynamics and pharmacokinetics** in humans.
- They involve microdoses, not intended to have therapeutic effects, and thus cannot determine toxicity or MTD.
*Phase III*
- **Phase III trials** are large-scale studies involving hundreds to thousands of patients to confirm the drug's **efficacy**, monitor side effects, compare it to standard treatments, and collect information that will allow the drug to be used safely.
- These trials are conducted after safety and initial efficacy have been established in earlier phases.
*Phase V*
- "Phase V" is not a standard, recognized phase in the traditional clinical trial classification (Phase 0, I, II, III, IV).
- This term might be used in some non-standard research contexts or for post-marketing studies that go beyond Phase IV surveillance, but it is not a formal phase for initial drug development.
*Phase II*
- **Phase II trials** involve several hundred patients with the condition the drug is intended to treat, focusing on **drug efficacy** and further evaluating safety.
- While safety is still monitored, the primary objective shifts to determining if the drug works for its intended purpose and at what dose.
One-sided vs two-sided tests US Medical PG Question 6: An academic medical center in the United States is approached by a pharmaceutical company to run a small clinical trial to test the effectiveness of its new drug, compound X. The company wants to know if the measured hemoglobin a1c (Hba1c) of patients with type 2 diabetes receiving metformin and compound X would be lower than that of control subjects receiving only metformin. After a year of study and data analysis, researchers conclude that the control and treatment groups did not differ significantly in their Hba1c levels.
However, parallel clinical trials in several other countries found that compound X led to a significant decrease in Hba1c. Interested in the discrepancy between these findings, the company funded a larger study in the United States, which confirmed that compound X decreased Hba1c levels. After compound X was approved by the FDA, and after several years of use in the general population, outcomes data confirmed that it effectively lowered Hba1c levels and increased overall survival. What term best describes the discrepant findings in the initial clinical trial run by institution A?
- A. Type I error
- B. Hawthorne effect
- C. Type II error (Correct Answer)
- D. Publication bias
- E. Confirmation bias
One-sided vs two-sided tests Explanation: ***Type II error***
- A **Type II error** occurs when a study fails to **reject a false null hypothesis**, meaning it concludes there is no significant difference or effect when one actually exists.
- In this case, the initial US trial incorrectly concluded that Compound X had no significant effect on HbA1c, while subsequent larger studies and real-world data proved it did.
*Type I error*
- A **Type I error** (alpha error) occurs when a study incorrectly **rejects a true null hypothesis**, concluding there is a significant difference or effect when there isn't.
- This scenario describes the opposite: the initial study failed to find an effect that genuinely existed, indicating a Type II error, not a Type I error.
*Hawthorne effect*
- The **Hawthorne effect** is a type of reactivity in which individuals modify an aspect of their behavior in response to their awareness of being observed.
- This effect does not explain the initial trial's failure to detect a real drug effect; rather, it relates to participants changing behavior due to study participation itself.
*Publication bias*
- **Publication bias** occurs when studies with positive or statistically significant results are more likely to be published than those with negative or non-significant results.
- While relevant to the literature as a whole, it doesn't explain the discrepancy in findings within a single drug's development where a real effect was initially missed.
*Confirmation bias*
- **Confirmation bias** is the tendency to search for, interpret, favor, and recall information in a way that confirms one's preexisting beliefs or hypotheses.
- This bias would likely lead researchers to *find* an effect if they expected one, or to disregard data that contradicts their beliefs, which is not what happened in the initial trial.
One-sided vs two-sided tests US Medical PG Question 7: A student health coordinator plans on leading a campus-wide HIV screening program that will be free for the entire undergraduate student body. The goal is to capture as many correct HIV diagnoses as possible with the fewest false positives. The coordinator consults with the hospital to see which tests are available to use for this program. Test A has a sensitivity of 0.92 and a specificity of 0.99. Test B has a sensitivity of 0.95 and a specificity of 0.96. Test C has a sensitivity of 0.98 and a specificity of 0.93. Which of the following testing schemes should the coordinator pursue?
- A. Test A on the entire student body followed by Test B on those who are positive
- B. Test A on the entire student body followed by Test C on those who are positive
- C. Test C on the entire student body followed by Test B on those who are positive
- D. Test C on the entire student body followed by Test A on those who are positive (Correct Answer)
- E. Test B on the entire student body followed by Test A on those who are positive
One-sided vs two-sided tests Explanation: ***Test C on the entire student body followed by Test A on those who are positive***
- To "capture as many correct HIV diagnoses as possible" (maximize true positives), the initial screening test should have the **highest sensitivity**. Test C has the highest sensitivity (0.98).
- To "capture as few false positives as possible" (maximize true negatives and confirm diagnoses), the confirmatory test should have the **highest specificity**. Test A has the highest specificity (0.99).
*Test A on the entire student body followed by Test B on those who are positive*
- Starting with Test A (sensitivity 0.92) would miss more true positive cases than starting with Test C (sensitivity 0.98), failing the goal of **capturing as many cases as possible**.
- Following with Test B (specificity 0.96) would result in more false positives than following with Test A (specificity 0.99).
*Test A on the entire student body followed by Test C on those who are positive*
- This scheme would miss many true positive cases initially due to Test A's lower sensitivity compared to Test C.
- Following with Test C would introduce more false positives than necessary, as it has a lower specificity (0.93) than Test A (0.99).
*Test C on the entire student body followed by Test B on those who are positive*
- While Test C is a good initial screen for its high sensitivity, following it with Test B (specificity 0.96) is less optimal than Test A (specificity 0.99) for minimizing false positives in the confirmation step.
- This combination would therefore yield more false positives in the confirmatory stage than using Test A.
*Test B on the entire student body followed by Test A on those who are positive*
- Test B has a sensitivity of 0.95, which is lower than Test C's sensitivity of 0.98, meaning it would miss more true positive cases at the initial screening stage.
- While Test A provides excellent specificity for confirmation, the initial screening step is suboptimal for the goal of capturing as many diagnoses as possible.
One-sided vs two-sided tests US Medical PG Question 8: A pharmaceutical corporation is developing a research study to evaluate a novel blood test to screen for breast cancer. They enrolled 800 patients in the study, half of which have breast cancer. The remaining enrolled patients are age-matched controls who do not have the disease. Of those in the diseased arm, 330 are found positive for the test. Of the patients in the control arm, only 30 are found positive. What is this test’s sensitivity?
- A. 330 / (330 + 30)
- B. 330 / (330 + 70) (Correct Answer)
- C. 370 / (30 + 370)
- D. 370 / (70 + 370)
- E. 330 / (400 + 400)
One-sided vs two-sided tests Explanation: ***330 / (330 + 70)***
- **Sensitivity** measures the proportion of actual **positives** that are correctly identified as such.
- In this study, there are **400 diseased patients** (half of 800). Of these, 330 tested positive (true positives), meaning 70 tested negative (false negatives). So sensitivity is **330 / (330 + 70)**.
*330 / (330 + 30)*
- This calculation represents the **positive predictive value**, which is the probability that subjects with a positive screening test truly have the disease. It uses **true positives / (true positives + false positives)**.
- It does not correctly calculate **sensitivity**, which requires knowing the total number of diseased individuals.
*370 / (30 + 370)*
- This expression is attempting to calculate **specificity**, which is the proportion of actual negatives that are correctly identified. It would be **true negatives / (true negatives + false positives)**.
- However, the numbers used are incorrect for specificity in this context given the data provided.
*370 / (70 + 370)*
- This formula is an incorrect combination of values and does not represent any standard epidemiological measure like **sensitivity** or **specificity**.
- It is attempting to combine false negatives (70) and true negatives (370 from control arm) in a non-standard way.
*330 / (400 + 400)*
- This calculation attempts to divide true positives by the total study population (800 patients).
- This metric represents the **prevalence of true positives within the entire study cohort**, not the test's **sensitivity**.
One-sided vs two-sided tests US Medical PG Question 9: The height of American adults is expected to follow a normal distribution, with a typical male adult having an average height of 69 inches with a standard deviation of 0.1 inches. An investigator has been informed about a community in the American Midwest with a history of heavy air and water pollution in which a lower mean height has been reported. The investigator plans to sample 30 male residents to test the claim that heights in this town differ significantly from the national average based on heights assumed be normally distributed. The significance level is set at 10% and the probability of a type 2 error is assumed to be 15%. Based on this information, which of the following is the power of the proposed study?
- A. 0.10
- B. 0.85 (Correct Answer)
- C. 0.90
- D. 0.15
- E. 0.05
One-sided vs two-sided tests Explanation: ***0.85***
- **Power** is defined as **1 - β**, where β is the **probability of a Type II error**.
- Given that the probability of a **Type II error (β)** is 15% or 0.15, the power of the study is 1 - 0.15 = **0.85**.
*0.10*
- This value represents the **significance level (α)**, which is the probability of committing a **Type I error** (rejecting a true null hypothesis).
- The significance level is distinct from the **power of the study**, which relates to Type II errors.
*0.90*
- This value would be the power if the **Type II error rate (β)** was 0.10 (1 - 0.10 = 0.90), but the question specifies a β of 0.15.
- It is also the complement of the significance level (1 - α), which is not the definition of power.
*0.15*
- This value is the **probability of a Type II error (β)**, not the power of the study.
- **Power** is the probability of correctly rejecting a false null hypothesis, which is 1 - β.
*0.05*
- While 0.05 is a common significance level (α), it is not given as the significance level in this question (which is 0.10).
- This value also does not represent the power of the study, which would be calculated using the **Type II error rate**.
One-sided vs two-sided tests US Medical PG Question 10: A patient with HCC and a long history of alcohol dependence and chronic hepatitis C has been using the mTOR inhibitor sirolimus 100 mg for cancer treatment. Her cancer has shown a partial response. She also has a history of hypertension and poorly controlled type 2 diabetes mellitus complicated by diabetic retinopathy. Current medications include enalapril and insulin. She asks her oncologist and hepatologist if she could try everolimus for its purported survival benefit in treating HCC. Based on clinical considerations, which of the following statements is most accurate?
- A. The patient should start everolimus 50 mg because of the survival benefit relative to sirolimus 100 mg
- B. The patient is not a good candidate for everolimus due to her history of hypertension
- C. The patient should start everolimus 100 mg because of the survival benefit relative to sirolimus 100 mg
- D. The patient should start everolimus 50 mg because of her history of alcohol use disorder and hepatitis C
- E. The patient is not a good candidate for everolimus due to her history of diabetes (Correct Answer)
One-sided vs two-sided tests Explanation: ***The patient is not a good candidate for Noxbinle due to her history of diabetes***
- The current medication is sirolimus, an **mTOR inhibitor** and its successor everolimus, also an mTOR inhibitor, is not beneficial for this patient due to her **poorly controlled type 2 diabetes mellitus**.
- mTOR inhibitors, including everolimus, are known to **worsen hyperglycemia** and **accelerate the progression of diabetes**, making it contraindicated in patients with already complicated diabetes.
*The patient should start everolimus 50 mg because of the survival benefit relative to sirolimus 100 mg*
- There is **no established evidence** that everolimus at any dose offers a superior survival benefit compared to sirolimus in HCC, particularly after a partial response to sirolimus.
- **Switching mTOR inhibitors** without a compelling clinical reason, especially with existing comorbidities, is not standard practice.
*The patient is not a good candidate for everolimus due to her history of hypertension*
- While mTOR inhibitors can contribute to **hypertension**, this patient is already on **enalapril** for her existing hypertension.
- Her **poorly controlled diabetes** presents a more direct and severe contraindication due to the metabolic side effects of everolimus.
*The patient should start everolimus 100 mg because of the survival benefit relative to sirolimus 100 mg*
- No clinical data supports a **superior survival benefit** of everolimus 100 mg over sirolimus 100 mg in HCC.
- Given the patient's existing **poorly controlled diabetes**, increasing the dose of an mTOR inhibitor or switching to an equivalent dose of another would heighten the risk of severe metabolic complications.
*The patient should start everolimus 50 mg because of her history of alcohol use disorder and hepatitis C*
- The patient's history of alcohol dependence and chronic hepatitis C are **risk factors for HCC** but do not directly contraindicate a specific dose of everolimus more than her diabetes.
- While liver impairment due to these conditions might influence dosing of various medications, the **primary concern for everolimus** in this case remains the uncontrolled diabetes.
More One-sided vs two-sided tests US Medical PG questions available in the OnCourse app. Practice MCQs, flashcards, and get detailed explanations.