Research question formulation US Medical PG Practice Questions and MCQs
Practice US Medical PG questions for Research question formulation. These multiple choice questions (MCQs) cover important concepts and help you prepare for your exams.
Research question formulation US Medical PG Question 1: An investigator is measuring the blood calcium level in a sample of female cross country runners and a control group of sedentary females. If she would like to compare the means of the two groups, which statistical test should she use?
- A. Chi-square test
- B. Linear regression
- C. t-test (Correct Answer)
- D. ANOVA (Analysis of Variance)
- E. F-test
Research question formulation Explanation: ***t-test***
- A **t-test** is appropriate for comparing the means of two independent groups, such as the blood calcium levels between runners and sedentary females.
- It assesses whether the observed difference between the two sample means is statistically significant or occurred by chance.
*Chi-square test*
- The **chi-square test** is used to analyze categorical data to determine if there is a significant association between two variables.
- It is not suitable for comparing continuous variables like blood calcium levels.
*Linear regression*
- **Linear regression** is used to model the relationship between a dependent variable (outcome) and one or more independent variables (predictors).
- It aims to predict the value of a variable based on the value of another, rather than comparing means between groups.
*ANOVA (Analysis of Variance)*
- **ANOVA** is used to compare the means of **three or more independent groups**.
- Since there are only two groups being compared in this scenario, a t-test is more specific and appropriate.
*F-test*
- The **F-test** is primarily used to compare the variances of two populations or to assess the overall significance of a regression model.
- While it is the basis for ANOVA, it is not the direct test for comparing the means of two groups.
Research question formulation US Medical PG Question 2: A research group wants to assess the safety and toxicity profile of a new drug. A clinical trial is conducted with 20 volunteers to estimate the maximum tolerated dose and monitor the apparent toxicity of the drug. The study design is best described as which of the following phases of a clinical trial?
- A. Phase 0
- B. Phase III
- C. Phase V
- D. Phase II
- E. Phase I (Correct Answer)
Research question formulation Explanation: ***Phase I***
- **Phase I clinical trials** involve a small group of healthy volunteers (typically 20-100) to primarily assess **drug safety**, determine a safe dosage range, and identify side effects.
- The main goal is to establish the **maximum tolerated dose (MTD)** and evaluate the drug's pharmacokinetic and pharmacodynamic profiles.
*Phase 0*
- **Phase 0 trials** are exploratory studies conducted in a very small number of subjects (10-15) to gather preliminary data on a drug's **pharmacodynamics and pharmacokinetics** in humans.
- They involve microdoses, not intended to have therapeutic effects, and thus cannot determine toxicity or MTD.
*Phase III*
- **Phase III trials** are large-scale studies involving hundreds to thousands of patients to confirm the drug's **efficacy**, monitor side effects, compare it to standard treatments, and collect information that will allow the drug to be used safely.
- These trials are conducted after safety and initial efficacy have been established in earlier phases.
*Phase V*
- "Phase V" is not a standard, recognized phase in the traditional clinical trial classification (Phase 0, I, II, III, IV).
- This term might be used in some non-standard research contexts or for post-marketing studies that go beyond Phase IV surveillance, but it is not a formal phase for initial drug development.
*Phase II*
- **Phase II trials** involve several hundred patients with the condition the drug is intended to treat, focusing on **drug efficacy** and further evaluating safety.
- While safety is still monitored, the primary objective shifts to determining if the drug works for its intended purpose and at what dose.
Research question formulation US Medical PG Question 3: A student health coordinator plans on leading a campus-wide HIV screening program that will be free for the entire undergraduate student body. The goal is to capture as many correct HIV diagnoses as possible with the fewest false positives. The coordinator consults with the hospital to see which tests are available to use for this program. Test A has a sensitivity of 0.92 and a specificity of 0.99. Test B has a sensitivity of 0.95 and a specificity of 0.96. Test C has a sensitivity of 0.98 and a specificity of 0.93. Which of the following testing schemes should the coordinator pursue?
- A. Test A on the entire student body followed by Test B on those who are positive
- B. Test A on the entire student body followed by Test C on those who are positive
- C. Test C on the entire student body followed by Test B on those who are positive
- D. Test C on the entire student body followed by Test A on those who are positive (Correct Answer)
- E. Test B on the entire student body followed by Test A on those who are positive
Research question formulation Explanation: ***Test C on the entire student body followed by Test A on those who are positive***
- To "capture as many correct HIV diagnoses as possible" (maximize true positives), the initial screening test should have the **highest sensitivity**. Test C has the highest sensitivity (0.98).
- To "capture as few false positives as possible" (maximize true negatives and confirm diagnoses), the confirmatory test should have the **highest specificity**. Test A has the highest specificity (0.99).
*Test A on the entire student body followed by Test B on those who are positive*
- Starting with Test A (sensitivity 0.92) would miss more true positive cases than starting with Test C (sensitivity 0.98), failing the goal of **capturing as many cases as possible**.
- Following with Test B (specificity 0.96) would result in more false positives than following with Test A (specificity 0.99).
*Test A on the entire student body followed by Test C on those who are positive*
- This scheme would miss many true positive cases initially due to Test A's lower sensitivity compared to Test C.
- Following with Test C would introduce more false positives than necessary, as it has a lower specificity (0.93) than Test A (0.99).
*Test C on the entire student body followed by Test B on those who are positive*
- While Test C is a good initial screen for its high sensitivity, following it with Test B (specificity 0.96) is less optimal than Test A (specificity 0.99) for minimizing false positives in the confirmation step.
- This combination would therefore yield more false positives in the confirmatory stage than using Test A.
*Test B on the entire student body followed by Test A on those who are positive*
- Test B has a sensitivity of 0.95, which is lower than Test C's sensitivity of 0.98, meaning it would miss more true positive cases at the initial screening stage.
- While Test A provides excellent specificity for confirmation, the initial screening step is suboptimal for the goal of capturing as many diagnoses as possible.
Research question formulation US Medical PG Question 4: You submit a paper to a prestigious journal about the effects of coffee consumption on mesothelioma risk. The first reviewer lauds your clinical and scientific acumen, but expresses concern that your study does not have adequate statistical power. Statistical power refers to which of the following?
- A. The probability of detecting an association when no association exists.
- B. The probability of not detecting an association when an association does exist.
- C. The probability of detecting an association when an association does exist. (Correct Answer)
- D. The first derivative of work.
- E. The square root of the variance.
Research question formulation Explanation: ***The probability of detecting an association when an association does exist.***
- **Statistical power** is defined as the probability that a study will correctly reject a false null hypothesis, meaning it will detect a true effect or association if one exists.
- A study with **adequate statistical power** is less likely to miss a real effect.
*The probability of detecting an association when no association exists.*
- This describes a **Type I error** or **false positive**, often represented by **alpha (α)**.
- It is the probability of incorrectly concluding an effect or association exists when, in reality, there is none.
*The probability of not detecting an association when an association does exist.*
- This refers to a **Type II error** or **false negative**, represented by **beta (β)**.
- **Statistical power** is calculated as **1 - β**, so this option describes the complement of power.
*The first derivative of work.*
- The first derivative of work with respect to time represents **power** in physics, which is the rate at which work is done.
- This option is a **distractor** from physics and is unrelated to statistical power in research.
*The square root of the variance.*
- The **square root of the variance** is the **standard deviation**, a measure of the dispersion or spread of data.
- This is a statistical concept but is not the definition of statistical power.
Research question formulation US Medical PG Question 5: A 65-year-old man is admitted to the hospital because of a 1-month history of fatigue, intermittent fever, and weakness. Results from a peripheral blood smear taken during his evaluation are indicative of possible acute myeloid leukemia. Bone marrow aspiration and subsequent cytogenetic studies confirm the diagnosis. The physician sets aside an appointed time-slot and arranges a meeting in a quiet office to inform him about the diagnosis and discuss his options. He has been encouraged to bring someone along to the appointment if he wanted. He comes to your office at the appointed time with his daughter. He appears relaxed, with a full range of affect. Which of the following is the most appropriate opening statement in this situation?
- A. Your lab reports show that you have an acute myeloid leukemia
- B. What is your understanding of the reasons we did bone marrow aspiration and cytogenetic studies? (Correct Answer)
- C. You must be curious and maybe even anxious about the results of your tests.
- D. I may need to refer you to a blood cancer specialist because of your diagnosis. You may need chemotherapy or radiotherapy, which we are not equipped for.
- E. Would you like to know all the details of your diagnosis, or would you prefer I just explain to you what our options are?
Research question formulation Explanation: ***"What is your understanding of the reasons we did bone marrow aspiration and cytogenetic studies?"***
- This **open-ended question** allows the patient to express their current knowledge and perceptions, which helps the physician tailor the discussion.
- It establishes a **patient-centered approach**, respecting the patient's existing understanding and preparing them for further information.
*"You must be curious and maybe even anxious about the results of your tests."*
- While empathic, this statement makes an **assumption about the patient's feelings** rather than inviting them to share their own.
- It is often better to ask directly or use more open-ended questions that allow the patient to express their true emotions, especially given their **relaxed demeanor**.
*"I may need to refer you to a blood cancer specialist because of your diagnosis. You may need chemotherapy or radiotherapy, which we are not equipped for.”"*
- This statement immediately introduces **overwhelming and potentially alarming information** (referral, chemotherapy, radiotherapy) without first establishing the diagnosis or assessing the patient's readiness to receive it.
- It prematurely jumps to treatment and logistics, potentially causing **unnecessary distress** before the patient has processed the core diagnosis.
*"Would you like to know all the details of your diagnosis, or would you prefer I just explain to you what our options are?""*
- While it attempts to assess the patient's preference for information, this question is a **closed-ended "either/or" choice** that might limit the patient's ability to express nuanced needs.
- It also prematurely introduces the idea of "options" without first explaining the diagnosis in an understandable context.
*"Your lab reports show that you have an acute myeloid leukemia"*
- This is a **direct and blunt delivery of a serious diagnosis** without any preparatory context or assessment of the patient's existing knowledge or emotional state.
- Delivering such news abruptly can be shocking and overwhelming, potentially **hindering effective communication** and rapport building.
Research question formulation US Medical PG Question 6: You are interested in studying the etiology of heart failure reduced ejection fraction (HFrEF) and attempt to construct an appropriate design study. Specifically, you wish to look for potential causality between dietary glucose consumption and HFrEF. Which of the following study designs would allow you to assess for and determine this causality?
- A. Cross-sectional study
- B. Case series
- C. Cohort study (Correct Answer)
- D. Case-control study
- E. Randomized controlled trial
Research question formulation Explanation: ***Cohort study***
- A **cohort study** observes a group of individuals over time to identify risk factors and outcomes, allowing for the assessment of **temporal relationships** between exposure (dietary glucose) and outcome (HFrEF).
- This design is suitable for establishing a potential **causal link** as it tracks participants from exposure to outcome, enabling the calculation of incidence rates and relative risks.
*Cross-sectional study*
- A **cross-sectional study** measures exposure and outcome simultaneously at a single point in time, making it impossible to determine the **temporal sequence** of events.
- This design can only identify **associations** or correlations, not causation, as it cannot establish whether high glucose consumption preceded HFrEF.
*Case series*
- A **case series** describes characteristics of a group of patients with a particular disease or exposure, often to highlight unusual clinical features, but it lacks a **comparison group**.
- It cannot assess causality because it does not provide information on the frequency of exposure in healthy individuals or the incidence of the disease in unexposed individuals.
*Case-control study*
- A **case-control study** compares individuals with the outcome (cases) to those without the outcome (controls) to determine past exposures, which makes it prone to **recall bias**.
- While it can suggest associations, it cannot definitively establish a temporal relationship or causation as the outcome is already known when exposure is assessed.
*Randomized controlled trial*
- A **randomized controlled trial (RCT)** is the gold standard for establishing causation by randomly assigning participants to an intervention or control group, but it may not be ethical or feasible for studying long-term dietary exposures and chronic diseases like HFrEF due to the long follow-up period and complexity of diet.
- While ideal for causality, directly controlling and randomizing dietary glucose intake over decades to observe HFrEF development might be practically challenging or unethical.
Research question formulation US Medical PG Question 7: A first-year medical student is conducting a summer project with his medical school's pediatrics department using adolescent IQ data from a database of 1,252 patients. He observes that the mean IQ of the dataset is 100. The standard deviation was calculated to be 10. Assuming that the values are normally distributed, approximately 87% of the measurements will fall in between which of the following limits?
- A. 85–115 (Correct Answer)
- B. 95–105
- C. 65–135
- D. 80–120
- E. 70–130
Research question formulation Explanation: ***85–115***
- For a **normal distribution**, approximately 87% of data falls within **±1.5 standard deviations** from the mean.
- With a mean of 100 and a standard deviation of 10, the range is 100 ± (1.5 * 10) = 100 ± 15, which gives **85–115**.
*95–105*
- This range represents **±0.5 standard deviations** from the mean (100 ± 5), which covers only about 38% of the data.
- This is a much narrower range and does not encompass 87% of the observations as required.
*65–135*
- This range represents **±3.5 standard deviations** from the mean (100 ± 35), which would cover over 99.9% of the data.
- Thus, this interval is too wide for 87% of the measurements.
*80–120*
- This range represents **±2 standard deviations** from the mean (100 ± 20), which covers approximately 95% of the data.
- While a common interval, it is wider than necessary for 87% of the data.
*70–130*
- This range represents **±3 standard deviations** from the mean (100 ± 30), which covers approximately 99.7% of the data.
- This interval is significantly wider than required to capture 87% of the data.
Research question formulation US Medical PG Question 8: Many large clinics have noticed that the prevalence of primary biliary cholangitis (PBC) has increased significantly over the past 20 years. An epidemiologist is working to identify possible reasons for this. After analyzing a series of nationwide health surveillance databases, the epidemiologist finds that the incidence of PBC has remained stable over the past 20 years. Which of the following is the most plausible explanation for the increased prevalence of PBC?
- A. Improved quality of care for PBC (Correct Answer)
- B. Increased availability of diagnostic testing for PBC
- C. Increased exposure to environmental risk factors for PBC
- D. Increased awareness of PBC among clinicians
- E. Increased average age of the population at risk for PBC
Research question formulation Explanation: ***Improved quality of care for PBC***
- This leads to a **longer survival time** for patients with PBC. When incidence remains stable but patients live longer, the cumulative number of living cases (prevalence) naturally increases.
- An increase in prevalence with stable incidence is a classic indicator of **improved patient survival** due to better management or treatment.
*Increased availability of diagnostic testing for PBC*
- This would primarily impact the **incidence** of PBC by detecting more cases that were previously undiagnosed. The question states that the incidence has remained stable.
- While improved diagnostics might initially increase *reported* incidence, if the true incidence is stable, it wouldn't explain a sustained rise in prevalence without a corresponding change in incidence or survival.
*Increased exposure to environmental risk factors for PBC*
- This would directly lead to an **increase in the incidence** of PBC, as more people would be developing the disease.
- Since the incidence is stable, an increase in environmental risk factors is not the most plausible explanation for increased prevalence.
*Increased awareness of PBC among clinicians*
- Similar to increased diagnostic testing, increased awareness would likely lead to the diagnosis of more new cases, thus **increasing the incidence** of PBC.
- A stable incidence despite increased awareness means that the actual rate of new cases developing the disease has not changed, ruling this out as the primary cause of increased prevalence.
*Increased average age of the population at risk for PBC*
- An aging population could potentially increase the incidence of age-related diseases. However, if the **incidence has remained stable**, it implies that even with an older population, the rate of new diagnoses has not increased.
- While age is a risk factor for PBC, an increase in prevalence without a change in incidence suggests a factor influencing the duration of the disease rather than its onset.
Research question formulation US Medical PG Question 9: A statistician wants to study the effects of a medicine in three groups-humans, animals, and plants. He then selects randomly from these three groups. Which type of sampling is being performed?
- A. Simple random sampling
- B. Systematic sampling
- C. Stratified random sampling (Correct Answer)
- D. Cluster sampling
- E. Convenience sampling
Research question formulation Explanation: ***Stratified random sampling***
- This method involves dividing the population into **distinct subgroups (strata)** based on shared characteristics (in this case, humans, animals, and plants), and then performing a simple random sample within each stratum.
- This ensures that all subgroups are proportionally represented in the sample, which is appropriate when studying effects across different biological categories.
*Simple random sampling*
- This method involves selecting individuals from the entire population **purely by chance**, without first dividing them into subgroups.
- It would not guarantee representation from all three distinct groups (humans, animals, and plants), which is essential for studying differential effects.
*Systematic sampling*
- This involves selecting samples at **regular intervals** from an ordered list or sequence.
- This method is not suitable here because the population is divided into distinct, non-ordered groups rather than a continuous sequence.
*Cluster sampling*
- This method involves dividing the population into **clusters**, then randomly selecting some clusters and sampling all individuals within those selected clusters.
- In this scenario, the initial groups (humans, animals, plants) are strata, not clusters, as the intent is to sample from within each group, not to treat the groups themselves as primary sampling units.
*Convenience sampling*
- This is a **non-probability sampling method** where subjects are selected based on ease of access rather than random selection.
- The question explicitly states that random selection is performed from each group, ruling out convenience sampling.
Research question formulation US Medical PG Question 10: A study was undertaken to establish the relationship between the consumption of a vegetarian or non-vegetarian diet and the presence of diseases. Which statistical test should be used?
- A. Chi-square test (Correct Answer)
- B. T-test
- C. ANOVA
- D. Fisher's exact test
- E. Mann-Whitney U test
Research question formulation Explanation: ***Chi-square test***
- The **chi-square test** is appropriate when analyzing the relationship between two **categorical variables**. In this scenario, "diet type" (vegetarian/non-vegetarian) and "presence of disease" (yes/no) are both categorical variables.
- This test determines if there is a statistically significant association between the frequency counts of these two variables in a contingency table.
*T-test*
- A **t-test** is used to compare the **means** of two groups, typically when the dependent variable is continuous.
- This test is unsuitable here because the presence of disease and diet type are categorical, not continuous, variables.
*ANOVA*
- **ANOVA** (Analysis of Variance) is used to compare the **means** of three or more groups, often with a continuous dependent variable.
- Similar to the t-test, ANOVA is not applicable as the study involves categorical variables, not the comparison of means across multiple groups.
*Fisher's exact test*
- **Fisher's exact test** is similar to the chi-square test but specifically used for **small sample sizes** where the expected frequencies in any cell of the contingency table are less than 5.
- While it analyzes categorical data, the chi-square test is the more general and commonly preferred test for larger sample sizes, which is generally assumed unless otherwise specified.
*Mann-Whitney U test*
- The **Mann-Whitney U test** is a non-parametric test used to compare differences between two independent groups when the dependent variable is **ordinal or continuous** but not normally distributed.
- This test is not appropriate for analyzing the association between two categorical variables, as it requires at least one variable to have ranked or continuous data.
More Research question formulation US Medical PG questions available in the OnCourse app. Practice MCQs, flashcards, and get detailed explanations.