Sampling techniques US Medical PG Practice Questions and MCQs
Practice US Medical PG questions for Sampling techniques. These multiple choice questions (MCQs) cover important concepts and help you prepare for your exams.
Sampling techniques US Medical PG Question 1: A study is funded by the tobacco industry to examine the association between smoking and lung cancer. They design a study with a prospective cohort of 1,000 smokers between the ages of 20-30. The length of the study is five years. After the study period ends, they conclude that there is no relationship between smoking and lung cancer. Which of the following study features is the most likely reason for the failure of the study to note an association between tobacco use and cancer?
- A. Late-look bias
- B. Latency period (Correct Answer)
- C. Confounding
- D. Effect modification
- E. Pygmalion effect
Sampling techniques Explanation: ***Latency period***
- **Lung cancer** typically has a **long latency period**, often **20-30+ years**, between initial exposure to tobacco carcinogens and the development of clinically detectable disease.
- A **five-year study duration** in young smokers (ages 20-30) is **far too short** to observe the development of lung cancer, which explains the false negative finding.
- This represents a **fundamental flaw in study design** rather than a bias—the biological timeline of disease development was not adequately considered.
*Late-look bias*
- **Late-look bias** occurs when a study enrolls participants who have already survived the early high-risk period of a disease, leading to **underestimation of true mortality or incidence**.
- Also called **survival bias**, it involves studying a population that has already been "selected" by survival.
- This is not applicable here, as the study simply ended before sufficient time elapsed for disease to develop.
*Confounding*
- **Confounding** occurs when a third variable is associated with both the exposure and outcome, distorting the apparent relationship between them.
- While confounding can affect study results, it would not completely eliminate the detection of a strong, well-established association like smoking and lung cancer in a properly conducted prospective cohort study.
- The issue here is temporal (insufficient follow-up time), not the presence of an unmeasured confounder.
*Effect modification*
- **Effect modification** (also called interaction) occurs when the magnitude of an association between exposure and outcome differs across levels of a third variable.
- This represents a **true biological phenomenon**, not a study design flaw or bias.
- It would not explain the complete failure to detect any association.
*Pygmalion effect*
- The **Pygmalion effect** (observer-expectancy effect) refers to a psychological phenomenon where higher expectations lead to improved performance in the observed subjects.
- This concept is relevant to **behavioral and educational research**, not to objective epidemiological studies of disease incidence.
- It has no relevance to the biological relationship between carcinogen exposure and cancer development.
Sampling techniques US Medical PG Question 2: A research team develops a new monoclonal antibody checkpoint inhibitor for advanced melanoma that has shown promise in animal studies as well as high efficacy and low toxicity in early phase human clinical trials. The research team would now like to compare this drug to existing standard of care immunotherapy for advanced melanoma. The research team decides to conduct a non-randomized study where the novel drug will be offered to patients who are deemed to be at risk for toxicity with the current standard of care immunotherapy, while patients without such risk factors will receive the standard treatment. Which of the following best describes the level of evidence that this study can offer?
- A. Level 1
- B. Level 3 (Correct Answer)
- C. Level 5
- D. Level 4
- E. Level 2
Sampling techniques Explanation: ***Level 3***
- A **non-randomized controlled trial** like the one described, where patient assignment to treatment groups is based on specific characteristics (risk of toxicity), falls into Level 3 evidence.
- This level typically includes **non-randomized controlled trials** and **well-designed cohort studies** with comparison groups, which are prone to selection bias and confounding.
- The study compares two treatments but lacks randomization, making it Level 3 evidence.
*Level 1*
- Level 1 evidence is the **highest level of evidence**, derived from **systematic reviews and meta-analyses** of multiple well-designed randomized controlled trials or large, high-quality randomized controlled trials.
- The described study is explicitly stated as non-randomized, ruling out Level 1.
*Level 2*
- Level 2 evidence involves at least one **well-designed randomized controlled trial** (RCT) or **systematic reviews** of randomized trials.
- The current study is *non-randomized*, which means it cannot be classified as Level 2 evidence, as randomization is a key criterion for this level.
*Level 4*
- Level 4 evidence includes **case series**, **case-control studies**, and **poorly designed cohort or case-control studies**.
- While the study is non-randomized, it is a controlled comparative trial rather than a case series or retrospective case-control study, placing it at Level 3.
*Level 5*
- Level 5 evidence is the **lowest level of evidence**, typically consisting of **expert opinion** without explicit critical appraisal, or based on physiology, bench research, or animal studies.
- While the drug was initially tested in animal studies, the current human comparative study offers a higher level of evidence than expert opinion or preclinical data.
Sampling techniques US Medical PG Question 3: A surgeon is interested in studying how different surgical techniques impact the healing of tendon injuries. In particular, he will compare 3 different types of suture repairs biomechanically in order to determine the maximum load before failure of the tendon 2 weeks after repair. He collects data on maximum load for 90 different repaired tendons from an animal model. Thirty tendons were repaired using each of the different suture techniques. Which of the following statistical measures is most appropriate for analyzing the results of this study?
- A. Chi-squared
- B. Wilcoxon rank sum
- C. Pearson r coefficient
- D. Student t-test
- E. ANOVA (Correct Answer)
Sampling techniques Explanation: ***ANOVA***
- **ANOVA (Analysis of Variance)** is appropriate here because it compares the means of **three or more independent groups** (the three different suture techniques) on a continuous dependent variable (maximum load before failure).
- The study has three distinct repair techniques, each with 30 tendons, making ANOVA suitable for determining if there are statistically significant differences among their mean failure loads.
*Chi-squared*
- The **Chi-squared test** is used for analyzing **categorical data** (frequencies or proportions) to determine if there is an association between two nominal variables.
- This study involves quantitative measurement (maximum load), not categorical data, making Chi-squared inappropriate.
*Wilcoxon rank sum*
- The **Wilcoxon rank sum test** (also known as Mann-Whitney U test) is a **non-parametric test** used to compare two independent groups when the data is not normally distributed or is ordinal.
- While the study has independent groups, it involves three groups, and the dependent variable is continuous, making ANOVA a more powerful and appropriate choice assuming normal distribution.
*Pearson r coefficient*
- The **Pearson r coefficient** measures the **strength and direction of a linear relationship between two continuous variables**.
- This study aims to compare means across different groups, not to determine the correlation between two continuous variables.
*Student t-test*
- The **Student t-test** is used to compare the means of **exactly two groups** (either independent or paired) on a continuous dependent variable.
- This study involves comparing three different suture techniques, not just two, making the t-test unsuitable.
Sampling techniques US Medical PG Question 4: A clinical trial investigating a new biomedical device used to correct congenital talipes equinovarus (club foot) in infants has recently been published. The study was a preliminary investigation of a new device and as such the sample size is only 20 participants. The results indicate that the new biomedical device is less efficacious than the current standard of care of serial casting (p < 0.001), but the authors mention in the conclusion that it may be due to a single outlier--a patient whose foot remained uncorrected by the conclusion of the study. Which of the following descriptive statistics is the least sensitive to outliers?
- A. Standard deviation
- B. Median (Correct Answer)
- C. Mean
- D. Variance
- E. Mode
Sampling techniques Explanation: ***Median***
- The **median** is the middle value in a dataset when ordered from least to greatest, making it inherently resistant to extreme values or **outliers**.
- It describes the central tendency without being skewed by a single unusually high or low data point, unlike the mean.
- Among measures of central tendency, the median is the **most robust** to outliers.
*Standard deviation*
- **Standard deviation** measures the spread of data points around the mean, and because it is based on the **mean**, it is highly sensitive to outliers.
- A single outlier can significantly increase the standard deviation, making the data appear more dispersed than it actually is for the majority of observations.
*Mean*
- The **mean** is calculated by summing all values and dividing by the number of values, which makes it directly affected by every data point, especially extreme ones.
- A single **outlier** can pull the mean significantly towards its value, misrepresenting the central tendency of the majority of the data.
*Variance*
- **Variance** is the average of the squared differences from the mean, and like standard deviation, its calculation heavily relies on the **mean**.
- Squaring the differences amplifies the impact of outliers, making variance very sensitive to extreme values.
*Mode*
- The **mode** represents the most frequently occurring value in a dataset and is also resistant to outliers since it only depends on frequency of occurrence.
- However, in small datasets or datasets without repeated values, the mode may be **undefined or uninformative**, making it less useful for describing central tendency compared to the median.
Sampling techniques US Medical PG Question 5: An investigator is measuring the blood calcium level in a sample of female cross country runners and a control group of sedentary females. If she would like to compare the means of the two groups, which statistical test should she use?
- A. Chi-square test
- B. Linear regression
- C. t-test (Correct Answer)
- D. ANOVA (Analysis of Variance)
- E. F-test
Sampling techniques Explanation: ***t-test***
- A **t-test** is appropriate for comparing the means of two independent groups, such as the blood calcium levels between runners and sedentary females.
- It assesses whether the observed difference between the two sample means is statistically significant or occurred by chance.
*Chi-square test*
- The **chi-square test** is used to analyze categorical data to determine if there is a significant association between two variables.
- It is not suitable for comparing continuous variables like blood calcium levels.
*Linear regression*
- **Linear regression** is used to model the relationship between a dependent variable (outcome) and one or more independent variables (predictors).
- It aims to predict the value of a variable based on the value of another, rather than comparing means between groups.
*ANOVA (Analysis of Variance)*
- **ANOVA** is used to compare the means of **three or more independent groups**.
- Since there are only two groups being compared in this scenario, a t-test is more specific and appropriate.
*F-test*
- The **F-test** is primarily used to compare the variances of two populations or to assess the overall significance of a regression model.
- While it is the basis for ANOVA, it is not the direct test for comparing the means of two groups.
Sampling techniques US Medical PG Question 6: You are reading through a recent article that reports significant decreases in all-cause mortality for patients with malignant melanoma following treatment with a novel biological infusion. Which of the following choices refers to the probability that a study will find a statistically significant difference when one truly does exist?
- A. Type II error
- B. Type I error
- C. Confidence interval
- D. p-value
- E. Power (Correct Answer)
Sampling techniques Explanation: ***Power***
- **Power** is the probability that a study will correctly reject the null hypothesis when it is, in fact, false (i.e., will find a statistically significant difference when one truly exists).
- A study with high power minimizes the risk of a **Type II error** (failing to detect a real effect).
*Type II error*
- A **Type II error** (or **beta error**) occurs when a study fails to reject a false null hypothesis, meaning it concludes there is no significant difference when one actually exists.
- This is the **opposite** of what the question describes, which asks for the probability of *finding* a difference.
*Type I error*
- A **Type I error** (or **alpha error**) occurs when a study incorrectly rejects a true null hypothesis, concluding there is a significant difference when one does not actually exist.
- This relates to the **p-value** and the level of statistical significance (e.g., p < 0.05).
*Confidence interval*
- A **confidence interval** provides a range of values within which the true population parameter is likely to lie with a certain degree of confidence (e.g., 95%).
- It does not directly represent the probability of finding a statistically significant difference when one truly exists.
*p-value*
- The **p-value** is the probability of observing data as extreme as, or more extreme than, that obtained in the study, assuming the null hypothesis is true.
- It is used to determine statistical significance, but it is not the probability of detecting a true effect.
Sampling techniques US Medical PG Question 7: A biostatistician is processing data for a large clinical trial she is working on. The study is analyzing the use of a novel pharmaceutical compound for the treatment of anorexia after chemotherapy with the outcome of interest being the change in weight while taking the drug. While most participants remained about the same weight or continued to lose weight while on chemotherapy, there were smaller groups of individuals who responded very positively to the orexic agent. As a result, the data had a strong positive skew. The biostatistician wishes to report the measures of central tendency for this project. Just by understanding the skew in the data, which of the following can be expected for this data set?
- A. Mean = median = mode
- B. Mean < median < mode
- C. Mean > median > mode (Correct Answer)
- D. Mean > median = mode
- E. Mean < median = mode
Sampling techniques Explanation: ***Mean > median > mode***
- In a dataset with a **strong positive skew**, the tail of the distribution is on the right, pulled by a few **unusually large values**.
- These extreme high values disproportionately influence the **mean**, pulling it to the right (higher value), while the **median** (middle value) is less affected, and the **mode** (most frequent value) is often located at the peak of the distribution towards the left.
*Mean = median = mode*
- This relationship between the measures of central tendency is characteristic of a **perfectly symmetrical distribution**, such as a **normal distribution**, where there is no skew.
- In a symmetrical distribution, the mean, median, and mode are all located at the exact center of the data.
*Mean < median < mode*
- This order is typical for a dataset with a **negative skew**, where the tail is on the left due to a few **unusually small values**.
- In a negatively skewed distribution, the mean is pulled to the left (lower value) by the small values, making it less than the median and mode.
*Mean > median = mode*
- This configuration is generally not characteristic of standard skewed distributions and would imply a specific, less common bimodal or complex distribution shape where the mode coincides with the median, but the mean is pulled higher.
- While theoretically possible, it doesn't describe a typical positively skewed distribution where the mode is usually the lowest of the three.
*Mean < median = mode*
- This relationship would suggest a negatively skewed distribution where the median and mode are equal, but the mean is pulled to the left (lower value) by a leftward tail.
- Again, this is a less typical representation of a standard negatively skewed distribution, which often follows the Mean < Median < Mode pattern.
Sampling techniques US Medical PG Question 8: An investigator is studying the effects of zinc deprivation on cancer cell proliferation. It is hypothesized that because zinc is known to be a component of transcription factor motifs, zinc deprivation will result in slower tumor growth. To test this hypothesis, tumor cells are cultured on media containing low and high concentrations of zinc. During the experiment, a labeled oligonucleotide probe is used to identify the presence of a known transcription factor. The investigator most likely used which of the following laboratory techniques?
- A. ELISA
- B. PCR
- C. Western blot
- D. Northern blot
- E. Southwestern blot (Correct Answer)
Sampling techniques Explanation: ***Southwestern blot***
- A **Southwestern blot** specifically identifies **DNA-binding proteins** (such as transcription factors) by detecting their ability to bind to specific **labeled DNA oligonucleotide probes**
- The technique involves: protein separation by gel electrophoresis → transfer to membrane → probing with **labeled double-stranded DNA oligonucleotide**
- This directly answers the question: using a labeled oligonucleotide probe to identify a transcription factor
*ELISA*
- **ELISA** detects and quantifies proteins using **antibody-antigen interactions**, not DNA-binding activity
- While it could detect the presence of a transcription factor protein, it cannot assess the protein's ability to bind to specific DNA sequences
- Does not utilize oligonucleotide probes for detection
*PCR*
- **PCR** amplifies specific **DNA sequences** but does not detect or characterize proteins
- This technique would amplify DNA, not identify DNA-binding proteins
- Not applicable for detecting transcription factor presence or function
*Western blot*
- **Western blot** detects specific proteins using **antibodies**, not oligonucleotide probes
- While it could confirm transcription factor protein presence, it cannot assess DNA-binding capability
- Uses antibody-based detection, not nucleotide probe-based detection
*Northern blot*
- **Northern blot** detects specific **RNA molecules**, not DNA-binding proteins
- Uses labeled DNA or RNA probes to detect RNA, not to detect proteins that bind DNA
- Wrong target molecule (RNA vs. proteins)
Sampling techniques US Medical PG Question 9: A scientist is designing a study to determine whether eating a new diet is able to lower blood pressure in a group of patients. In particular, he believes that starting the diet may help decrease peak blood pressures throughout the day. Therefore, he will equip study participants with blood pressure monitors and follow pressure trends over a 24-hour period. He decides that after recruiting subjects, he will start them on either the new diet or a control diet and follow them for 1 month. After this time, he will switch patients onto the other diet and follow them for an additional month. He will analyze the results from the first month against the results from the second month for each patient. This type of study design is best at controlling for which of the following problems with studies?
- A. Hawthorne effect
- B. Recall bias
- C. Confounding (Correct Answer)
- D. Selection bias
- E. Pygmalion effect
Sampling techniques Explanation: ***Confounding***
- This **crossover design** (switching patients to the other diet) effectively controls for **confounding variables** by making each patient their own control, ensuring that inherent patient characteristics do not bias the comparison between diets.
- By comparing the effects of both diets within the same individual, individual variability in factors such as genetics, lifestyle, and other co-morbidities are accounted for, reducing their potential as confounders.
*Hawthorne effect*
- The **Hawthorne effect** refers to subjects modifying their behavior in response to being observed, which this study design does not specifically address or eliminate.
- While patients are being monitored, the design aims to compare the diets' effects, not to prevent behavioral changes due to observation itself.
*Recall bias*
- **Recall bias** occurs when participants' memories of past events are inaccurate, often influenced by their current health status or beliefs.
- This study measures **real-time blood pressure** data, not relying on recollection of past exposures or outcomes, thereby mitigating recall bias.
*Selection bias*
- **Selection bias** arises from non-random selection of participants into study groups, leading to systematic differences between groups.
- While patient recruitment could introduce selection bias into the overall study population, the **crossover design** itself helps control for differences between treatment arms because all participants eventually receive both treatments.
*Pygmalion effect*
- The **Pygmalion effect** (or observer-expectancy effect) describes phenomena where higher expectations lead to increased performance, usually from a researcher influencing a subject.
- This effect is not directly addressed by the crossover design; the design focuses on controlling for patient-specific confounders rather than investigator bias in expectations.
Sampling techniques US Medical PG Question 10: A statistician wants to study the effects of a medicine in three groups-humans, animals, and plants. He then selects randomly from these three groups. Which type of sampling is being performed?
- A. Simple random sampling
- B. Systematic sampling
- C. Stratified random sampling (Correct Answer)
- D. Cluster sampling
- E. Convenience sampling
Sampling techniques Explanation: ***Stratified random sampling***
- This method involves dividing the population into **distinct subgroups (strata)** based on shared characteristics (in this case, humans, animals, and plants), and then performing a simple random sample within each stratum.
- This ensures that all subgroups are proportionally represented in the sample, which is appropriate when studying effects across different biological categories.
*Simple random sampling*
- This method involves selecting individuals from the entire population **purely by chance**, without first dividing them into subgroups.
- It would not guarantee representation from all three distinct groups (humans, animals, and plants), which is essential for studying differential effects.
*Systematic sampling*
- This involves selecting samples at **regular intervals** from an ordered list or sequence.
- This method is not suitable here because the population is divided into distinct, non-ordered groups rather than a continuous sequence.
*Cluster sampling*
- This method involves dividing the population into **clusters**, then randomly selecting some clusters and sampling all individuals within those selected clusters.
- In this scenario, the initial groups (humans, animals, plants) are strata, not clusters, as the intent is to sample from within each group, not to treat the groups themselves as primary sampling units.
*Convenience sampling*
- This is a **non-probability sampling method** where subjects are selected based on ease of access rather than random selection.
- The question explicitly states that random selection is performed from each group, ruling out convenience sampling.
More Sampling techniques US Medical PG questions available in the OnCourse app. Practice MCQs, flashcards, and get detailed explanations.