Machine Learning Fundamentals

On this page

ML Fundamentals - AI's Learning Curve

AI, ML, and DL Nested Diagram

  • Machine Learning (ML): AI subset; systems learn from data.
  • Types:
    • Supervised: Labeled data (e.g., image classification: tumor/normal).
    • Unsupervised: Unlabeled data (e.g., anomaly detection).
    • Reinforcement: Rewards/penalties (less common in radiology).
  • Key Terms:
    • Features: Input variables (pixel values).
    • Labels: Output variables (disease state).
    • Model: Learned mathematical representation.
    • Training: Learning process.
    • Inference: Prediction on new data.

⭐ Arthur Samuel coined "Machine Learning" in 1959 (IBM), known for computer checkers.

Supervised Learning - Guided Predictions

  • Core Concept: Learning from labeled data, mapping inputs to outputs ($X \rightarrow Y$).
  • Primary Tasks:
    • Classification: Assigns inputs to predefined categories. E.g., classifying a lung nodule as benign or malignant.
    • Regression: Predicts continuous numerical values. E.g., estimating tumor volume or bone age. Logistic Regression Decision Boundary and Loss
  • Common Algorithms:
    • Linear/Logistic Regression: Models relationships to predict outcomes.
    • Support Vector Machines (SVM): Finds an optimal boundary to separate data classes.
    • Decision Trees: Uses a tree-like structure for decision-making.
    • k-Nearest Neighbors (k-NN): Classifies based on the 'k' closest labeled examples.
  • Radiology Applications:
    • CADe (Computer-Aided Detection): Highlights suspicious regions for radiologists (e.g., microcalcifications).
    • CADx (Computer-Aided Diagnosis): Characterizes lesions to assess malignancy risk.

⭐ The vast majority of currently FDA-approved AI algorithms in diagnostic radiology are based on supervised learning techniques.

Unsupervised Learning & Data Prep - Finding Hidden Patterns

  • Core: Discovers patterns in unlabeled data.
  • Tasks:
    • Clustering: Groups similar data. E.g., k-Means (partitions data into $k$ clusters by similarity).
    • Dimensionality Reduction: Reduces features. E.g., PCA (finds key patterns, reduces complexity/noise). K-means clustering on PCA-reduced data
  • Data Prep (GIGO: Garbage In, Garbage Out):
    • Cleaning: Handle missing values, outliers.
    • Normalization/Standardization: Scale features.
    • Augmentation: Expand image data (rotate, scale). Vital for medical images.
  • Dataset Splitting (70% Train / 15% Val / 15% Test):
    • Train: Model learning.
    • Validate: Tune hyperparameters; prevent overfitting.
    • Test: Unbiased final performance evaluation.

⭐ Data augmentation is a critical technique in medical imaging AI to artificially expand limited datasets and improve model generalization, thereby reducing overfitting.

Model Metrics & CNNs - AI's Checkup

  • Confusion Matrix: Elements: TP, TN, FP, FN.

    • TP: Correct positive; TN: Correct negative.
    • FP: False alarm (Type I error); FN: Missed detection (Type II error).
  • Key Metrics:

    • Sensitivity (Recall): $S_n = TP/(TP+FN)$ (detects disease).
    • Specificity: $S_p = TN/(TN+FP)$ (confirms absence).
    • Precision (PPV): $PPV = TP/(TP+FP)$ (true positives among positive calls).
    • Accuracy: $Acc = (TP+TN)/(TP+TN+FP+FN)$ (overall correct).
  • AUC-ROC: Area Under ROC Curve. Plots $S_n$ vs (1-$S_p$). Higher AUC = better discrimination.

  • Common Pitfalls:

    • Overfitting: Good on training data, poor on test data.
    • Underfitting: Poor on both training and test data.
  • Deep Learning (DL): ML using deep Artificial Neural Networks (ANNs).

  • Convolutional Neural Networks (CNNs): DL for image analysis; hierarchical feature learning.

    • Key Layers: Convolutional (features), Pooling (reduce dimensionality), Fully Connected (classify).

CNN Architecture Diagram

⭐ The Area Under the ROC Curve (AUC-ROC) is a widely used metric to evaluate the performance of binary classification models in radiology, representing the trade-off between sensitivity and specificity across different thresholds.

High‑Yield Points - ⚡ Biggest Takeaways

  • Supervised learning (e.g., CNNs) is vital for image classification and segmentation tasks.
  • Unsupervised learning aids in pattern discovery and anomaly detection in large datasets.
  • Deep Learning, particularly Convolutional Neural Networks (CNNs), are foundational for radiological image analysis.
  • Training, validation, and test datasets are essential to prevent overfitting and ensure generalizability.
  • Key performance metrics include AUC-ROC, sensitivity, specificity, and accuracy.
  • Awareness of algorithmic bias and the need for explainable AI (XAI) is growing in radiology AI applications.
Rezzy AI Tutor

Have doubts about this lesson?

Ask Rezzy, our AI tutor, to explain anything you didn't understand

Practice Questions: Machine Learning Fundamentals

Test your understanding with these related questions

Based on the provided image, which of the following is the correct diagnosis?

Image for question 1
1 of 5

Flashcards: Machine Learning Fundamentals

1/9

The Modified _____ scale is a method for grading SAH as seen on non-contrast CT (NCCT)

TAP TO REVEAL ANSWER

The Modified _____ scale is a method for grading SAH as seen on non-contrast CT (NCCT)

Fisher

browseSpaceflip

Enjoying this lesson?

Get full access to all lessons, practice questions, and more.

Start For Free
Machine Learning Fundamentals - Free Indian Medical PG