Image Multi Classification

The task of assigning multiple labels to an entire image, indicating what object or scene is present in the image.

Author

Benedict Thekkel

Popular Datasets

CIFAR-10

Description: 60,000 32x32 color images in 10 classes, with 6,000 images per class.
URL: CIFAR-10

CIFAR-100

Description: 60,000 32x32 color images in 100 classes, with 600 images per class.
URL: CIFAR-100

ImageNet

Description: Over 14 million images categorized into over 20,000 classes.
URL: ImageNet

MNIST

Description: 70,000 28x28 grayscale images of handwritten digits in 10 classes.
URL: MNIST

Fashion-MNIST

Description: 70,000 28x28 grayscale images of 10 fashion categories.
URL: Fashion-MNIST

Popular Models

Convolutional Neural Networks (CNNs)

Examples: LeNet, AlexNet, VGG, ResNet, DenseNet

Residual Networks (ResNet)

Description: Introduces skip connections to prevent vanishing gradients.
URL: ResNet

DenseNet

Description: Connects each layer to every other layer in a feed-forward fashion.
URL: DenseNet

Inception Networks (GoogLeNet)

Description: Uses multiple types of convolutions in parallel to capture different features.
URL: Inception

EfficientNet

Description: Scales up model size while balancing network depth, width, and resolution.
URL: EfficientNet

Hyperparameters

Learning Rate

Description: Controls how much to change the model in response to the estimated error each time the model weights are updated.

Batch Size

Description: The number of training examples utilized in one iteration.

Number of Epochs

Description: The number of complete passes through the training dataset.

Optimizer

Examples: SGD, Adam, RMSprop

Weight Initialization

Examples: Xavier, He initialization

Dropout Rate

Description: Fraction of the input units to drop.

Learning Rate Decay

Description: Reduces the learning rate as training progresses.

Popular Loss Functions

Cross-Entropy Loss

Description: Measures the performance of a classification model whose output is a probability value between 0 and 1.

Hinge Loss

Description: Used for “maximum-margin” classification, mostly for SVMs.

Kullback-Leibler Divergence Loss

Description: Measures how one probability distribution diverges from a second, expected probability distribution.

Popular Evaluation Metrics

Accuracy

Description: The ratio of correctly predicted observation to the total observations.

Precision

Description: The ratio of correctly predicted positive observations to the total predicted positives.

Recall (Sensitivity)

Description: The ratio of correctly predicted positive observations to the all observations in actual class.

F1 Score

Description: The weighted average of Precision and Recall.

Confusion Matrix

Description: A table used to describe the performance of a classification model.

ROC-AUC

Description: A graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied.

Other Important Topics

Data Augmentation

Description: Techniques used to increase the diversity of your training data without actually collecting new data.
Examples: Rotation, Translation, Scaling, Flipping, Adding Noise

Transfer Learning

Description: Reusing a pre-trained model on a new problem.
Example: Fine-tuning a model pre-trained on ImageNet for a new classification task.

Regularization Techniques

Examples: L1/L2 regularization, Dropout, Early Stopping

Ensemble Methods

Description: Combining the predictions of multiple models to improve generalizability and robustness.
Examples: Bagging, Boosting, Stacking

Fine-Tuning

Description: Adjusting the parameters of an existing model to better fit the new data.

Hyperparameter Tuning

Techniques: Grid Search, Random Search, Bayesian Optimization

Model Interpretability

Techniques: SHAP values, LIME