Image Multi Classification
The task of assigning multiple labels to an entire image, indicating what object or scene is present in the image.
Popular Datasets
- CIFAR-10
- Description: 60,000 32x32 color images in 10 classes, with 6,000 images per class.
- URL: CIFAR-10
- CIFAR-100
- Description: 60,000 32x32 color images in 100 classes, with 600 images per class.
- URL: CIFAR-100
- ImageNet
- Description: Over 14 million images categorized into over 20,000 classes.
- URL: ImageNet
- MNIST
- Description: 70,000 28x28 grayscale images of handwritten digits in 10 classes.
- URL: MNIST
- Fashion-MNIST
- Description: 70,000 28x28 grayscale images of 10 fashion categories.
- URL: Fashion-MNIST
Popular Models
- Convolutional Neural Networks (CNNs)
- Examples: LeNet, AlexNet, VGG, ResNet, DenseNet
- Residual Networks (ResNet)
- Description: Introduces skip connections to prevent vanishing gradients.
- URL: ResNet
- DenseNet
- Description: Connects each layer to every other layer in a feed-forward fashion.
- URL: DenseNet
- Inception Networks (GoogLeNet)
- Description: Uses multiple types of convolutions in parallel to capture different features.
- URL: Inception
- EfficientNet
- Description: Scales up model size while balancing network depth, width, and resolution.
- URL: EfficientNet
Hyperparameters
- Learning Rate
- Description: Controls how much to change the model in response to the estimated error each time the model weights are updated.
- Batch Size
- Description: The number of training examples utilized in one iteration.
- Number of Epochs
- Description: The number of complete passes through the training dataset.
- Optimizer
- Examples: SGD, Adam, RMSprop
- Weight Initialization
- Examples: Xavier, He initialization
- Dropout Rate
- Description: Fraction of the input units to drop.
- Learning Rate Decay
- Description: Reduces the learning rate as training progresses.
Popular Loss Functions
- Cross-Entropy Loss
- Description: Measures the performance of a classification model whose output is a probability value between 0 and 1.
- Hinge Loss
- Description: Used for “maximum-margin” classification, mostly for SVMs.
- Kullback-Leibler Divergence Loss
- Description: Measures how one probability distribution diverges from a second, expected probability distribution.
Popular Evaluation Metrics
- Accuracy
- Description: The ratio of correctly predicted observation to the total observations.
- Precision
- Description: The ratio of correctly predicted positive observations to the total predicted positives.
- Recall (Sensitivity)
- Description: The ratio of correctly predicted positive observations to the all observations in actual class.
- F1 Score
- Description: The weighted average of Precision and Recall.
- Confusion Matrix
- Description: A table used to describe the performance of a classification model.
- ROC-AUC
- Description: A graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied.
Other Important Topics
- Data Augmentation
- Description: Techniques used to increase the diversity of your training data without actually collecting new data.
- Examples: Rotation, Translation, Scaling, Flipping, Adding Noise
- Transfer Learning
- Description: Reusing a pre-trained model on a new problem.
- Example: Fine-tuning a model pre-trained on ImageNet for a new classification task.
- Regularization Techniques
- Examples: L1/L2 regularization, Dropout, Early Stopping
- Ensemble Methods
- Description: Combining the predictions of multiple models to improve generalizability and robustness.
- Examples: Bagging, Boosting, Stacking
- Fine-Tuning
- Description: Adjusting the parameters of an existing model to better fit the new data.
- Hyperparameter Tuning
- Techniques: Grid Search, Random Search, Bayesian Optimization
- Model Interpretability
- Techniques: SHAP values, LIME