Soft CIFAR: Exploring Neural Network Training with Soft Labels

This repository contains the implementation and experimental code for our study on the impact of soft labels in neural network training, specifically focusing on the CIFAR-10 dataset.

Overview

We investigate whether training neural networks with soft labels (probabilistic class distributions from multiple annotators) provides benefits over traditional hard labels. Our research compares several uncertainty-aware network architectures and regularization techniques across multiple robustness metrics.

Key Features

Implementation of multiple uncertainty-aware architectures:
- Spectral Normalized Gaussian Process (SNGP)
- Deterministic Uncertainty Quantification (DUQ)
- Monte Carlo Dropout
Comprehensive evaluation framework measuring:
- Classification accuracy
- Out-of-distribution detection (OOD AUROC)
- Expected Calibration Error (ECE)
- Adversarial robustness via FGSM attacks
Dataset handling for both hard and soft CIFAR-10 labels
Regularization techniques implementation:
- MixUp
- CutMix
- Spectral normalization
- Gradient penalties

Results

Our experiments show that:

Soft labels generally improve model robustness across multiple dimensions
Different network architectures excel at different robustness metrics
SNGP performs best at OOD detection when trained with soft labels
Monte Carlo Dropout provides superior adversarial robustness
Regularization techniques like MixUp and CutMix cannot effectively replicate the benefits of training with soft labels
For a more detailed overview, access the pdf of the report located under uncertainty_report/report.pdf

Model Architecture

We use ResNet architectures of varying depths, with the primary experiments conducted using 20-layer models (~270,000 parameters). Each network is optionally wrapped with uncertainty quantification methods.

Requirements

Python 3.8+
PyTorch 1.8+
torchvision
numpy
Weights & Biases for experiment tracking

Usage

Basic Training

To train a basic ResNet model with hard labels:

python train.py --unc_method basic --hard True --do_augmentation True

or if using the SLURM scheduler, simply run the train.sh script with the appropriately adjusted file paths and ensure the soft cifar 10 dataset has been downloaded.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
data		data
losses		losses
uncertainty_report		uncertainty_report
wrappers		wrappers
.gitignore		.gitignore
README.md		README.md
evaluate_metrics.py		evaluate_metrics.py
grid_searcher.py		grid_searcher.py
load_ood.py		load_ood.py
lr_warmup.py		lr_warmup.py
mixup.py		mixup.py
model_structure.py		model_structure.py
multi_train.sh		multi_train.sh
resnet.py		resnet.py
train.py		train.py
train.sh		train.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Soft CIFAR: Exploring Neural Network Training with Soft Labels

Overview

Key Features

Results

Model Architecture

Requirements

Usage

Basic Training

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Soft CIFAR: Exploring Neural Network Training with Soft Labels

Overview

Key Features

Results

Model Architecture

Requirements

Usage

Basic Training

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages