Learning with Partial-Label and Unlabeled Data: A Uniform Treatment for Supervision Redundancy and Insufficiency
Liu et al., ICML 2024

This repository contains our reproduction of SPMI, a mutual-information-based framework for semi-supervised partial-label learning.
SPMI proposes a unified semi-supervised partial-label learning framework that:
- Expands candidate label sets via a mutual-information criterion.
- Condenses noisy, redundant candidate sets using a KL-divergence-based score.
- Smoothly updates class priors with an exponential moving average (EMA).
The authors claim SPMI outperforms composite baselines (PRODEN + FixMatch), but our reproduction struggled to match their 85–95% accuracies, revealing multiple ambiguities and potential pitfalls.
├── spmi.py # Core SPMI algorithm
├── model.py # LeNet & WideResNet definitions
├── dataset.py # Data loaders & transforms
├── train.py # Training loop & utils
├── ablation.py # Ablation-study harness
├── fminstexp.py # Fashion-MNIST
├── cifar10exp.py # CIFAR-10
├── cifar100test1.py # CIFAR-100
├── cifar100test2.py # CIFAR-100
├── SHVNexp.py # SVHN
├── ProdenFixmatch.py # PRODEN+FixMatch baseline
├── logs/ # Logs and Results
└── README.md # This document
- Label Expansion (`spmi.py`): Implements Eq. 11 to add probable labels based on mutual information.
- Label Condensation (`spmi.py`): Applies Eq. 15 to prune the candidate set via an information score.
- Class Prior EMA (`spmi.py`): Updates priors with Eq. 16; handles edge cases when α = 1.0. A minimal sketch of this update follows the list.
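For reference, a minimal sketch of an EMA-style class-prior update, assuming a standard convex-combination form. The function name, arguments, and `alpha` default are illustrative and may not match the paper's Eq. 16 exactly:

```python
import numpy as np

def update_prior(prior, probs, alpha=0.9):
    """Hedged sketch of an EMA class-prior update (not necessarily the authors' exact Eq. 16).

    prior: (C,) current class prior; probs: (N, C) model posteriors for a batch;
    alpha: EMA coefficient. With alpha = 1.0 the prior is frozen, the edge case
    handled explicitly in our spmi.py.
    """
    if alpha >= 1.0:                       # edge case: keep the old prior unchanged
        return prior
    batch_prior = probs.mean(axis=0)       # empirical class frequencies in this batch
    new_prior = alpha * prior + (1.0 - alpha) * batch_prior
    return new_prior / new_prior.sum()     # renormalize to a valid distribution
```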
`ablation.py` lets you toggle:
- Initialization fallback
- Label generation
- Label condensation
and measure their individual contributions on each dataset (an illustrative interface sketch follows).
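To make the toggles concrete, here is an argparse-style sketch of how such an interface could look. The flag names (`--no-init-fallback`, `--no-label-generation`, `--no-condensation`) are hypothetical; check `python ablation.py --help` for the real options:

```python
import argparse

def build_ablation_parser():
    # Hypothetical flag names, shown only to illustrate the three toggles.
    parser = argparse.ArgumentParser(description="SPMI ablation harness (sketch)")
    parser.add_argument("--dataset", default="fmnist", choices=["fmnist", "cifar10", "svhn"])
    parser.add_argument("--epochs", type=int, default=500)
    parser.add_argument("--no-init-fallback", action="store_true",
                        help="disable the top-2 initialization fallback")
    parser.add_argument("--no-label-generation", action="store_true",
                        help="disable mutual-information label generation/expansion")
    parser.add_argument("--no-condensation", action="store_true",
                        help="disable KL-based label condensation")
    return parser
```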
- Initialization Fallback: Added top-2 selection when no candidate meets the threshold (sketched after this list).
- Numerical Stability: Inserted ε-smoothing in the KL divergence (sketched after this list).
- Unused β Penalty: Made the information-bottleneck term optional (it wasn't used in the reported experiments).
- Multi-GPU Issues: DataParallel breaks the stateful candidate sets; use a single GPU for SPMI.
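Minimal sketches of the first two fixes, assuming PyTorch tensors; the function names and the exact ε value are our choices, not prescribed by the paper:

```python
import torch

def init_candidates(probs, threshold):
    """Initialization fallback: if no class clears the threshold, keep the top-2."""
    cand = probs >= threshold                       # boolean (C,) candidate mask
    if not cand.any():
        cand[probs.topk(2).indices] = True          # fallback: two most probable classes
    return cand

def kl_div_smoothed(p, q, eps=1e-8):
    """KL(p || q) over the last dimension, with ε-smoothing for numerical stability."""
    p = (p + eps) / (p + eps).sum(dim=-1, keepdim=True)
    q = (q + eps) / (q + eps).sum(dim=-1, keepdim=True)
    return (p * (p.log() - q.log())).sum(dim=-1)
```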
| Dataset | Labeled / Unlabeled | Partial rate (p) |
|---|---|---|
| Fashion-MNIST | 1,000 / 4,000 | 0.3 / 0.3 |
| CIFAR-10 | 1,000 / 4,000 | 0.3 / 0.3 |
| CIFAR-100 | Not conducted | - |
| SVHN | 1,000 / - | 0.3 / - |
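For context, a sketch of how candidate sets at partial rate p are commonly generated (uniform flipping). This assumes the standard protocol where each false label enters the candidate set independently with probability p; `make_partial_labels` is illustrative and may differ in detail from `dataset.py`:

```python
import numpy as np

def make_partial_labels(labels, num_classes, p=0.3, seed=0):
    """Uniform-flipping candidate sets: each false label is included with probability p."""
    rng = np.random.default_rng(seed)
    n = len(labels)
    candidates = rng.random((n, num_classes)) < p   # flip each false label with prob p
    candidates[np.arange(n), labels] = True         # the true label is always a candidate
    return candidates                                # boolean (n, num_classes) mask
```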
- Batch size: 256
- LR: 0.03 with a cosine schedule and 10-epoch warm-up (see the sketch after this list)
- Optimizer: SGD (momentum = 0.9, weight_decay = 5e-4)
- Total epochs: 500
- Augmentations:
- Fashion-MNIST: RandomCrop → RandomHorizontalFlip → Cutout
- CIFAR/SVHN: RandomCrop → RandomHorizontalFlip → AutoAugment → Cutout
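A minimal sketch of the optimizer and schedule listed above (SGD with momentum 0.9, weight decay 5e-4, lr 0.03, linear warm-up for 10 epochs, then cosine decay). The helper names are ours; `train.py` may implement this differently:

```python
import math
import torch

def build_optimizer(model, lr=0.03, momentum=0.9, weight_decay=5e-4):
    return torch.optim.SGD(model.parameters(), lr=lr,
                           momentum=momentum, weight_decay=weight_decay)

def build_scheduler(optimizer, warmup_epochs=10, total_epochs=500):
    def lr_lambda(epoch):
        if epoch < warmup_epochs:                                  # linear warm-up
            return (epoch + 1) / warmup_epochs
        progress = (epoch - warmup_epochs) / max(1, total_epochs - warmup_epochs)
        return 0.5 * (1.0 + math.cos(math.pi * progress))          # cosine decay to zero
    return torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)
```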
python fminstexp.py
python cifar10exp.py
python SHVNexp.py
python ablation.py --dataset fmnist --epochs 500
bash run_proden_experiments.sh
Install the dependencies listed in requirements.txt:
pip install -r requirements.txt