Magic Un-Eraser

Description

In this project we train a cold diffusion model to generate handwritten digits by reversing the custom 'Eraser' image degradation process. As a baseline, we also train two denoising diffusion probabilistic models (DDPM - standard diffusion).

Toggle Degradation/Generation

Degradation	Generation

Project structure

configs/ - Contains model configurations of the 3 models trained in this project.
data/ - Contains the MNIST dataset.
docs/ - Contains documentation for the project.
models/ - Contains subdirectories for each trained model (automatically generated by the train.py script). Each sub-directory contains the model state dictionary, the configuration used to train it, the metric logger, and a subdirectory of samples generated throughout the training process.
plots/ - Contains the plots used in the report, and other visualisations.
src/ - Contains the source code for the project. (Re-usable components that are used by the scripts in the root directory)
eval.py - Script for evaluating the trained models.
make_plots.py - Script for generating plots used in the report.
train.py - Script for training the models.
.gitignore - Tells git which files to ignore
.pre-commit-config.yaml - Specifies pre-commit hooks to protect the main branch
Dockerfile - Dockerfile to generate docker image
requirements.txt - List of packages/versions to re-create the environment for the project.
LICENSE - MIT license.

Usage / Re-Production

Note: All commands assume they are being run from the root directory of the project.

1. Set-up

To re-create the environment used for the project, use the requirements.txt file. This can be done with pip, conda or docker. The docker container will not naturally have access to the mps device on Mac laptops (which was used to train the models), thus for best performance it is recommended to use pip or conda to re-createe the environment.

pip

$ pip install -r requirements.txt

Using conda

$ conda create --name <env-name> python
$ conda activate <env-name>
$ pip install -r requirements.txt

Docker (not recommended)

$ docker build -t <image-name> .
$ docker run -ti <image-name> bash

2. Training

We trained three models: DDPM model with default hyperparameters (low-capacity), DDPM model with high-capacity (twice the number of trainable parameters) and a Cold diffusion model (termed the 'Magic Un-Eraser') using the 'Eraser' degradation strategy. The Cold diffusion model had all the same hyperparameters as the default DDPM.

$ python train.py ./configs/ddpm_default.ini    # default DDPM model
$ python train.py ./configs/ddpm_high.ini       # high capacity DDPM model
$ python train.py ./configs/magic_uneraser.ini  # Cold diffusion model ("Magic Un-Eraser")

3. Evaluation & Plotting

To evaluate the models, we generate 100 samples and calculate the FID score between the samples and 500 images from the MNIST test set. Use the following commands:

$ python eval.py --model_dir ./models/ddpm_default --output_dir ./plots
$ python eval.py --model_dir ./models/ddpm_high --output_dir ./plots
$ python eval.py --model_dir ./models/magic_uneraser --output_dir ./plots

To make the plots, use

$ python make_plots.py --models ./models --output_dir ./plots

Timing

Times to run each script:

train.py - Using the mps device, it took ~25 minutes to train the high-capacity DDPM model, and ~15 minutes to train the other two models.
eval.py - No more than 200 seconds for each trained model. (Used the cpu only)
make_plots.py - No more than 1-2 minutes.

I ran all scripts on my personal laptop. The train.py script used the mps device, which is essentially the macbook GPU. The specifications are:

Operating System: macOS Sonoma v14.0

CPU:

Chip: Apple M1 Pro
Total Number of Cores: 8 (6 performance and 2 efficiency)
Memory (RAM): 16 GB

GPU (mps):

Chipset Model: Apple M1 Pro
Type: GPU
Bus: Built-In
Total Number of Cores: 14
Metal Support: Metal 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Magic Un-Eraser

Description

Usage / Re-Production

1. Set-up

2. Training

3. Evaluation & Plotting

Timing

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
configs		configs
docs		docs
models		models
plots		plots
report		report
src		src
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
eval.py		eval.py
make_plots.py		make_plots.py
requirements.txt		requirements.txt
train.py		train.py

License

knottwill/Magic-UnEraser

Folders and files

Latest commit

History

Repository files navigation

Magic Un-Eraser

Description

Usage / Re-Production

1. Set-up

2. Training

3. Evaluation & Plotting

Timing

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages