
Official implementation of Diffusion Autoencoders

A CVPR 2022 (ORAL) paper (paper, site, 5-min video):

@inproceedings{preechakul2021diffusion,
      title={Diffusion Autoencoders: Toward a Meaningful and Decodable Representation}, 
      author={Preechakul, Konpat and Chatthee, Nattanat and Wizadwongsa, Suttisak and Suwajanakorn, Supasorn},
      booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, 
      year={2022},
}

Usage

⚙️ Try a Colab walkthrough: Open In Colab

🤗 Try a web demo: Replicate

Note: Since we expect frequent changes to the codebase, please fork the repo before using it.

Prerequisites

See requirements.txt

pip install -r requirements.txt

Quick start

We provide Jupyter notebooks for each task:

For unconditional generation: sample.ipynb

For manipulation: manipulate.ipynb

For interpolation: interpolate.ipynb

For autoencoding: autoencoding.ipynb
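
The notebooks are the authoritative reference. As a minimal sketch, autoencoding an aligned image looks roughly like the following; the config name ffhq256_autoenc, the LitModel wrapper, and the encode / encode_stochastic / render calls follow autoencoding.ipynb, while the image path and preprocessing here are purely illustrative assumptions:

import torch
from PIL import Image
from torchvision import transforms
from templates import *  # model configs such as ffhq256_autoenc()

device = 'cuda'
conf = ffhq256_autoenc()
model = LitModel(conf)
state = torch.load(f'checkpoints/{conf.name}/last.ckpt', map_location='cpu')
model.load_state_dict(state['state_dict'], strict=False)
model.ema_model.eval()
model.ema_model.to(device)

# load an aligned face and normalize to [-1, 1] (hypothetical file name)
img = Image.open('imgs_align/example.png').convert('RGB')
x = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(256),
    transforms.ToTensor(),
    transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)),
])(img).unsqueeze(0).to(device)

cond = model.encode(x)                        # semantic latent z_sem
xT = model.encode_stochastic(x, cond, T=250)  # stochastic code x_T
pred = model.render(xT, cond, T=20)           # reconstruction in [0, 1]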

Aligning your own images:

  1. Put images into the imgs directory
  2. Run align.py (requires pip install dlib requests); the exact commands are shown below
  3. Aligned results will be written to the imgs_align directory
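
Concretely, from the repository root (assuming align.py reads imgs/ and writes imgs_align/ as described above):

pip install dlib requests
python align.py
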
(Example images: the original from imgs, the aligned output of align.py, and a manipulation produced with manipulate.ipynb.)

Checkpoints

We provide checkpoints for the following models:

  1. DDIM: FFHQ128 (72M, 130M), Bedroom128, Horse128
  2. DiffAE (autoencoding only): FFHQ256, FFHQ128 (72M, 130M), Bedroom128, Horse128
  3. DiffAE (with latent DPM, can sample): FFHQ256, FFHQ128, Bedroom128, Horse128
  4. DiffAE's classifiers (for manipulation): FFHQ256's latent on CelebAHQ, FFHQ128's latent on CelebAHQ

Download the checkpoints and put them into a separate checkpoints directory at the repository root. It should look like this:

checkpoints/
- bedroom128_autoenc
    - last.ckpt # diffae checkpoint
    - latent.ckpt # predicted z_sem on the dataset
- bedroom128_autoenc_latent
    - last.ckpt # diffae + latent DPM checkpoint
- bedroom128_ddpm
- ...
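
As a sketch of how these directories are consumed, loading the latent-DPM variant for unconditional sampling follows the same pattern as sample.ipynb; the config name ffhq256_autoenc_latent and the model.sample call mirror that notebook, and the exact arguments should be treated as assumptions:

import torch
from templates import *
from templates_latent import *  # latent-DPM configs such as ffhq256_autoenc_latent()

device = 'cuda'
conf = ffhq256_autoenc_latent()
model = LitModel(conf)
state = torch.load(f'checkpoints/{conf.name}/last.ckpt', map_location='cpu')
model.load_state_dict(state['state_dict'], strict=False)
model.to(device)

torch.manual_seed(0)
# 8 unconditional samples; T = diffusion steps, T_latent = latent DPM steps
imgs = model.sample(8, device=device, T=20, T_latent=200)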

LMDB Datasets

We do not own any of the following datasets; we provide ready-to-use LMDB versions purely for convenience.

The directory tree should be:

datasets/
- bedroom256.lmdb
- celebahq256.lmdb
- celeba.lmdb
- ffhq256.lmdb
- horse256.lmdb

You can also download each dataset from its original source and use our provided scripts to package it as an LMDB file. The conversion scripts are:

data_resize_bedroom.py
data_resize_celebhq.py
data_resize_celeba.py
data_resize_ffhq.py
data_resize_horse.py

Google drive: https://drive.google.com/drive/folders/1abNP4QKGbNnymjn8607BF0cwxX2L23jh?usp=sharing
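
To sanity-check a downloaded LMDB dataset, you can open it with the lmdb package and count its entries; the path below is only an example, and the exact key layout is defined by the conversion scripts:

import lmdb

env = lmdb.open('datasets/ffhq256.lmdb', readonly=True, lock=False, readahead=False)
with env.begin(write=False) as txn:
    print('stored key/value pairs:', txn.stat()['entries'])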

Training

We provide scripts for training & evaluating DDIM and DiffAE (including the latent DPM) on the following datasets: FFHQ128, FFHQ256, Bedroom128, Horse128, Celeba64 (D2C's crop). Evaluation results (FID) are written to the eval directory.

Note: Most experiments require at least 4x V100s to train the DPM models, while the accompanying latent DPM can be trained on a single 2080Ti.

FFHQ128

# diffae
python run_ffhq128.py
# ddim
python run_ffhq128_ddim.py

A classifier (for manipulation) can be trained using:

python run_ffhq128_cls.py

FFHQ256

We trained only the DiffAE due to the high computation cost. This requires 8x V100s.

sbatch run_ffhq256.py

After that is done, train the latent DPM (requires only 1x 2080Ti):

python run_ffhq256_latent.py

A classifier (for manipulation) can be trained using:

python run_ffhq256_cls.py

Bedroom128

# diffae
python run_bedroom128.py
# ddim
python run_bedroom128_ddim.py

Horse128

# diffae
python run_horse128.py
# ddim
python run_horse128_ddim.py

Celeba64

This experiment can be run on 2080Ti's.

# diffae
python run_celeba64.py
