Skip to content

Commit b992106

Browse files
committed
release checkpoints
1 parent b158d92 commit b992106

File tree

1 file changed

+14
-12
lines changed

1 file changed

+14
-12
lines changed

README.md

+14-12
Original file line numberDiff line numberDiff line change
@@ -75,54 +75,56 @@ You can change the number of sampling steps using the `--timestep_respacing` arg
7575

7676
To sample using [DDIM](https://arxiv.org/abs/2010.02502), pass `--use_ddim True`.
7777

78-
## Experiment hyper-parameters
78+
## Models and Hyperparameters
7979

80-
This section includes run flags for training the main models in the paper. Note that the batch sizes are specified for single-GPU training, even though most of these runs will not naturally fit on a single GPU. To address this, either set `--microbatch` to a small value (e.g. 4) to train on one GPU, or run with MPI and divide `--batch_size` by the number of GPUs.
80+
This section includes model checkpoints and run flags for the main models in the paper.
8181

82-
Unconditional ImageNet-64 with our `L_hybrid` objective and cosine noise schedule:
82+
Note that the batch sizes are specified for single-GPU training, even though most of these runs will not naturally fit on a single GPU. To address this, either set `--microbatch` to a small value (e.g. 4) to train on one GPU, or run with MPI and divide `--batch_size` by the number of GPUs.
83+
84+
Unconditional ImageNet-64 with our `L_hybrid` objective and cosine noise schedule [[checkpoint](https://openaipublic.blob.core.windows.net/diffusion/march-2021/imagenet64_uncond_100M_1500K.pt)]:
8385

8486
```bash
8587
MODEL_FLAGS="--image_size 64 --num_channels 128 --num_res_blocks 3 --learn_sigma True"
8688
DIFFUSION_FLAGS="--diffusion_steps 4000 --noise_schedule cosine"
8789
TRAIN_FLAGS="--lr 1e-4 --batch_size 128"
8890
```
8991

90-
Unconditional CIFAR-10 with our `L_hybrid` objective and cosine noise schedule:
92+
Unconditional CIFAR-10 with our `L_hybrid` objective and cosine noise schedule [[checkpoint](https://openaipublic.blob.core.windows.net/diffusion/march-2021/cifar10_uncond_50M_500K.pt)]:
9193

9294
```bash
9395
MODEL_FLAGS="--image_size 32 --num_channels 128 --num_res_blocks 3 --learn_sigma True --dropout 0.3"
9496
DIFFUSION_FLAGS="--diffusion_steps 4000 --noise_schedule cosine"
9597
TRAIN_FLAGS="--lr 1e-4 --batch_size 128"
9698
```
9799

98-
Class-conditional ImageNet-64 model (270M parameters, trained for 250K iterations):
100+
Class-conditional ImageNet-64 model (270M parameters, trained for 250K iterations) [[checkpoint](https://openaipublic.blob.core.windows.net/diffusion/march-2021/imagenet64_cond_270M_250K.pt)]:
99101

100102
```bash
101103
MODEL_FLAGS="--image_size 64 --num_channels 192 --num_res_blocks 3 --learn_sigma True --class_cond True"
102104
DIFFUSION_FLAGS="--diffusion_steps 4000 --noise_schedule cosine --rescale_learned_sigmas False --rescale_timesteps False"
103105
TRAIN_FLAGS="--lr 3e-4 --batch_size 2048"
104106
```
105107

106-
Upsampling 256x256 model (280M parameters, trained for 500K iterations):
108+
Upsampling 256x256 model (280M parameters, trained for 500K iterations) [[checkpoint](https://openaipublic.blob.core.windows.net/diffusion/march-2021/upsample_cond_500K.pt)]:
107109

108110
```bash
109-
MODEL_FLAGS="--num_channels 192 --num_res_blocks 3 --learn_sigma True --class_cond True"
111+
MODEL_FLAGS="--num_channels 192 --num_res_blocks 2 --learn_sigma True --class_cond True"
110112
DIFFUSION_FLAGS="--diffusion_steps 4000 --noise_schedule linear --rescale_learned_sigmas False --rescale_timesteps False"
111113
TRAIN_FLAGS="--lr 3e-4 --batch_size 256"
112114
```
113115

114-
LSUN bedroom model (lr=1e-4):
116+
LSUN bedroom model (lr=1e-4) [[checkpoint](https://openaipublic.blob.core.windows.net/diffusion/march-2021/lsun_uncond_100M_1200K_bs128.pt)]:
115117

116118
```bash
117-
MODEL_FLAGS="--image_size 256 --num_channels 128 --num_res_blocks 2 --num_heads 1 --learn_sigma True"
119+
MODEL_FLAGS="--image_size 256 --num_channels 128 --num_res_blocks 2 --num_heads 1 --learn_sigma True --use_scale_shift_norm False --attention_resolutions 16"
118120
DIFFUSION_FLAGS="--diffusion_steps 1000 --noise_schedule linear --rescale_learned_sigmas False --rescale_timesteps False"
119121
TRAIN_FLAGS="--lr 1e-4 --batch_size 128"
120122
```
121123

122-
LSUN bedroom model (lr=2e-5):
124+
LSUN bedroom model (lr=2e-5) [[checkpoint](https://openaipublic.blob.core.windows.net/diffusion/march-2021/lsun_uncond_100M_2400K_bs64.pt)]:
123125

124126
```bash
125-
MODEL_FLAGS="--image_size 256 --num_channels 128 --num_res_blocks 2 --num_heads 1 --learn_sigma True"
126-
DIFFUSION_FLAGS="--diffusion_steps 1000 --noise_schedule linear --rescale_learned_sigmas False --rescale_timesteps False"
127+
MODEL_FLAGS="--image_size 256 --num_channels 128 --num_res_blocks 2 --num_heads 1 --learn_sigma True --use_scale_shift_norm False --attention_resolutions 16"
128+
DIFFUSION_FLAGS="--diffusion_steps 1000 --noise_schedule linear --rescale_learned_sigmas False --rescale_timesteps False --use_scale_shift_norm False"
127129
TRAIN_FLAGS="--lr 2e-5 --batch_size 128"
128130
```

0 commit comments

Comments
 (0)