Model Zoo
Pre-trained Models
First of all, we thank the following repositories for their work on high-quality image synthesis
Please download the models you need and save them to checkpoints/.
| StyleGAN Official |
|
|
|
| Model (Dataset) |
Training Samples |
Training Duration (K Images) |
FID |
| ffhq-1024x1024 |
70,000 |
25,000 |
4.40 |
| celebahq-1024x1024 |
30,000 |
25,000 |
5.06 |
| bedroom-256x256 |
3,033,042 |
70,000 |
2.65 |
| cat-256x256 |
1,657,266 |
70,000 |
8.53 |
| car-512x384 |
5,520,756 |
46,000 |
3.27 |
| StyleGAN Ours |
|
|
|
| Model (Dataset) |
Training Samples |
Training Duration (K Images) |
FID |
| Face ("partial" means faces are not fully aligned to center) |
|
|
|
| celeba_partial-256x256 |
103,706 |
50,000 |
7.03 |
| ffhq-256x256 |
70,000 |
25,000 |
5.70 |
| ffhq-512x512 |
70,000 |
25,000 |
5.15 |
| LSUN Indoor Scene |
|
|
|
| livingroom-256x256 |
1,315,802 |
30,000 |
5.16 |
| diningroom-256x256 |
657,571 |
25,000 |
4.13 |
| kitchen-256x256 |
1,000,000 |
30,000 |
5.06 |
| LSUN Indoor Scene Mixture |
|
|
|
| apartment-256x256 |
4 * 200,000 |
60,000 |
4.18 |
| LSUN Outdoor Scene |
|
|
|
| church-256x256 |
126,227 |
30,000 |
4.82 |
| tower-256x256 |
708,264 |
30,000 |
5.99 |
| bridge-256x256 |
818,687 |
25,000 |
6.42 |
| LSUN Other Scene |
|
|
|
| restaurant-256x256 |
626,331 |
50,000 |
4.03 |
| classroom-256x256 |
168,103 |
50,000 |
10.10 |
| conferenceroom-256x256 |
229,069 |
50,000 |
6.20 |
| StyleGAN2 Official |
|
|
|
| Model (Dataset) |
Training Samples |
Training Duration (K Images) |
FID |
| ffhq-1024x1024 |
70,000 |
25,000 |
2.84 |
| church-256x256 |
126,227 |
48,000 |
3.86 |
| cat-256x256 |
1,657,266 |
88,000 |
6.93 |
| horse-256x256 |
2,000,340 |
100,000 |
3.43 |
| car-512x384 |
5,520,756 |
57,000 |
2.32 |
Training Datasets
- MNIST (60,000 training samples and 10,000 test samples on 10 digital numbers)
- SVHN (73,257 training samples, 26,032 testing samples, and 531,131 additional samples on 10 digital numbers)
- CIFAR10 (50,000 training samples and 10,000 test samples on 10 classes)
- CIFAR100 (50,000 training samples and 10,000 test samples on 100 classes)
- ImageNet (1,281,167 training samples, 50,000 validation samples, and 100,100 testing samples on 1000 classes)
- CelebA (202,599 samples from 10,177 identities, with 5 landmarks and 40 binary facial attributes)
- CelebA-HQ (30,000 samples)
- FF-HQ (70,000 samples)
- LSUN (see statistical information below)
- Places (around 1.8M training samples covering 365 classes)
- Cityscapes (2,975 training samples, 19998 extra training samples (one broken), 500 validation samples, and 1,525 test samples)
- Streetscapes
Statistical information of LSUN dataset is summarized as follows:
| LSUN Datasets Stats |
|
|
| Name |
Number of Samples |
Size |
| Scenes |
|
|
| bedroom (train) |
3,033,042 |
43G |
| bridge (train) |
818,687 |
15G |
| churchoutdoor (train) |
126,227 |
2G |
| classroom (train) |
168,103 |
3G |
| conferenceroom (train) |
229,069 |
4G |
| diningroom (train) |
657,571 |
11G |
| kitchen (train) |
2,212,277 |
33G |
| livingroom (train) |
1,315,802 |
21G |
| restaurant (train) |
626,331 |
13G |
| tower (train) |
708,264 |
11G |
| Objects |
|
|
| airplane |
1,530,696 |
34G |
| bicycle |
3,347,211 |
129G |
| bird |
2,310,362 |
65G |
| boat |
2,651,165 |
86G |
| bottle |
3,202,760 |
64G |
| bus |
695,891 |
24G |
| car |
5,520,756 |
173G |
| cat |
1,657,266 |
42G |
| chair |
5,037,807 |
116G |
| cow |
377,379 |
15G |
| diningtable |
1,537,123 |
48G |
| dog |
5,054,817 |
145G |
| horse |
2,000,340 |
69G |
| motorbike |
1,194,101 |
42G |
| person |
18,890,816 |
477G |
| pottedplant |
1,104,859 |
43G |
| sheep |
418,983 |
18G |
| sofa |
2,365,870 |
56G |
| train |
1,148,020 |
43G |
| tvmonitor |
2,463,284 |
46G |