README.md


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67

<img src='imgs/teaser_720.gif' align="right" width=360>

<br><br><br><br>

# pix2pixHD
### [[Project]](https://tcwang0509.github.io/pix2pixHD/) [[Youtube]](https://youtu.be/3AIpPlzM_qs) [[Paper]](https://arxiv.org/pdf/1711.11585.pdf) <br>
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic image-to-image translation. It can be used for turning semantic label maps into photo-realistic images or synthesizing portraits from face label maps. <br><br>
[High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs](https://tcwang0509.github.io/pix2pixHD/)  
 [Ting-Chun Wang](https://tcwang0509.github.io/)<sup>1</sup>, [Ming-Yu Liu](http://mingyuliu.net/)<sup>1</sup>, [Jun-Yan Zhu](http://people.eecs.berkeley.edu/~junyanz/)<sup>2</sup>, Andrew Tao<sup>1</sup>, [Jan Kautz](http://jankautz.com/)<sup>1</sup>, [Bryan Catanzaro](http://catanzaro.name/)<sup>1</sup>  
 <sup>1</sup>NVIDIA Corporation, <sup>2</sup>UC Berkeley  
 In arxiv, 2017.  

## Release notice
The code is ready to publish but still under final approval process. It should be approved in a couple of days.<br>
If you want to get notified once the code is released, please subscribe [here](https://tcwang0509.github.io/pix2pixHD/subscribe.html).

## Image-to-image translation at 2k/1k resolution
- Our label-to-streetview results
<p align='center'>  
  <img src='imgs/teaser_label.png' width='440'/>
  <img src='imgs/teaser_ours.jpg' width='440'/>
</p>
- Interactive editing results
<p align='center'>  
  <img src='imgs/teaser_style.gif' width='440'/>
  <img src='imgs/teaser_label.gif' width='440'/>
</p>
- Additional streetview results
<p align='center'>
  <img src='imgs/cityscapes_1.jpg' width='440'/>
  <img src='imgs/cityscapes_2.jpg' width='440'/>
</p>
<p align='center'>
  <img src='imgs/cityscapes_3.jpg' width='440'/>
  <img src='imgs/cityscapes_4.jpg' width='440'/>
</p>

- Label-to-face and interactive editing results
<p align='center'>
  <img src='imgs/face1_1.jpg' width='290'/>
  <img src='imgs/face1_2.jpg' width='290'/>
  <img src='imgs/face1_3.jpg' width='290'/>
</p>
<p align='center'>
  <img src='imgs/face2_1.jpg' width='290'/>
  <img src='imgs/face2_2.jpg' width='290'/>
  <img src='imgs/face2_3.jpg' width='290'/>
</p>

- Our editing interface
<p align='center'>
  <img src='imgs/city_short.gif' width='380'/>
  <img src='imgs/face_short.gif' width='490'/>
</p>

### Citation

If you find this useful for your research, please use the following.

```
@article{wang2017highres,
  title={High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs},
  author={Ting-Chun Wang and Ming-Yu Liu and Jun-Yan Zhu and Andrew Tao and Jan Kautz and Bryan Catanzaro},
  journal={arXiv preprint arXiv:1711.11585},
  year={2017}
}
```