CycleGAN

Updated Jul 7, 2026

CycleGAN (Cycle-Consistent Generative Adversarial Network) is a deep learning architecture for unpaired image-to-image translation. It learns a mapping G: X to Y between two image domains X and Y from training samples that are not aligned in pairs, by coupling the forward generator G with an inverse generator F: Y to X and enforcing a cycle-consistency loss so that F(G(x)) is approximately equal to x and G(F(y)) is approximately equal to y. The method was introduced in March 2017 in the arXiv preprint "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks" by Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros at the Berkeley AI Research (BAIR) Lab, University of California, Berkeley, and was published at the IEEE International Conference on Computer Vision (ICCV) 2017 ^[1]. The authors state the motivation plainly: for image translation "the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs," but "for many tasks, paired training data will not be available," and CycleGAN's contribution is a way to learn the translation anyway from two unaligned image collections ^[1].

CycleGAN is one of the most cited works in the generative adversarial network (GAN) literature and a foundational paper for the subfield of unpaired translation. By 2026 it had been cited tens of thousands of times: roughly 19,000 records in the SciSpace index ^[20] and more than 30,000 by Google Scholar's broader count, which places it among the most cited papers in computer vision. The official PyTorch code base released alongside the paper had drawn more than 25,000 stars on GitHub ^[18]. CycleGAN let practitioners learn translation tasks such as horse to zebra, summer to winter, and photograph to painting, where collecting aligned input-output pairs is either impossible or prohibitively expensive.

What problem does CycleGAN solve?

Before CycleGAN, supervised image translation relied on aligned image pairs. The pix2pix framework by Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros, presented at CVPR 2017, used a conditional GAN on paired data to learn mappings such as edge maps to photographs, semantic labels to street scenes, and grayscale to color ^[2]. pix2pix worked well, but it required datasets where every input image came with a corresponding ground-truth output. For most interesting translation tasks, such pairs simply do not exist. Nobody can photograph the same scene as both a Monet painting and a real landscape, and there is no aligned dataset of horses standing in the exact pose of zebras in the same field.

The CycleGAN authors framed the problem as learning the joint distribution of two domains given only the marginals. That problem is highly under-constrained. Infinitely many joint distributions are consistent with any two marginals, so an adversarial loss alone is not enough: the generator can pair inputs with outputs arbitrarily and still satisfy the discriminator. In the authors' words, because the adversarial mapping is "highly under-constrained," they "couple it with an inverse mapping" and "introduce a cycle consistency loss" ^[1]. To arrive at that idea, they borrowed the notion of cycle consistency from work in language translation, visual tracking, and 3D shape matching, and turned it into a differentiable loss. If a horse is mapped to a zebra and then back to a horse, the result should resemble the original horse. This single constraint, together with adversarial losses in both directions, was enough to produce convincing translations on a wide range of unpaired tasks.

Two other groups proposed essentially the same idea at almost the same time. Zili Yi, Hao Zhang, Ping Tan, and Minglun Gong introduced DualGAN ("DualGAN: Unsupervised Dual Learning for Image-to-Image Translation," ICCV 2017) ^[3], and Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, and Jiwon Kim introduced DiscoGAN ("Learning to Discover Cross-Domain Relations with Generative Adversarial Networks," ICML 2017) ^[4]. All three works rely on the same dual-mapping plus reconstruction-loss principle. CycleGAN became the most cited of the three, partly because of the strength of the experiments and partly because of the open-source PyTorch repository released alongside the paper.

Who created CycleGAN?

The paper was written by four researchers associated with UC Berkeley ^[1]:

Jun-Yan Zhu, lead author, then a PhD student at Berkeley advised by Alexei A. Efros. Zhu later went to MIT CSAIL as a postdoc, worked at Adobe Research, and joined Carnegie Mellon University as an assistant professor in 2020 ^[19]. He has continued to work on generative models and image-to-image translation, including pix2pixHD, BicycleGAN, GauGAN, and CUT.
Taesung Park, then a PhD student at Berkeley, also advised by Efros. Park went on to first-author the SPADE/GauGAN paper and the Contrastive Unpaired Translation (CUT) paper at ECCV 2020 ^[8].
Phillip Isola, then at Berkeley working with Efros, later an assistant professor at MIT EECS. Isola was the first author of pix2pix (CVPR 2017), the paired counterpart to CycleGAN ^[2].
Alexei A. Efros, professor at UC Berkeley, the senior author and advisor on the project.

The project was funded in part by NSF, the Berkeley Deep Drive program, and Adobe and Nvidia hardware donations. The first arXiv version of the paper appeared on 30 March 2017, and the work was presented at ICCV in Venice in October 2017 ^[1].

How does CycleGAN work?

CycleGAN trains four neural networks at once: two generators and two discriminators.

Notation

Let X and Y be two image domains, with training samples drawn independently from each. The goal is to learn:

A generator G: X to Y that turns images from domain X into images that look like they came from domain Y.
An inverse generator F: Y to X.
A discriminator D_Y that tries to distinguish real samples from Y from translated samples G(x).
A discriminator D_X that tries to distinguish real samples from X from translated samples F(y).

Loss function

The full objective combines three terms.

Adversarial losses. Each generator-discriminator pair has its own adversarial loss in the style of GAN training. CycleGAN uses the least-squares formulation from Mao et al. 2017 (LSGAN, "Least Squares Generative Adversarial Networks," ICCV 2017) instead of the original log-loss from Goodfellow et al. 2014, because least-squares loss is more stable and produces higher quality images ^[10]^[9]. The two adversarial terms are written L_GAN(G, D_Y, X, Y) and L_GAN(F, D_X, Y, X).
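The least-squares adversarial terms can be sketched in a few lines of NumPy. This is a minimal illustration, assuming the discriminator outputs arrive as arrays of scores and that real and fake targets are 1 and 0 (the usual LSGAN convention). The function names are illustrative and not taken from the official code.

```python
import numpy as np

def lsgan_discriminator_loss(d_real, d_fake):
    """Least-squares loss for D: push scores on real images toward 1, on fakes toward 0."""
    d_real = np.asarray(d_real, dtype=float)
    d_fake = np.asarray(d_fake, dtype=float)
    return np.mean((d_real - 1.0) ** 2) + np.mean(d_fake ** 2)

def lsgan_generator_loss(d_fake):
    """Least-squares loss for G: push D's scores on translated images toward 1."""
    d_fake = np.asarray(d_fake, dtype=float)
    return np.mean((d_fake - 1.0) ** 2)
```

Unlike the log-loss, the squared error keeps growing as a score moves away from its target, so samples that the discriminator already classifies correctly but with scores far from the target still contribute a gradient.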

Cycle-consistency loss. This is the central contribution. Both compositions G then F and F then G must approximately reconstruct the input:

L_cyc(G, F) = E_x [ || F(G(x)) - x ||_1 ] + E_y [ || G(F(y)) - y ||_1 ]

The loss uses the L1 norm. The pix2pix paper had observed that L1 encourages less blurring than L2 ^[2], and the CycleGAN authors report that replacing the L1 norm with an adversarial loss on the reconstructions did not improve results ^[1].
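As a rough sketch of the cycle-consistency term, the snippet below evaluates it with toy stand-in generators. The assumptions are that images are NumPy arrays, that the L1 norm is averaged over pixels rather than summed (a common implementation choice), and that `G`, `F`, and `F_bad` are simple brightness shifts rather than trained networks.

```python
import numpy as np

def cycle_consistency_loss(x, y, G, F):
    """L1 cycle loss: mean |F(G(x)) - x| plus mean |G(F(y)) - y|."""
    forward_cycle = np.mean(np.abs(F(G(x)) - x))   # x -> G(x) -> F(G(x)) should return to x
    backward_cycle = np.mean(np.abs(G(F(y)) - y))  # y -> F(y) -> G(F(y)) should return to y
    return forward_cycle + backward_cycle

# Toy stand-ins for the generators (hypothetical, not trained networks).
def G(img):
    return img + 0.2   # "translate" X -> Y by brightening

def F(img):
    return img - 0.2   # exact inverse of G

def F_bad(img):
    return img - 0.1   # imperfect inverse: leaves a residual of 0.1 per pixel

rng = np.random.default_rng(0)
x = rng.random((2, 3, 4, 4))  # batch of 2 "images", 3 channels, 4x4 pixels
y = rng.random((2, 3, 4, 4))

loss_exact = cycle_consistency_loss(x, y, G, F)      # close to 0
loss_bad = cycle_consistency_loss(x, y, G, F_bad)    # close to 0.1 + 0.1 = 0.2
```

When the two mappings are exact inverses the loss vanishes, and any residual left by an imperfect inverse is penalized in both directions.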

Identity loss. An optional identity term encourages the generators to behave like the identity function when they are fed an image that is already in the target domain:

L_id(G, F) = E_y [ || G(y) - y ||_1 ] + E_x [ || F(x) - x ||_1 ]

Identity loss helps preserve color composition and prevents tinting artifacts. It was used for the painting-to-photograph experiments and switched off elsewhere.

Total objective. The full loss is a weighted sum:

L = L_GAN(G, D_Y, X, Y)
  + L_GAN(F, D_X, Y, X)
  + lambda_cyc * L_cyc(G, F)
  + lambda_id  * L_id(G, F)

In the paper, lambda_cyc is set to 10; the authors report, "For all the experiments, we set lambda = 10 in Equation 3" ^[1]. When the identity term is used, its weight is not an absolute 0.5 but 0.5 times lambda_cyc: the paper states that "the weight for the identity mapping loss was 0.5 lambda where lambda was the weight for cycle consistency loss," giving an effective identity weight of 5 ^[1].
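The weighting can be made concrete with a small helper. This is only a sketch: the loss values passed in are hypothetical scalars, and the function name and `use_identity` flag are illustrative rather than part of any official API.

```python
def total_generator_objective(adv_g, adv_f, cyc, idt=0.0,
                              lambda_cyc=10.0, use_identity=False):
    """Weighted sum of the adversarial, cycle, and optional identity terms."""
    # The identity weight is tied to the cycle weight: 0.5 * lambda_cyc.
    lambda_id = 0.5 * lambda_cyc if use_identity else 0.0
    return adv_g + adv_f + lambda_cyc * cyc + lambda_id * idt

# Hypothetical loss values, for illustration only.
without_identity = total_generator_objective(0.3, 0.4, cyc=0.05)
with_identity = total_generator_objective(0.3, 0.4, cyc=0.05, idt=0.02,
                                          use_identity=True)
```

With these example numbers the cycle term contributes 10 * 0.05 = 0.5 and the identity term 5 * 0.02 = 0.1, so the reconstruction terms can easily outweigh the adversarial ones.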

Architecture

Generators. The generator network is adapted from the architecture used by Justin Johnson, Alexandre Alahi, and Li Fei-Fei in "Perceptual Losses for Real-Time Style Transfer and Super-Resolution" (ECCV 2016) ^[11]. It consists of an initial 7x7 convolution, two stride-2 downsampling convolutions, a stack of residual blocks, two fractionally-strided convolutions with stride 1/2 for upsampling, and a final convolution mapping back to RGB ^[1]. The authors specify the depth directly: "We use 6 blocks for 128x128 images and 9 blocks for 256x256 and higher-resolution training images" ^[1]. Instance normalization (Ulyanov et al. 2016) is used throughout instead of batch normalization, which suits the batch size of 1 used during training ^[13].
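To see why instance normalization is a natural fit for a batch size of 1, the following NumPy sketch normalizes each channel of each image over its own spatial dimensions. It omits the optional learnable scale and shift, and the `eps` value is an assumed default rather than a figure from the paper.

```python
import numpy as np

def instance_norm(x, eps=1e-5):
    """Normalize each channel of each sample over its own spatial dimensions.

    x has shape (N, C, H, W). Statistics are computed per sample and per
    channel, so the result does not depend on what else is in the batch.
    """
    mean = x.mean(axis=(2, 3), keepdims=True)
    var = x.var(axis=(2, 3), keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

rng = np.random.default_rng(0)
single = rng.normal(loc=3.0, scale=2.0, size=(1, 3, 8, 8))  # a batch of one image
out = instance_norm(single)

# Adding a second, very different image to the batch leaves the first output unchanged.
batch = np.concatenate([single, single * 5.0])
same = np.allclose(instance_norm(batch)[0], out[0])
```

Batch normalization would instead pool statistics across the batch dimension, which carries no extra information when the batch holds a single image.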

Discriminators. Both D_X and D_Y are 70x70 PatchGAN discriminators in the style introduced by Isola et al. for pix2pix ^[2]^[1]. The PatchGAN classifies overlapping 70 by 70 patches of the image as real or fake and averages the responses across the image, producing a single scalar output for the loss. PatchGANs have far fewer parameters than full-image discriminators, run faster, and focus the model on local texture and structure rather than global layout, which suits translation tasks that mainly change appearance while keeping the scene layout.
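The 70 by 70 figure is the receptive field of a single discriminator output, and it can be checked with a short calculation. The layer stack below (three 4x4 convolutions with stride 2 followed by two 4x4 convolutions with stride 1) is an assumption based on the commonly used default PatchGAN configuration, not something stated in the text above.

```python
def receptive_field(layers):
    """Receptive field of one output unit for a stack of (kernel, stride) conv layers."""
    rf = 1
    # Walk from the output back to the input, growing the field at each layer.
    for kernel, stride in reversed(layers):
        rf = (rf - 1) * stride + kernel
    return rf

# Assumed layer stack: (kernel size, stride) for each convolution, input to output.
patchgan_layers = [(4, 2), (4, 2), (4, 2), (4, 1), (4, 1)]
rf = receptive_field(patchgan_layers)  # 70
```

Each output score therefore depends only on a 70 by 70 window of the input, which is why the discriminator judges local patches rather than the whole image.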

Training

Optimizer: Adam, with beta_1 = 0.5 and beta_2 = 0.999 ^[1].
Learning rate: 2e-4 for the first 100 epochs, then linearly decayed to zero over the next 100 epochs, for 200 epochs total. The paper notes it keeps "the same learning rate for the first 100 epochs and linearly decay the rate to zero over the next 100 epochs" ^[1].
Batch size: 1 (instance normalization is used because batch normalization with batch size 1 is degenerate) ^[1].
Image buffer: a history pool of 50 generated images is kept and used for discriminator updates, following Shrivastava, Pfister, Tuzel, Susskind, Wang, and Webb's "Learning from Simulated and Unsupervised Images through Adversarial Training" (CVPR 2017) ^[12]. The authors "keep an image buffer that stores the 50 previously created images" ^[1]. Sampling old generated images stabilizes the discriminator and reduces oscillation.
Initialization: weights drawn from a Gaussian distribution with mean 0 and standard deviation 0.02 ^[1].
Discriminator updates: the loss for D is divided by 2, slowing the discriminator relative to the generator ^[1].
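The learning-rate schedule from the list above can be written as a small function. This is a sketch under stated assumptions: epochs are 0-indexed, and the exact handling of the endpoints (the decay starts at epoch 100 and reaches zero at epoch 200) is a choice made here, not necessarily the one in the official code.

```python
def learning_rate(epoch, base_lr=2e-4, constant_epochs=100, decay_epochs=100):
    """Learning rate for a 0-indexed epoch: flat, then linear decay toward zero."""
    if epoch < constant_epochs:
        return base_lr
    remaining = constant_epochs + decay_epochs - epoch
    return base_lr * max(0.0, remaining / decay_epochs)

# Sample the schedule at a few epochs.
schedule = {epoch: learning_rate(epoch) for epoch in (0, 99, 150, 200)}
```

Halfway through the decay phase (epoch 150) the rate has dropped to half of its initial value, and it is clamped at zero from epoch 200 onward.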

The full training of a single CycleGAN model takes on the order of one to two days on a single GPU for the standard 256 by 256 datasets reported in the paper.
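The history pool of 50 generated images listed in the training settings can be sketched with the standard library alone. The class name and the 50/50 rule for choosing between the newest image and a stored one are assumptions that mirror common implementations; the text above only fixes the buffer size.

```python
import random

class ImagePool:
    """History buffer of previously generated images for discriminator updates."""

    def __init__(self, size=50, seed=None):
        self.size = size
        self.images = []
        self.rng = random.Random(seed)

    def query(self, image):
        """Store `image` and return the image the discriminator should see."""
        if self.size == 0:
            return image                      # buffer disabled
        if len(self.images) < self.size:
            self.images.append(image)         # still filling: keep and return the new image
            return image
        if self.rng.random() < 0.5:
            idx = self.rng.randrange(self.size)
            old = self.images[idx]
            self.images[idx] = image          # swap the new image into the buffer
            return old                        # and show the discriminator an older one
        return image                          # otherwise pass the new image straight through

pool = ImagePool(size=50, seed=0)
seen = [pool.query(f"fake_{i}") for i in range(200)]  # strings stand in for image tensors
```

Because the discriminator sometimes sees outputs from earlier generator states, it cannot overfit to the generator's most recent behavior, which is the stabilizing effect described above.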

What were the results in the original paper?

The authors evaluated CycleGAN on a wide spectrum of tasks. The most widely circulated images come from a small set of domain pairs.

Task	Source domain	Target domain	Notes
Object transfiguration	Horse	Zebra	The single most reproduced CycleGAN demo
Object transfiguration	Apple	Orange	Both directions
Season transfer	Yosemite summer	Yosemite winter	Snowfall and color shifts
Collection style transfer	Photograph	Monet painting	Also Cezanne, Van Gogh, Ukiyo-e
Photo enhancement	iPhone snapshot	DSLR-quality bokeh	Shallow depth of field
Map translation	Aerial photo	Google Maps style	And the reverse
Cityscapes	Semantic labels	Street photographs	Compared head-to-head with pix2pix

On the Cityscapes label-to-photo task, where paired data is available, the authors used pix2pix as a paired baseline. pix2pix produced sharper and more accurate results, as expected, but CycleGAN closed much of the gap without ever seeing aligned pairs ^[1]. Quantitative evaluation used three protocols: perceptual studies on Amazon Mechanical Turk (AMT), where crowd workers judged real versus generated images; an "FCN-score" that runs a pretrained fully-convolutional segmentation network on generated photos to check that they are semantically interpretable; and standard semantic segmentation metrics (per-pixel accuracy, per-class accuracy, and mean class Intersection-over-Union) on the Cityscapes task ^[1]. The original paper does not report the Fréchet Inception Distance (FID), which was proposed later in 2017; FID only became a common CycleGAN benchmark in follow-up work.
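The three segmentation metrics named above can all be derived from a confusion matrix. The sketch below assumes every class appears at least once in the ground truth (otherwise a division by zero occurs), and the 2-class matrix is made up purely for illustration.

```python
import numpy as np

def segmentation_scores(confusion):
    """Per-pixel accuracy, per-class accuracy, and mean class IoU.

    confusion[i, j] counts pixels whose true class is i and predicted class is j.
    """
    confusion = np.asarray(confusion, dtype=float)
    true_positive = np.diag(confusion)
    gt_pixels = confusion.sum(axis=1)      # pixels per ground-truth class
    pred_pixels = confusion.sum(axis=0)    # pixels per predicted class
    per_pixel_acc = true_positive.sum() / confusion.sum()
    per_class_acc = np.mean(true_positive / gt_pixels)
    class_iou = np.mean(true_positive / (gt_pixels + pred_pixels - true_positive))
    return per_pixel_acc, per_class_acc, class_iou

# Hypothetical 2-class confusion matrix, for illustration only.
pixel_acc, class_acc, mean_iou = segmentation_scores([[8, 2],
                                                      [1, 9]])
```

Per-pixel accuracy is dominated by large classes, whereas per-class accuracy and mean IoU weight every class equally, which is why the three are usually reported together.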

What are CycleGAN's limitations?

The authors were unusually candid about the failure modes of their method, devoting an entire section of the paper to limitations. They summarize the pattern directly: "On translation tasks that involve color and texture changes, as many of those reported above, the method often succeeds. We have also explored tasks that require geometric changes, with little success" ^[1]. The main limitations are:

Geometric and shape changes are weak. CycleGAN learns texture and color mappings well but struggles with translations that require large shape changes. The classic failure mode is dog to cat: the body and pose of the source dog are preserved while the network only changes fur color and texture, producing an obviously wrong cat shape.
Cycle consistency assumes a one-to-one mapping. Many real translation tasks are one-to-many: a given summer photograph has many plausible winter versions (different snowfall, different lighting). The generators are deterministic and the L1 cycle loss pushes them toward a single output per input, so the model cannot capture this multimodality. MUNIT ^[6] later addressed this for unpaired data, and BicycleGAN did so for the paired setting.
Distribution shift at test time. When the test image is far from the training distribution, for example a horse shown from an unusual angle or a photograph with a person on the horse's back, CycleGAN often produces unrealistic outputs. The authors trace the well-known horse-with-rider failure to the training data: "our model was trained on the wild horse and zebra synsets of ImageNet, which does not contain images of a person riding a horse or zebra" ^[1]. The horse-to-zebra demo famously misclassifies riders as part of the horse and turns jackets into zebra stripes in widely shared online examples.
Hallucination in high-stakes domains. Because cycle consistency only requires that a round trip reconstruct the input, a translator can hide information and invent plausible but false structure to help itself reconstruct. In medical imaging this is dangerous: the model can add tumors where none exist or remove real ones, a risk the authors and later analyses of distribution-matching losses explicitly flag ^[17].
Mode collapse can still occur despite the cycle loss, especially without the identity term.
Quality lags behind paired methods. When paired data exists, pix2pix and its successors produce sharper, more accurate results ^[2].
Slow inference relative to feed-forward style transfer networks. A forward pass through the ResNet-style generator at 256 by 256 takes longer than the small networks used for fixed-style stylization.

What are the main CycleGAN variants and extensions?

CycleGAN spawned a large family of follow-up architectures. The table below summarizes the most influential variants and how they differ from the original.

Method	Year	Venue	Authors	Key idea relative to CycleGAN
DualGAN	2017	ICCV	Yi et al.	Concurrent work with the same dual-generator and reconstruction-loss design
DiscoGAN	2017	ICML	Kim et al.	Concurrent work, also uses cross-domain reconstruction
UNIT	2017	NeurIPS	Liu, Breuel, Kautz	Adds a shared latent-space assumption between domains using weight-tied encoders ^[5]
BicycleGAN	2017	NeurIPS	Zhu et al.	Multimodal output for the paired setting
pix2pixHD	2018	CVPR	Wang et al.	High-resolution paired translation
MUNIT	2018	ECCV	Huang, Liu, Belongie, Kautz	Disentangles content and style codes for multimodal unpaired translation
DRIT	2018	ECCV	Lee et al.	Disentangled representation for diverse outputs
StarGAN	2018	CVPR	Choi et al.	Single generator covers many domains using a domain label ^[7]
StarGAN v2	2020	CVPR	Choi et al.	Multi-domain plus multimodal
FUNIT	2019	ICCV	Liu et al.	Few-shot unsupervised translation
U-GAT-IT	2020	ICLR	Kim et al.	Attention modules for selfie-to-anime style change
CUT	2020	ECCV	Park, Efros, Zhang, Zhu	Replaces cycle consistency with a patch-level contrastive loss; one-sided translation, faster training
CycleGAN-VC	2018	EUSIPCO	Kaneko, Kameoka	Voice conversion with the same cycle-consistency principle ^[14]
CycleGAN-VC2	2019	ICASSP	Kaneko et al.	Improved generator and two-step adversarial loss ^[15]

More recent work has begun to replace GANs with diffusion models for unpaired translation (for example UNIT-DDPM and SDEdit-style methods), but the cycle-consistency principle continues to appear as a regularizer in many of these models.

How does CycleGAN compare to other GAN architectures?

The table below compares CycleGAN to other major GAN architectures. The pairing column indicates whether aligned (input, output) pairs are required at training time. The multi-domain column indicates whether a single trained model handles many target domains. The multimodal column indicates whether the model can produce diverse outputs for the same input.

Architecture	Year	Pairs required	Multi-domain	Multimodal	Key paper
Vanilla GAN	2014	n/a	n/a	n/a	Goodfellow et al., NeurIPS 2014
DCGAN	2015	n/a	n/a	n/a	Radford, Metz, Chintala, ICLR 2016
Conditional GAN	2014	depends	yes via label	no	Mirza and Osindero, arXiv 1411.1784
pix2pix	2017	yes	no	no	Isola et al., CVPR 2017
CycleGAN	2017	no	no	no	Zhu et al., ICCV 2017
UNIT	2017	no	no	no	Liu et al., NeurIPS 2017
StarGAN	2018	no	yes	no	Choi et al., CVPR 2018
MUNIT	2018	no	no	yes	Huang et al., ECCV 2018
BicycleGAN	2017	yes	no	yes	Zhu et al., NeurIPS 2017
CUT	2020	no	no	no	Park et al., ECCV 2020
StyleGAN	2018	n/a (unconditional)	n/a	yes	Karras, Laine, Aila, CVPR 2019
BigGAN	2018	n/a (class-conditional)	yes	yes	Brock, Donahue, Simonyan, ICLR 2019

The Wasserstein GAN (WGAN) loss can be substituted for the LSGAN loss in CycleGAN, and several follow-up papers have done so to gain training stability on harder datasets. CycleGAN itself sits firmly in the family of conditional generative models for image-to-image translation.

What is CycleGAN used for?

The table below organizes the most common deployment areas for CycleGAN and the cycle-consistency idea.

Application area	Description	Representative work
Artistic style transfer	Photograph to Monet, Van Gogh, Ukiyo-e, Cezanne and back	Original CycleGAN paper, 2017
Domain adaptation for self-driving	Synthetic GTA-V renders translated to real Cityscapes-style images for training perception models	Hoffman et al., CyCADA, ICML 2018 ^[16]
Medical image modality transfer	CT to MR and MR to CT for treatment planning, segmentation, and dose calculation	Wolterink et al. 2017; Hiasa et al. 2018; many follow-ups
Sim-to-real for robotics	Translating rendered camera images to photorealistic ones, or vice versa	Various Berkeley and Google Brain papers, 2018 onward
Voice conversion	Speaker identity transfer without parallel utterance pairs	CycleGAN-VC, CycleGAN-VC2, MaskCycleGAN-VC by Kaneko and Kameoka
Aerial and satellite imagery	Map style transfer, day-to-night, season change, cross-sensor adaptation	Multiple remote sensing papers
Data augmentation	Synthesizing extra training images in the minority class to balance datasets, especially in medical imaging	Multiple medical AI papers
Privacy and de-identification	Translating face images to anonymized but realistic substitutes	Various face anonymization papers
Text style transfer	Cycle consistency adapted to sequence models for politeness, formality, sentiment changes	Shen et al. 2017; many follow-ups
Art and design tooling	Powering creative tools like Runway ML, Replicate.com demos, and many web apps	Community projects

CycleGAN remains in active production use for stylization tasks, and pretrained CycleGAN models are still distributed on Hugging Face, Replicate, and the official PyTorch repository, more than nine years after the original paper.

Why is CycleGAN important?

The CycleGAN paper has been cited tens of thousands of times, roughly 19,000 records in the SciSpace index ^[20] and more than 30,000 by Google Scholar's broader count, which puts it among the most cited papers in all of computer vision. The cycle-consistency principle has been adapted well beyond images: language pairs in unsupervised machine translation (Lample et al. 2018), graph-to-graph translation, and even cross-modal embedding alignment all use variants of the same idea.

More broadly, CycleGAN demonstrated that adversarial training plus a self-supervised consistency constraint can solve problems that previously seemed to require strong supervision. It established "unpaired image-to-image translation" as a recognized task with its own benchmarks and evaluation protocols. The horse-to-zebra demo became a canonical example used in textbooks and courses to introduce GANs.

From an engineering point of view, the open-sourcing of the official PyTorch repository (junyanz/pytorch-CycleGAN-and-pix2pix) had an outsized effect. The repo combines the pix2pix and CycleGAN codebases, ships pretrained models for the most common domain pairs, and had accumulated more than 25,000 stars on GitHub by 2026, making it one of the most starred computer vision repositories on the platform ^[18]. It has been ported to TensorFlow, Keras, MXNet, JAX, and most other frameworks. Many later projects, including pix2pixHD, BicycleGAN, MUNIT, and CUT, were built directly on the same code structure.

Despite the rise of diffusion models and large pretrained image models for general-purpose translation tasks, CycleGAN and its descendants are still widely used in practice. They are smaller, faster, and easier to train than diffusion alternatives, and for narrow tasks with limited data they often remain the most cost-effective choice.

Is CycleGAN open source?

Yes. The official implementation lives at github.com/junyanz/pytorch-CycleGAN-and-pix2pix under a BSD-style license ^[18]. The repository ships:

Training scripts for both CycleGAN and pix2pix.
Dataset download scripts for the standard benchmarks (horse2zebra, summer2winter_yosemite, monet2photo, vangogh2photo, ukiyoe2photo, cezanne2photo, cityscapes, facades, maps, apple2orange).
Pretrained model weights.
Docker image, Conda environment file, Jupyter notebooks, and Google Colab notebooks.
Multi-GPU training via torchrun and W&B logging integration.

The repository was originally released alongside the paper in 2017 and has been actively maintained since ^[18]. It supports recent Python and PyTorch releases, and a separate Lua/Torch repository (junyanz/CycleGAN) preserves the original implementation used in the ICCV submission ^[17].

Unofficial ports and pretrained checkpoints are available on Hugging Face Hub, Replicate, and many community GitHub repositories. The model weights are small enough to run on a single consumer GPU at inference time, and runtime is dominated by the residual-block stack rather than memory bandwidth, so even older hardware can run translation in real time at 256 by 256 resolution.

References

1. Zhu, J.-Y., Park, T., Isola, P., and Efros, A. A. (2017). "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks." *Proceedings of the IEEE International Conference on Computer Vision (ICCV) 2017*, pp. 2242-2251. arXiv:1703.10593.
2. Isola, P., Zhu, J.-Y., Zhou, T., and Efros, A. A. (2017). "Image-to-Image Translation with Conditional Adversarial Networks." *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017*. arXiv:1611.07004.
3. Yi, Z., Zhang, H., Tan, P., and Gong, M. (2017). "DualGAN: Unsupervised Dual Learning for Image-to-Image Translation." *ICCV 2017*. arXiv:1704.02510.
4. Kim, T., Cha, M., Kim, H., Lee, J. K., and Kim, J. (2017). "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks (DiscoGAN)." *ICML 2017*. arXiv:1703.05192.
5. Liu, M.-Y., Breuel, T., and Kautz, J. (2017). "Unsupervised Image-to-Image Translation Networks (UNIT)." *NeurIPS 2017*. arXiv:1703.00848.
6. Huang, X., Liu, M.-Y., Belongie, S., and Kautz, J. (2018). "Multimodal Unsupervised Image-to-Image Translation (MUNIT)." *ECCV 2018*. arXiv:1804.04732.
7. Choi, Y., Choi, M., Kim, M., Ha, J.-W., Kim, S., and Choo, J. (2018). "StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation." *CVPR 2018*. arXiv:1711.09020.
8. Park, T., Efros, A. A., Zhang, R., and Zhu, J.-Y. (2020). "Contrastive Learning for Unpaired Image-to-Image Translation (CUT)." *ECCV 2020*. arXiv:2007.15651.
9. Goodfellow, I. et al. (2014). "Generative Adversarial Nets." *NeurIPS 2014*. arXiv:1406.2661.
10. Mao, X., Li, Q., Xie, H., Lau, R. Y. K., Wang, Z., and Smolley, S. P. (2017). "Least Squares Generative Adversarial Networks (LSGAN)." *ICCV 2017*. arXiv:1611.04076.
11. Johnson, J., Alahi, A., and Fei-Fei, L. (2016). "Perceptual Losses for Real-Time Style Transfer and Super-Resolution." *ECCV 2016*. arXiv:1603.08155.
12. Shrivastava, A., Pfister, T., Tuzel, O., Susskind, J., Wang, W., and Webb, R. (2017). "Learning from Simulated and Unsupervised Images through Adversarial Training." *CVPR 2017*. arXiv:1612.07828.
13. Ulyanov, D., Vedaldi, A., and Lempitsky, V. (2016). "Instance Normalization: The Missing Ingredient for Fast Stylization." arXiv:1607.08022.
14. Kaneko, T. and Kameoka, H. (2018). "CycleGAN-VC: Non-parallel Voice Conversion Using Cycle-Consistent Adversarial Networks." *EUSIPCO 2018*.
15. Kaneko, T., Kameoka, H., Tanaka, K., and Hojo, N. (2019). "CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion." *ICASSP 2019*. arXiv:1904.04631.
16. Hoffman, J. et al. (2018). "CyCADA: Cycle-Consistent Adversarial Domain Adaptation." *ICML 2018*. arXiv:1711.03213.
17. CycleGAN project page: https://junyanz.github.io/CycleGAN/
18. Official PyTorch implementation: https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix
19. Jun-Yan Zhu personal homepage, Carnegie Mellon University: https://www.cs.cmu.edu/~junyanz/
20. SciSpace, citation record for "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks": https://scispace.com/papers/unpaired-image-to-image-translation-using-cycle-consistent-3tefca24mm
