Generative adversarial networks via a composite annealing of noise and diffusion

Kensuke Nakamura, Simon Korman, Byung Woo Hong

Research output: Contribution to journalArticlepeer-review

Abstract

Generative adversarial network (GAN) is a framework for generating fake data using a set of real examples. However, GAN is unstable in the training stage. In order to stabilize GANs, the noise injection has been used to enlarge the overlap of the real and fake distributions at the cost of increasing variance. The diffusion process (or data smoothing in its spatial domain) removes fine details in order to capture the structure and important patterns in data but it suppresses the capability of GANs to learn high-frequency information in the training procedure. Based on these observations, we propose a data representation for the GAN training, called noisy scale-space (NSS), that recursively applies the smoothing with a balanced noise to data in order to replace the high-frequency information by random data, leading to a coarse-to-fine training of GANs. We experiment with NSS using DCGAN and StyleGAN2 based on benchmark datasets in which the NSS-based GANs outperforms the state-of-the-arts in most cases.

Original languageEnglish
Article number110034
JournalPattern Recognition
Volume146
DOIs
StatePublished - Feb 2024

Bibliographical note

Publisher Copyright:
© 2023 Elsevier Ltd

Keywords

  • Coarse-to-fine training
  • Generative adversarial networks
  • Noise injection
  • Optimization
  • Scale-space

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Generative adversarial networks via a composite annealing of noise and diffusion'. Together they form a unique fingerprint.

Cite this