Training Variational Autoencoders with Discrete Latent Variables Using Importance Sampling. Bartler, A., Wiewel, F., Mauch, L., & Yang, B. In 2019 27th European Signal Processing Conference (EUSIPCO), pages 1-5, Sep., 2019.
Training Variational Autoencoders with Discrete Latent Variables Using Importance Sampling [pdf]Paper  doi  abstract   bibtex   
The Variational Autoencoder (VAE) is a popular generative latent variable model that is often used for representation learning. Standard VAEs assume continuous-valued latent variables and are trained by maximization of the evidence lower bound (ELBO). Conventional methods obtain a differentiable estimate of the ELBO with reparametrized sampling and optimize it with Stochastic Gradient Descend (SGD). However, this is not possible if we want to train VAEs with discrete-valued latent variables, since reparametrized sampling is not possible. In this paper, we propose an easy method to train VAEs with binary or categorically valued latent representations. Therefore, we use a differentiable estimator for the ELBO which is based on importance sampling. In experiments, we verify the approach and train two different VAEs architectures with Bernoulli and categorically distributed latent representations on two different benchmark datasets.

Downloads: 0