DOUBLY REPARAMETERIZED GRADIENT ESTIMATORS FOR MONTE CARLO OBJECTIVES

George Tucker, Dieterich Lawson, Shixiang Gu, Chris J. Maddison

ABSTRACT

Deep latent variable models have become a popular model choice due to the scalable learning algorithms introduced by (Kingma & Welling, 2013; Rezende et al., 2014).

These approaches maximize a variational lower bound on the intractable log likelihood of the observed data.

Burda et al. (2015) introduced a multi-sample variational bound, IWAE, that is at least as tight as the standard
variational lower bound and becomes increasingly tight as the number of samples increases.

Counterintuitively, the typical inference network gradient estimator for the IWAE bound performs poorly as the number of samples increases (Rainforth et al., 2018; Le et al., 2018).

Roeder et al. (2017) propose an improved gradient estimator, however, are unable to show it is unbiased. We show that it is in fact biased and that the bias can be estimated efficiently with a second application of the reparameterization trick.

The doubly reparameterized gradient (DReG) estimator does not suffer as the number of samples increases, resolving the previously raised issues.

The same idea can be used to improve many recently introduced training techniques for latent variable models.

In particular, we show that this estimator reduces the variance of the IWAE gradient, the reweighted wake-sleep update (RWS) (Bornschein & Bengio, 2014), and the jackknife variational inference (JVI) gradient (Nowozin, 2018).

Finally, we show that this computationally efficient, unbiased drop-in gradient estimator translates to improved performance for all three objectives on several modeling tasks.

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者
【社區(qū)內(nèi)容提示】社區(qū)部分內(nèi)容疑似由AI輔助生成,瀏覽時(shí)請(qǐng)結(jié)合常識(shí)與多方信息審慎甄別。
平臺(tái)聲明:文章內(nèi)容(如有圖片或視頻亦包括在內(nèi))由作者上傳并發(fā)布,文章內(nèi)容僅代表作者本人觀點(diǎn),簡書系信息發(fā)布平臺(tái),僅提供信息存儲(chǔ)服務(wù)。

相關(guān)閱讀更多精彩內(nèi)容

  • 嶺南好風(fēng)光2015-7-31 00:25 嶺南好風(fēng)光,我即刻想起那年去三亞時(shí)導(dǎo)游的幽默:海南,插根筷子也發(fā)芽。神奇...
    甘懷閱讀 253評(píng)論 0 0
  • -有合約形式,有外判形式 -市價(jià)大約$60-$70一小時(shí) -有人覺得好爽/不用用腦袋/一份X工作/培訓(xùn)期完結(jié)前已經(jīng)...
    SumLam閱讀 895評(píng)論 0 0
  • 十余年來北大旁聽求學(xué)歲月,時(shí)常涌現(xiàn)在夢(mèng)寐之中。寓居北京古城有年,夜半,睡中驚起坐,暗風(fēng)入寒窗。醒后起視,驚回十年夢(mèng)...
    鄭球洋閱讀 967評(píng)論 0 2
  • 上周六無意中看到一個(gè)白色書架,我曾經(jīng)有兩個(gè)一模一樣的……加上昨晚翻的那本書,今天腦子里一直想這些—— 有一段時(shí)間,...
    Ranrna閱讀 428評(píng)論 0 1

友情鏈接更多精彩內(nèi)容