arxiv:2210.06143

On the Importance of Gradient Norm in PAC-Bayesian Bounds

Published on Oct 12, 2022
Authors:

Abstract

Generalization bounds, which assess the difference between the true risk and the empirical risk, have been studied extensively. However, to obtain such bounds, current techniques rely on strict assumptions such as a uniformly bounded or Lipschitz loss function. To avoid these assumptions, this paper follows an alternative approach: the uniform-boundedness assumptions are relaxed to on-average bounded loss and on-average bounded gradient norm assumptions. Under this relaxation, we propose a new generalization bound that exploits the contractivity of log-Sobolev inequalities. These inequalities introduce an additional loss-gradient norm term into the generalization bound, which intuitively acts as a surrogate for model complexity. We apply the proposed bound to Bayesian deep nets and empirically analyze the effect of this loss-gradient norm term across different neural architectures.
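As context only (not the authors' exact construction), the sketch below shows one way the empirical quantity behind the abstract's loss-gradient norm term could be estimated: the average squared norm of the parameter-gradient of the loss over a data loader. The model, data loader, and loss function in the usage comment are hypothetical placeholders.

```python
import torch
import torch.nn as nn

# Illustrative sketch: estimate the average squared loss-gradient norm of a
# model over a dataset. The paper's bound adds a loss-gradient norm term as a
# surrogate of model complexity; this only shows how such an empirical average
# might be measured, not the bound itself.

def average_grad_norm_sq(model: nn.Module, loader, loss_fn) -> float:
    """Average over batches of the squared norm of d(loss)/d(parameters)."""
    total, batches = 0.0, 0
    params = [p for p in model.parameters() if p.requires_grad]
    for x, y in loader:
        loss = loss_fn(model(x), y)
        grads = torch.autograd.grad(loss, params)
        total += sum(g.pow(2).sum().item() for g in grads)
        batches += 1
    return total / max(batches, 1)

# Hypothetical usage:
# model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))
# norm_sq = average_grad_norm_sq(model, train_loader, nn.CrossEntropyLoss())
```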
