Local Regularizer Improves Generalization

Authors

  • Yikai Zhang Rutgers University
  • Hui Qu Rutgers University
  • Dimitris Metaxas Rutgers University
  • Chao Chen Stony Brook University

DOI:

https://doi.org/10.1609/aaai.v34i04.6167

Abstract

Regularization plays an important role in generalization of deep learning. In this paper, we study the generalization power of an unbiased regularizor for training algorithms in deep learning. We focus on training methods called Locally Regularized Stochastic Gradient Descent (LRSGD). An LRSGD leverages a proximal type penalty in gradient descent steps to regularize SGD in training. We show that by carefully choosing relevant parameters, LRSGD generalizes better than SGD. Our thorough theoretical analysis is supported by experimental evidence. It advances our theoretical understanding of deep learning and provides new perspectives on designing training algorithms. The code is available at https://github.com/huiqu18/LRSGD.

Downloads

Published

2020-04-03

How to Cite

Zhang, Y., Qu, H., Metaxas, D., & Chen, C. (2020). Local Regularizer Improves Generalization. Proceedings of the AAAI Conference on Artificial Intelligence, 34(04), 6861-6868. https://doi.org/10.1609/aaai.v34i04.6167

Issue

Section

AAAI Technical Track: Machine Learning