Beyond Uniform Lipschitz Condition in Differentially Private Optimization

Rudrajit Das, Satyen Kale, Zheng Xu, Tong Zhang, Sujay Sanghavi

Proceedings of the 40th International Conference on Machine Learning, PMLR 202:7066-7101, 2023.

Abstract

Most prior results on differentially private stochastic gradient descent (DP-SGD) are derived under the simplistic assumption of uniform Lipschitzness, i.e., the per-sample gradients are uniformly bounded. We generalize uniform Lipschitzness by assuming that the per-sample gradients have sample-dependent upper bounds, i.e., per-sample Lipschitz constants, which themselves may be unbounded. We provide principled guidance on choosing the clip norm in DP-SGD for convex over-parameterized settings satisfying our general version of Lipschitzness when the per-sample Lipschitz constants are bounded; specifically, we recommend tuning the clip norm only till values up to the minimum per-sample Lipschitz constant. This finds application in the private training of a softmax layer on top of a deep network pre-trained on public data. We verify the efficacy of our recommendation via experiments on 8 datasets. Furthermore, we provide new convergence results for DP-SGD on convex and nonconvex functions when the Lipschitz constants are unbounded but have bounded moments, i.e., they are heavy-tailed.

Cite this Paper

BibTeX

@InProceedings{pmlr-v202-das23c,  title =  {Beyond Uniform {L}ipschitz Condition in Differentially Private Optimization},  author =       {Das, Rudrajit and Kale, Satyen and Xu, Zheng and Zhang, Tong and Sanghavi, Sujay},  booktitle =  {Proceedings of the 40th International Conference on Machine Learning},  pages =  {7066--7101},  year =  {2023},  editor =  {Krause, Andreas and Brunskill, Emma and Cho, Kyunghyun and Engelhardt, Barbara and Sabato, Sivan and Scarlett, Jonathan},  volume =  {202},  series =  {Proceedings of Machine Learning Research},  month =  {23--29 Jul},  publisher =    {PMLR},  pdf =  {https://proceedings.mlr.press/v202/das23c/das23c.pdf},  url =  {https://proceedings.mlr.press/v202/das23c.html},  abstract =  {Most prior results on differentially private stochastic gradient descent (DP-SGD) are derived under the simplistic assumption of uniform Lipschitzness, i.e., the per-sample gradients are uniformly bounded. We generalize uniform Lipschitzness by assuming that the per-sample gradients have sample-dependent upper bounds, i.e., per-sample Lipschitz constants, which themselves may be unbounded. We provide principled guidance on choosing the clip norm in DP-SGD for convex over-parameterized settings satisfying our general version of Lipschitzness when the per-sample Lipschitz constants are bounded; specifically, we recommend tuning the clip norm only till values up to the minimum per-sample Lipschitz constant. This finds application in the private training of a softmax layer on top of a deep network pre-trained on public data. We verify the efficacy of our recommendation via experiments on 8 datasets. Furthermore, we provide new convergence results for DP-SGD on convex and nonconvex functions when the Lipschitz constants are unbounded but have bounded moments, i.e., they are heavy-tailed.}}

Endnote

%0 Conference Paper%T Beyond Uniform Lipschitz Condition in Differentially Private Optimization%A Rudrajit Das%A Satyen Kale%A Zheng Xu%A Tong Zhang%A Sujay Sanghavi%B Proceedings of the 40th International Conference on Machine Learning%C Proceedings of Machine Learning Research%D 2023%E Andreas Krause%E Emma Brunskill%E Kyunghyun Cho%E Barbara Engelhardt%E Sivan Sabato%E Jonathan Scarlett%F pmlr-v202-das23c%I PMLR%P 7066--7101%U https://proceedings.mlr.press/v202/das23c.html%V 202%X Most prior results on differentially private stochastic gradient descent (DP-SGD) are derived under the simplistic assumption of uniform Lipschitzness, i.e., the per-sample gradients are uniformly bounded. We generalize uniform Lipschitzness by assuming that the per-sample gradients have sample-dependent upper bounds, i.e., per-sample Lipschitz constants, which themselves may be unbounded. We provide principled guidance on choosing the clip norm in DP-SGD for convex over-parameterized settings satisfying our general version of Lipschitzness when the per-sample Lipschitz constants are bounded; specifically, we recommend tuning the clip norm only till values up to the minimum per-sample Lipschitz constant. This finds application in the private training of a softmax layer on top of a deep network pre-trained on public data. We verify the efficacy of our recommendation via experiments on 8 datasets. Furthermore, we provide new convergence results for DP-SGD on convex and nonconvex functions when the Lipschitz constants are unbounded but have bounded moments, i.e., they are heavy-tailed.

APA

Das, R., Kale, S., Xu, Z., Zhang, T. & Sanghavi, S.. (2023). Beyond Uniform Lipschitz Condition in Differentially Private Optimization.Proceedings of the 40th International Conference on Machine Learning, inProceedings of Machine Learning Research 202:7066-7101 Available from https://proceedings.mlr.press/v202/das23c.html.

Movatterモバイル変換

Beyond Uniform Lipschitz Condition in Differentially Private Optimization

Abstract

Cite this Paper

Related Material