Skip to main content

🏢 Jiangnan University

A PID Controller Approach for Adaptive Probability-dependent Gradient Decay in Model Calibration
·2215 words·11 mins· loading · loading
Machine Learning Deep Learning 🏢 Jiangnan University
Deep learning models often suffer from overconfidence; this paper introduces a PID controller to adaptively adjust a probability-dependent gradient decay rate, ensuring consistent optimization of both…