🏢 Jiangnan University
A PID Controller Approach for Adaptive Probability-dependent Gradient Decay in Model Calibration
·2215 words·11 mins·
Machine Learning
Deep Learning
Deep learning models often suffer from overconfidence; this paper introduces a PID controller to adaptively adjust a probability-dependent gradient decay rate, ensuring consistent optimization of both…