Tsinghua U Proposes Stochastic Scheduled Sharpness-Aware Minimization for Efficient DNN Training

While deep neural networks (DNNs) have achieved astonishing performance in solving complex real-life problems, training a good DNN has become increasingly challenging, as it is difficult to ensure the optimizers used will converge to reliable minima with satisfactory model performance when only minimizing the conventional…