Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction