On Large-Batch Training for Deep Learning: Generalization ...
Nitish Shirish Keskar, Dheevatsa Mudigere, Jorge Nocedal, Mikhail Smelyanskiy,
(1) A can be the identity matrix I_n