Articles

Optimization Algorithms

So... do I use vanilla SGD? SGD with momentum? Nesterov? RMSProp? Adagrad? Adadelta? Adam?! Lookahead? I am new to this deep learning business and after d...