alpha
beta, number of hidden units, mini batch size
number of layers, learning rate decay
beta1, beta2, epsilon
1.Try random values. Don't use a grid 2.Use coarse to fine sampling scheme
Last updated 5 years ago