This note is based on the Coursera course by Andrew Ng.
(This is just a study note for me. Some sentences may be copied or awkward because I am not a native speaker. But I want to learn Deep Learning in English, so it will get better and better :))
INTRO
Adam stands for Adaptive Moment Estimation.
The Adam optimization algorithm basically takes Momentum and RMSprop and puts them together.
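Concretely, on iteration t the update (using the notation from the course, shown here for W; the same form applies to b) looks roughly like this:

```latex
% Momentum-style first moment and RMSprop-style second moment
v_{dW} = \beta_1 v_{dW} + (1-\beta_1)\, dW, \qquad
s_{dW} = \beta_2 s_{dW} + (1-\beta_2)\, dW^{2}

% Bias correction
v_{dW}^{\text{corrected}} = \frac{v_{dW}}{1-\beta_1^{t}}, \qquad
s_{dW}^{\text{corrected}} = \frac{s_{dW}}{1-\beta_2^{t}}

% Combined parameter update
W := W - \alpha\, \frac{v_{dW}^{\text{corrected}}}{\sqrt{s_{dW}^{\text{corrected}}} + \varepsilon}
```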
MAIN
The algorithm has a number of hyperparameters; a small code sketch follows the list.
- α : needs to be tuned
- β1 : default is 0.9 (this computes the moving weighted average of dW and db, the Momentum part)
- β2 : default is 0.999 (this computes the moving weighted average of dW² and db², the RMSprop part)
- ε : default is 10^-8
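As a minimal NumPy sketch (the function name adam_update and its signature are my own choices for illustration, not from the course), one Adam step for a single parameter array might look like this:

```python
import numpy as np

def adam_update(w, dw, v, s, t, alpha=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam step for a parameter array w with gradient dw.

    v : running average of gradients          (Momentum part)
    s : running average of squared gradients  (RMSprop part)
    t : 1-based iteration counter, used for bias correction
    """
    # Moving weighted average of the gradient (Momentum)
    v = beta1 * v + (1 - beta1) * dw
    # Moving weighted average of the squared gradient (RMSprop)
    s = beta2 * s + (1 - beta2) * (dw ** 2)

    # Bias correction so the averages are not biased toward zero early on
    v_corr = v / (1 - beta1 ** t)
    s_corr = s / (1 - beta2 ** t)

    # Combined parameter update
    w = w - alpha * v_corr / (np.sqrt(s_corr) + eps)
    return w, v, s

# Tiny usage example: minimize f(w) = (w - 3)^2, whose gradient is 2*(w - 3)
w = np.array([0.0])
v = np.zeros_like(w)
s = np.zeros_like(w)
for t in range(1, 501):
    dw = 2 * (w - 3.0)
    w, v, s = adam_update(w, dw, v, s, t, alpha=0.1)
print(w)  # should be close to 3
```

In practice only α usually needs tuning; β1, β2, and ε are normally left at the defaults listed above.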
CONCLUSION