
[Neural Network and Deep Learning] Hyperparameters

김 정 환 2020. 3. 27. 15:02

This note is based on the Coursera course by Andrew Ng.

(This is just a study note for me. Some sentences may be copied or awkward because I am not a native speaker, but I want to learn Deep Learning in English, so everything will get better and better :))

 

 

INTRO

What are hyperparameters? 

 

MAIN

The parameters of our model are W and b, but there are other things we need to choose, such as the learning rate alpha, the number of iterations, the number of hidden layers, the number of hidden units, and the choice of activation function. These settings control the ultimate parameters W and b, so we call all of the things above hyperparameters.
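To see how a hyperparameter like alpha controls the learned parameters W and b, here is a minimal sketch in NumPy: gradient descent on a toy one-feature linear model. The data, the alpha value, and the iteration count are all illustrative assumptions, not from the course.

```python
import numpy as np

# Hyperparameters: chosen by us before training (hypothetical values for illustration)
alpha = 0.01           # learning rate
num_iterations = 1000  # number of gradient-descent steps

# Toy data for a linear model y = W*x + b (true W = 2.0, true b = 1.0, plus noise)
rng = np.random.default_rng(0)
X = rng.normal(size=100)
y = 2.0 * X + 1.0 + 0.1 * rng.normal(size=100)

# Parameters: learned by training
W, b = 0.0, 0.0
for _ in range(num_iterations):
    y_hat = W * X + b
    dW = np.mean(2 * (y_hat - y) * X)  # gradient of mean squared error w.r.t. W
    db = np.mean(2 * (y_hat - y))      # gradient w.r.t. b
    W -= alpha * dW                    # alpha controls how W and b are updated
    b -= alpha * db

print(W, b)  # should approach the true values 2.0 and 1.0
```

The update lines show the relationship directly: W and b are what training produces, while alpha and num_iterations shape how that training proceeds.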

 

When we are training a deep network for our own application, we find that there may be many possible settings for the hyperparameters that we simply need to try out. So applied deep learning today is a very empirical process, where we often start with an idea for the best values of the hyperparameters.

 

For example, suppose alpha = 0.01 is the value we want to try. We try it out and see how it works, and based on that outcome we might want to change the learning rate to 0.05.
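That try-and-compare loop can be sketched as code. This is a hypothetical example on the same kind of toy linear-regression problem: train once per candidate alpha and compare the final loss (too small an alpha converges slowly; too large a value can diverge).

```python
import numpy as np

# Toy data (illustrative assumption, not from the course)
rng = np.random.default_rng(0)
X = rng.normal(size=100)
y = 2.0 * X + 1.0 + 0.1 * rng.normal(size=100)

def train(alpha, num_iterations=200):
    """Run gradient descent with the given learning rate; return final MSE."""
    W, b = 0.0, 0.0
    for _ in range(num_iterations):
        y_hat = W * X + b
        W -= alpha * np.mean(2 * (y_hat - y) * X)
        b -= alpha * np.mean(2 * (y_hat - y))
    return np.mean((W * X + b - y) ** 2)

# Try out a few candidate learning rates and see how each one works
for alpha in [0.001, 0.01, 0.05]:
    print(f"alpha={alpha}: final loss {train(alpha):.4f}")
```

Each run tells us something, and based on the outcomes we pick the next value to try, exactly the empirical loop described above.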

 

 

CONCLUSION

One rule of thumb is to try a few values for the hyperparameters and check whether a better value exists; as we do so, we slowly gain intuition about which hyperparameters work best for our problem. It may seem like an unsatisfying part of deep learning that we just have to try out all the values, but this is one area where deep learning research is still advancing.

 
