This note is based on Coursera course by Andrew ng. (It is just study note for me. It could be copied or awkward sometimes for sentence anything, because i am not native. But, i want to learn Deep Learning on English. So, everything will be bettter and better :)) INTRO The name Softmax comes from constrasting it to a Hardmax which would have taken the vector Z and matched it to vector like this ..