Discussion about this post

User's avatar
Shanya Chaubey's avatar

The simplicity of the explanation was very helpful.

Thank you for creating this

Expand full comment
siyu's avatar

In the middle of the article H(x) shown is 3x3 whereas softmax output G(x) has 4 values. That illustration is a bit confusing.

Is softmax distribution calculated per row or per column of H(x)?

Thanks

Expand full comment
15 more comments...

No posts