top of page

It's a simple three layer neural network.  The input layer consists of 401 units: 20 x 20 image input and a bias unit. Hidden layer size is 25. Sigmoid functions are chosen as activation function. The output is a vector of length 10, each unit represents the probability that the input image is a certain number.

bottom of page