Deep Learning: Foundations and Concepts

285

Exercises Hence, show that the expected error function for this dropout model is given by E [E(W)] =

N X K X n=1 k=1

( ynk − ρ

+ ρ(1 − ρ)

D X

)2 wki xni

(9.72)

i=1

N X K X D X

2 2 wki xni .

(9.73)

n=1 k=1 i=1

Thus, we see that the expected error function corresponds to a sum-of-squares error with a quadratic regularizer in which the regularization coefficient is scaled separately for each input variable according to the data values seen by that input. Finally, write down a closed-form solution for the weight matrix that minimizes this regularized error function.

Turn static files into dynamic content formats.

Create a flipbook

Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.