Hello andersbll,
Thanks for your code; it is very useful to me.
I read through it and have a question.
Line 68 in layers.py:
self.dW = np.dot(self.last_input.T, output_grad)/n - self.weight_decay*self.W
For L2 regularization, I think this line should be changed to:
self.dW = np.dot(self.last_input.T, output_grad)/n + self.weight_decay*self.W
Could you explain why you use "- self.weight_decay*self.W"?
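To make the question concrete, here is a minimal sketch of a numerical gradient check for the two sign choices. It is not taken from layers.py: the linear layer with squared error, the helper names (loss, grad, numerical_grad), and the random data are all assumptions for illustration. It also assumes the penalty 0.5*weight_decay*sum(W**2) is added to the loss and that the weights are updated by gradient descent (W -= learning_rate*dW). If the update step adds dW instead, or the sign is flipped elsewhere, the conclusion changes.

```python
import numpy as np


def loss(W, X, Y, weight_decay):
    # Hypothetical loss: mean squared error of a linear layer plus L2 penalty.
    n = X.shape[0]
    residual = X.dot(W) - Y
    data_loss = 0.5 * np.sum(residual ** 2) / n
    return data_loss + 0.5 * weight_decay * np.sum(W ** 2)


def grad(W, X, Y, weight_decay, sign):
    # Same shape as the expression in question; `sign` selects + or - decay.
    n = X.shape[0]
    output_grad = X.dot(W) - Y  # dLoss/dOutput for squared error
    return np.dot(X.T, output_grad) / n + sign * weight_decay * W


def numerical_grad(W, X, Y, weight_decay, eps=1e-6):
    # Central finite differences, one weight at a time.
    g = np.zeros_like(W)
    for idx in np.ndindex(*W.shape):
        W_plus = W.copy()
        W_minus = W.copy()
        W_plus[idx] += eps
        W_minus[idx] -= eps
        g[idx] = (loss(W_plus, X, Y, weight_decay)
                  - loss(W_minus, X, Y, weight_decay)) / (2 * eps)
    return g


rng = np.random.RandomState(0)
X = rng.randn(20, 4)   # made-up inputs
Y = rng.randn(20, 3)   # made-up targets
W = rng.randn(4, 3)    # made-up weights
weight_decay = 0.1

num = numerical_grad(W, X, Y, weight_decay)
err_plus = np.max(np.abs(grad(W, X, Y, weight_decay, +1.0) - num))
err_minus = np.max(np.abs(grad(W, X, Y, weight_decay, -1.0) - num))
print("max error with + weight_decay*W:", err_plus)
print("max error with - weight_decay*W:", err_minus)
```

Under these assumptions, only the "+" version matches the numerical gradient of the penalized loss; the "-" version differs from it by 2*weight_decay*W.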
Best regards,
heibanke