Commit b07f4c6d authored by Scott Cyphers, committed by GitHub

Fix MNIST MLP model (#1000)

Removed the ReLU activation that was applied immediately before softmax.
(Applying ReLU directly before softmax is not a good idea: it clamps every negative logit to zero before normalization. The test accuracy of the model is now much better.)
parent 00c4830e
@@ -146,7 +146,7 @@ int main(int argc, const char* argv[])
 auto l1_dot = std::make_shared<op::Dot>(l0, W1, 1);
 auto b1_broadcast = std::make_shared<op::Broadcast>(
     b1, Shape{batch_size, l1_size}, AxisSet{0});
-auto l1 = std::make_shared<op::Relu>(l1_dot + b1_broadcast);
+auto l1 = l1_dot + b1_broadcast;
 // Softmax
 auto softmax = std::make_shared<op::Softmax>(l1, AxisSet{1});
......
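The effect described in the commit message can be illustrated with a minimal standalone sketch. It is not part of the commit: the `softmax` and `relu` helpers below are hypothetical plain C++ stand-ins written for this illustration, not the `op::Softmax` and `op::Relu` ops used in the diff. The sketch only shows that clamping logits before softmax erases the differences between negative logits.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: numerically stable softmax over one row of logits.
std::vector<double> softmax(const std::vector<double>& logits)
{
    assert(!logits.empty());
    // Subtract the maximum logit so that std::exp never overflows.
    const double max_logit = *std::max_element(logits.begin(), logits.end());
    std::vector<double> out(logits.size());
    double sum = 0.0;
    for (std::size_t i = 0; i < logits.size(); ++i)
    {
        out[i] = std::exp(logits[i] - max_logit);
        sum += out[i];
    }
    for (double& p : out)
    {
        p /= sum;
    }
    return out;
}

// Hypothetical helper: element-wise ReLU, clamping negative values to zero.
std::vector<double> relu(const std::vector<double>& x)
{
    std::vector<double> out(x.size());
    std::transform(x.begin(), x.end(), out.begin(),
                   [](double v) { return std::max(v, 0.0); });
    return out;
}
```

For the logits `{2.0, -1.0, -5.0}`, `softmax(relu(logits))` assigns the two negative classes identical probabilities, because both are clamped to zero, while `softmax(logits)` keeps them distinct and gives the top class a larger share.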