I have tried various learning rates like : 0.1 , 0.01 , 0.001 , 1e-3, 1e-4 ,… etc
I also tried to vary the max epochs like 100, 200, 300
but at max I was able to achieve 97%
I plotted the error functions as well ( some results were beyond expectations)
Then I also used scikit leran but then also I got 96% accuracy.
Now, what to do next ??