
Chapter 5: can't reproduce "often end up with more than 95% accuracy in less than 10 epochs" #17

jeffhgs opened this issue Feb 21, 2019 · 6 comments



jeffhgs commented Feb 21, 2019

Chapter 5 reads:

But it’s noteworthy to observe that you often end up with more than 95% accuracy in less than 10 epochs.

However, I don't share this experience.

I have captured some scenarios in a branch.

https://github.com/jeffhgs/deep_learning_and_the_game_of_go/blob/test_nn_chapter_5/code/dlgo/nn/test_nn.py

You can run it as:

(cd code/dlgo/nn && python test_nn.py)

The program downsamples the MNIST training and test data, trains, repeats, and then computes summary statistics.

The program uses the unittest framework for regression testing, and it also emits JSON documents between scenarios for offline analysis.

The program depends on the incomplete branch I made for #12.

The typical fitting accuracy I get on the full MNIST dataset is around 57%.

To help with reproduction, I am holding back all inessential changes, including the performance fix for #11.

I tried to run all the configurations listed in the test, but unfortunately my job ran past the 4-hour timeout I assigned it, so I'll have to raise the timeout and try again. I will attach the 4 instance-hours of data I have.

This is my configuration:

host: AWS t2.large
OS: Ubuntu
Python: 3.6.8


jeffhgs commented Feb 23, 2019

Data from a longer run. Note that this one also timed out, so it doesn't end cleanly, but you can clearly see the fit measurements. Each full-dataset run took just under 2 hours for 10 epochs.

test_nn_jeffhgs_2019-02-20_try3.txt

maxpumperla (Owner) commented

That's pretty strange. Last time I evaluated this, I certainly ended up with 95% plus most of the time. Not that it really matters much; the lesson is that the algorithm learns. But I get your frustration, sorry for the inconvenience.

My suspicion is that I used a different learning rate altogether for my experiments.
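For what it's worth, the learning rate alone can make the difference between converging and not. A minimal sketch (plain Python, my own illustration, not the book's dlgo.nn code) of gradient descent on the one-dimensional loss f(w) = w*w: with a step size small enough the iterate decays geometrically toward the minimum, while a step size that is too large overshoots further on every update and diverges.

```python
def gd(lr, steps=50, w0=1.0):
    """Minimize f(w) = w**2 by gradient descent; the gradient is 2*w."""
    w = w0
    for _ in range(steps):
        w -= lr * 2 * w
    return w

print(gd(0.1))   # shrinks toward the minimum at 0
print(gd(1.1))   # overshoots harder each step and blows up
```

The same sensitivity applies, less cleanly, to the book's MNIST network: a learning rate that worked in one experiment can stall or destabilize training in another.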


arisliang commented Oct 13, 2021

For me, the network doesn't train most of the time.


arisliang commented

A shallower, smaller network architecture, with the dense layers' initial weights scaled by 0.1, worked better for me.

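For anyone else hitting this: the 0.1 scaling plausibly helps because with 784 inputs, unscaled standard-normal weights produce pre-activations large enough to saturate a sigmoid, so the gradient p * (1 - p) is nearly zero and the layer barely trains. A rough NumPy sketch (my own illustration, not code from the book; layer sizes are just examples):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0.0, 1.0, size=(784,))        # stand-in for one MNIST image

def saturation(scale):
    """Fraction of units in a 784->128 sigmoid layer with near-zero gradient."""
    w = scale * rng.standard_normal((128, 784))
    z = w @ x                                  # pre-activations
    p = 1.0 / (1.0 + np.exp(-z))               # sigmoid
    grad = p * (1.0 - p)                       # sigmoid derivative
    return np.mean(grad < 1e-3)                # fraction of effectively dead units

print("scale 1.0:", saturation(1.0))   # most units saturated
print("scale 0.1:", saturation(0.1))   # almost none saturated
```

With the unscaled weights the pre-activations have a standard deviation around 16, far outside the sigmoid's responsive range; scaling by 0.1 brings them back to order 1, which matches the observation that the smaller initial weights train better.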

Rock-New commented

I ran into the same problem. What can I do to solve it?
