
early nn stuff #89

Draft · wants to merge 10 commits into main

Conversation

@jmsull (Collaborator) commented Jun 15, 2023

No description provided.

jmsull commented Jun 15, 2023

Round 1 of very simple Adam opt plots on cdm at fixed background:

delta and v:
[figure: deltac_learning_v1_multnoise0.1_Adam80_1.0]
[figure: vc_learning_v1_multnoise0.1_Adam80_1.0]

reconstructed delta', v':
[figure: deltacprime_learning_v1_multnoise0.1_Adam80_1.0]
[figure: vcprime_learning_v1_multnoise0.1_Adam80_1.0]
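For reference, the shape of a plain Adam loop like the one used in this round (80 iterations at $\eta = 1.0$) can be sketched in a few lines. This is my stand-in, not the actual training code, and it uses a toy quadratic in place of the real ODE-solve loss:

```python
import numpy as np

def adam(grad, theta0, eta=1.0, n_iters=80, beta1=0.9, beta2=0.999, eps=1e-8):
    """Plain Adam loop; 80 iterations at eta = 1.0 matches the run above."""
    theta = np.array(theta0, dtype=float)
    m, v = np.zeros_like(theta), np.zeros_like(theta)
    for t in range(1, n_iters + 1):
        g = grad(theta)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g ** 2
        mhat = m / (1 - beta1 ** t)   # bias-corrected first moment
        vhat = v / (1 - beta2 ** t)   # bias-corrected second moment
        theta = theta - eta * mhat / (np.sqrt(vhat) + eps)
    return theta

# toy stand-in objective: a quadratic bowl instead of the real ODE-solve loss
target = np.array([3.0, -1.0])
fit = adam(lambda th: 2.0 * (th - target), np.zeros(2), eta=0.05, n_iters=2000)
```

Note that near a minimum, Adam's effective step size stays on the order of $\eta$, which is one reason a large fixed $\eta$ can plateau and a schedule of decreasing rates helps.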

jmsull commented Jun 15, 2023

v and v' look pretty bad; lots of room to improve.

Sorry, the title label on the second-to-last plot is wrong; it should say delta'.

jmsull commented Jun 15, 2023

BTW this is a really long-wavelength mode, $k \sim 0.003$

jmsull commented Jun 16, 2023

Now training with 50 iterations of Adam at $\eta=1$, followed by 50 at $\eta=0.1$, 20 at $\eta=0.01$, and finally 10 iterations of BFGS (default hyperparameters). It looks a little better, especially in the solutions, though maybe not so much in the reconstructions of $u'$.
[figure: deltac_learning_v1_multnoise0.1_Adam50_50_20_1.0_0.1_0.01_bfgs]
[figure: vc_learning_v1_multnoise0.1_Adam50_50_20_1.0_0.1_0.01_bfgs]

Reconstruction:
[figure: deltacprime_learning_v1_multnoise0.1_Adam50_50_20_1.0_0.1_0.01_bfgs]
[figure: vcprime_learning_v1_multnoise0.1_Adam50_50_20_1.0_0.1_0.01_bfgs]
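The staged schedule above (Adam at decreasing rates, then a BFGS polish) can be sketched in numpy. The real run presumably goes through the Julia optimizer stack; this is just the shape of the procedure, with a textbook BFGS and a toy quadratic standing in for the ODE-solve loss:

```python
import numpy as np

def adam_stage(grad, theta, eta, n_iters, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam stage at a fixed learning rate eta."""
    m, v = np.zeros_like(theta), np.zeros_like(theta)
    for t in range(1, n_iters + 1):
        g = grad(theta)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g ** 2
        theta = theta - eta * (m / (1 - beta1 ** t)) / (np.sqrt(v / (1 - beta2 ** t)) + eps)
    return theta

def bfgs(f, grad, x, n_iters=10):
    """Textbook BFGS with backtracking Armijo line search."""
    H = np.eye(x.size)                 # inverse-Hessian approximation
    g = grad(x)
    for _ in range(n_iters):
        p = -H @ g
        t, fx = 1.0, f(x)
        while f(x + t * p) > fx + 1e-4 * t * (g @ p) and t > 1e-12:
            t *= 0.5                   # backtrack until Armijo condition holds
        s = t * p
        x_new = x + s
        g_new = grad(x_new)
        y = g_new - g
        sy = s @ y
        if sy > 1e-12:                 # curvature condition; skip update otherwise
            rho, I = 1.0 / sy, np.eye(x.size)
            H = (I - rho * np.outer(s, y)) @ H @ (I - rho * np.outer(y, s)) \
                + rho * np.outer(s, s)
        x, g = x_new, g_new
    return x

# toy quadratic loss standing in for the ODE-solve loss
c = np.array([2.0, -3.0, 0.5])
f = lambda th: np.sum((th - c) ** 2)
grad = lambda th: 2.0 * (th - c)

theta = np.zeros(3)
for eta, n in [(1.0, 50), (0.1, 50), (0.01, 20)]:   # the Adam schedule above
    theta = adam_stage(grad, theta, eta, n)
theta = bfgs(f, grad, theta, n_iters=10)            # final BFGS polish
```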

jmsull commented Jun 16, 2023

The loss curve:

[figure: loss_learning_v1_multnoise0.1_Adam50_50_20_1.0_0.1_0.01_bfgs]

It looks like maybe BFGS is just starting to turn the loss down? But the BFGS iterations are super expensive (I suppose due to the Hessian approximation, even with forward-mode differentiation, which I assume it is using for that).
We should perhaps run this on something with more oomph than my laptop...
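For scale (my own back-of-envelope numbers, not from a profile): BFGS carries a dense n×n inverse-Hessian approximation, so its per-iteration update is O(n²) in the parameter count versus O(n) for Adam, though in practice the cost of extra gradient/function evaluations in the line search may dominate. For the 37->8->8->2 network mentioned below this is still tiny, but it grows fast with width:

```python
# parameter count for a dense MLP: each layer has fan_in*fan_out weights + fan_out biases
def n_params(layers):
    return sum(a * b + b for a, b in zip(layers, layers[1:]))

small = n_params([37, 8, 8, 2])     # the current network
wide = n_params([37, 64, 64, 2])    # a hypothetical wider variant
# BFGS's dense inverse-Hessian approximation stores n**2 entries
small_hessian, wide_hessian = small ** 2, wide ** 2
```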

jmsull commented Jun 16, 2023

Some other observations:

  • The solutions for $\delta$ and $v$ look way better with more optimization, which is encouraging.
  • Both solutions are badly wrong initially, which is perhaps an implementation error in removing the neutrinos from the ICs for this simplified example? I will check on this.
  • Otherwise, what the optimization is doing makes sense: it focuses on the last part of the evolution because the solution is biggest there, so it can afford to do much worse in the initial part of the evolution. We may want to try some of the scaling tricks we talked about today (which are also in the stiff neural ODE paper), or something hackier.
  • The step-like behavior in the $u'$ function is pretty interesting. Maybe I am not using enough weights here: the input is $u$, which has size 37 in this case, and I am only using a 37->8->8->2 network. Going wider will almost certainly help with this, so I can try that.
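On the scaling point: one standard trick, in the spirit of the scalings discussed in the stiff neural-ODE literature, is to weight the residual by the local magnitude of the target so early times are not drowned out by the late-time growth. A toy numpy illustration with a synthetic growing-mode target (my own stand-in, not the actual loss):

```python
import numpy as np

# toy target: a growing mode, largest at late times (stand-in for delta(a))
a = np.linspace(1e-3, 1.0, 200)
target = a.copy()                                  # delta ~ a in matter domination
pred = target * (1 + 0.1 * np.sin(50 * a))        # ~10% relative error everywhere

raw = (pred - target) ** 2                         # dominated by late times
weighted = ((pred - target) / (np.abs(target) + 1e-3)) ** 2  # relative error: all epochs count

raw_loss, weighted_loss = raw.mean(), weighted.mean()
```

With the raw squared error, the last quarter of the evolution dominates the loss; the magnitude-normalized version spreads the penalty roughly evenly across epochs, so the optimizer can no longer "afford" to be wrong early on.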

jmsull commented Jun 16, 2023

Another thing I'm eager to try is adding more data and batching over $k$, which will be closer to what we want to do eventually...
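Structurally, batching over $k$ just means treating each mode as a row of the training set and taking the loss over a minibatch of modes per step. A minimal sketch of that bookkeeping (the "data" here is a made-up array, shapes only, not physics):

```python
import numpy as np

rng = np.random.default_rng(0)

# toy "data": one solution curve per k mode (hypothetical targets, shapes only)
ks = np.logspace(-3, -1, 16)          # batch of k modes
a = np.linspace(1e-2, 1.0, 100)
data = ks[:, None] * a[None, :]       # shape (n_k, n_a)

def batch_loss(pred, batch_idx):
    """Mean squared error over a minibatch of k modes (rows of the data array)."""
    return np.mean((pred[batch_idx] - data[batch_idx]) ** 2)

# one epoch of minibatching over k, batch size 4
idx = rng.permutation(len(ks))
for start in range(0, len(ks), 4):
    batch = idx[start:start + 4]
    loss = batch_loss(np.zeros_like(data), batch)   # gradient step would go here
```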
