NNUE 1: Get reliability score beside value #4051
xefoci7612 started this conversation in Ideas
Replies: 2 comments
-
I don't think classic eval is better, it is just faster. You are suggesting to make this decision based on net output. How will that be more efficient?
-
To give a quantitative answer to your question, one possible way would be to save the classical eval and the NNUE eval alongside the searched eval when collecting the big set of positions for training. With that info in our hands we could:
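As a rough illustration of this data-collection idea, here is a minimal Python sketch of one stored record; the field names are hypothetical, not the actual trainer/binpack format.

```python
from dataclasses import dataclass

@dataclass
class TrainingRecord:
    """One training position with all three evals saved (hypothetical layout)."""
    fen: str             # the position
    searched_eval: int   # target: score from a depth-d search, in centipawns
    classical_eval: int  # hand-crafted (classical) static eval, in centipawns
    nnue_eval: int       # static eval from a pre-trained NNUE, in centipawns

def classical_is_better(rec: TrainingRecord) -> bool:
    """True when the classical eval is closer to the searched target than the NNUE eval."""
    return abs(rec.classical_eval - rec.searched_eval) < abs(rec.nnue_eval - rec.searched_eval)
```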
-
Currently we have a quite complex heuristic to choose which evaluation to use.
Not only is it complex, it is possibly sub-optimal too.
The idea I suggest is the following:
1. In the set of training positions, besides the evaluation value at a given depth, also add the classical eval score (or the diff between classical and target), plus the eval score of a pre-trained NNUE (see point 3).
2. When training, besides training for the target value, also train the net to output a useClassic signal that says whether the classical eval is better, where "better" means closer to the target searched value than the NNUE eval. It is natural to define a loss on it from the differences of the classical and NNUE outputs to the target value (a rough training sketch follows this list).
3. Maybe take an already trained net, keep its weights frozen, and train only the part that produces the new useClassic signal. If this is not possible, then you may want to compute the useClassic loss not against the evaluation of the net currently under training, whose eval scores are still weak, but against a pre-trained, known-good NNUE.
4. At inference time, just read useClassic out of the NNUE output to decide which evaluation to use (see the inference sketch after the list).
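Points 2 and 3 can be made concrete with a rough PyTorch sketch. Everything here is hypothetical: `ValueNet` is a stand-in for an already trained NNUE-style value network (not the real nnue-pytorch model), and the binary-cross-entropy formulation is just one plausible way to train the useClassic signal against the "closer to the searched value" target.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ValueNet(nn.Module):
    """Stand-in for an already trained NNUE-style value net; real inputs
    would be HalfKP/HalfKA features, here just a flat vector."""
    def __init__(self, n_features: int = 256, hidden: int = 32):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(n_features, hidden), nn.ReLU())
        self.value_head = nn.Linear(hidden, 1)

    def forward(self, x):
        return self.value_head(self.body(x))

class ValueWithUseClassic(nn.Module):
    """Point 3: wrap a pre-trained value net, freeze its weights, and add a
    small head that is the only part trained for the useClassic signal."""
    def __init__(self, pretrained: ValueNet):
        super().__init__()
        self.backbone = pretrained
        for p in self.backbone.parameters():
            p.requires_grad = False                   # keep trained weights frozen
        hidden = pretrained.value_head.in_features
        self.use_classic_head = nn.Linear(hidden, 1)  # the only trainable part

    def forward(self, x):
        h = self.backbone.body(x)
        return self.backbone.value_head(h), self.use_classic_head(h)

def use_classic_loss(use_classic_logit, classical_eval, nnue_eval, searched_eval):
    """Point 2: target is 1 when the classical eval is closer to the searched
    (target) value than the pre-trained NNUE eval is, 0 otherwise."""
    target = (torch.abs(classical_eval - searched_eval)
              < torch.abs(nnue_eval - searched_eval)).float()
    return F.binary_cross_entropy_with_logits(use_classic_logit.squeeze(-1), target)

# Only the new head is handed to the optimiser, so the value part is untouched:
# net = ValueWithUseClassic(pretrained_value_net)
# opt = torch.optim.Adam(net.use_classic_head.parameters(), lr=1e-3)
```

If freezing is not workable, the `nnue_eval` term in the target would come from a separate pre-trained, known-good net rather than from the net being trained, as point 3 suggests.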
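Point 4 then becomes a one-line decision at evaluation time. This reuses the `ValueWithUseClassic` sketch above; the function name, the passed-in `classical_eval_fn`, and the 0.5 threshold are illustrative, not Stockfish's actual eval dispatch (which lives in C++).

```python
import torch

def evaluate(position_features: torch.Tensor,
             net,                    # a ValueWithUseClassic-style net
             classical_eval_fn) -> float:
    """Read useClassic out of the net and pick which evaluation to return."""
    with torch.no_grad():
        value, use_classic_logit = net(position_features)
    if torch.sigmoid(use_classic_logit).item() > 0.5:
        return classical_eval_fn()   # the net says classical is more reliable here
    return value.item()              # otherwise trust the NNUE value
```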