
fix dropout = 1.0 issue. If dropout = 1.0, it should not run dropout … #202

Open
wants to merge 3 commits into master

Conversation

mpjlu

@mpjlu mpjlu commented Jul 30, 2018

The Python Dropout op uses the following code to check the keep_prob value:
if tensor_util.constant_value(keep_prob) == 1: return x
If keep_prob is a placeholder, tensor_util.constant_value(keep_prob) returns None, so the if statement is always false and the dropout graph is built even when keep_prob is fed as 1.
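For context, here is a minimal pure-Python sketch (no TensorFlow required) of why that check never fires for a placeholder. `constant_value` below is an illustrative stand-in for `tensor_util.constant_value`, which returns the statically-known value of a tensor, or None when it cannot be inferred:

```python
def constant_value(tensor):
    """Stand-in for tensor_util.constant_value: the statically-known
    value of a tensor, or None if it cannot be inferred (placeholder)."""
    return tensor.get("constant")

def dropout_early_return_fires(keep_prob_tensor):
    # The check from tf.nn.dropout: only skips dropout when the value
    # is known at graph-construction time AND equals 1.
    return constant_value(keep_prob_tensor) == 1

constant_one = {"constant": 1.0}   # keep_prob passed as a Python constant
placeholder  = {"constant": None}  # keep_prob fed through a placeholder

print(dropout_early_return_fires(constant_one))  # True  -> dropout skipped
print(dropout_early_return_fires(placeholder))   # False -> dropout always built
```

Because `None == 1` is False, feeding 1.0 into the placeholder at session-run time still executes the full dropout op.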

@jakeret
Owner

jakeret commented Jul 31, 2018

Thx for your contribution. I see why this is better during training. But how should we control the dropout during validation and prediction? There we want to set the dropout to 1.
Or am I missing something?

@mpjlu
Author

mpjlu commented Aug 2, 2018

For prediction, we don't need dropout.
If dropout is set to 1, the right behavior is for the dropout layer to return its input directly.
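A minimal sketch of that early-return behavior, using inverted dropout in plain Python (illustrative only, not tf_unet's implementation):

```python
import random

def dropout(x, keep_prob):
    """Inverted-dropout sketch: skip all work when keep_prob == 1."""
    if keep_prob == 1.0:
        # The early return this PR wants: no masking, no scaling, no cost.
        return list(x)
    # Otherwise keep each unit with probability keep_prob and rescale
    # the survivors by 1/keep_prob so the expected value is unchanged.
    return [xi / keep_prob if random.random() < keep_prob else 0.0 for xi in x]

x = [1.0, 2.0, 3.0]
print(dropout(x, 1.0))  # [1.0, 2.0, 3.0] -- input returned unchanged
```

With keep_prob < 1, every output element is either 0 or the input scaled by 1/keep_prob; with keep_prob == 1 the function never touches the random number generator at all.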

@jakeret
Owner

jakeret commented Aug 2, 2018

Right. So during training we want dropout to be < 1 and during validation it should be = 1.
How can we control this?

@mpjlu
Author

mpjlu commented Aug 3, 2018

We can create two Unets with different keep_prob values, one for training and one for validation. What do you think?
Since the Dropout layer is very time-consuming, it is better to skip Dropout during validation and inference.
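The two-net idea can be sketched like this; `build_unet` is a hypothetical stand-in for tf_unet's network-construction code, shown here only to illustrate that the validation graph simply omits the dropout op:

```python
# Build the same network twice: once with dropout for training,
# once with keep_prob = 1 (i.e. no dropout op at all) for validation.

def build_unet(keep_prob):
    """Illustrative graph builder: returns the layer list it would create."""
    layers = ["conv1", "conv2"]
    if keep_prob < 1.0:
        # The dropout op only exists in the training graph.
        layers.append("dropout(keep_prob=%s)" % keep_prob)
    layers.append("output")
    return layers

train_net = build_unet(keep_prob=0.75)
valid_net = build_unet(keep_prob=1.0)
print(train_net)  # ['conv1', 'conv2', 'dropout(keep_prob=0.75)', 'output']
print(valid_net)  # ['conv1', 'conv2', 'output']
```

The trainable layers (the conv layers here) are identical in both graphs; only the parameter-free dropout op differs.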

@jakeret
Owner

jakeret commented Aug 3, 2018

Don't we have to train two models then?
I wasn't aware that dropout is so time-consuming. How much does it affect training/validation performance?

@mpjlu
Author

mpjlu commented Aug 6, 2018

For inference, Dropout takes about 16% of the iteration time (second row of the profile below).
We don't need to train two models; we just need to create a new model (with keep_prob = 1) for inference/validation.
[profiling screenshot: per-op timings, with Dropout in the second row]

@mpjlu
Author

mpjlu commented Aug 15, 2018

Hi @jakeret, any comments on the data? The measurements were taken on CPU.

@jakeret
Owner

jakeret commented Aug 16, 2018

A 16% performance improvement is nice.
However, I still don't fully understand what the training/validation procedure would look like. If a new model is created for validation, how would you transfer the learned weights?

@mpjlu
Author

mpjlu commented Sep 18, 2018

Sorry for the late reply.
How about passing two nets when creating the Trainer object: a train_net for training and a validation_net for validation? train_net can save the model after each epoch, and validation_net can restore the model for validation. What do you think?
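A sketch of the proposed Trainer flow. `Net`, `save`, and `restore` are illustrative stand-ins, not tf_unet's actual API; the point is only the save-after-epoch / restore-before-validation handshake:

```python
class Net:
    """Hypothetical network holding its dropout setting and weights."""
    def __init__(self, keep_prob):
        self.keep_prob = keep_prob
        self.weights = {}

    def save(self, checkpoint):
        checkpoint.update(self.weights)

    def restore(self, checkpoint):
        self.weights = dict(checkpoint)

class Trainer:
    """Takes both nets: train_net with dropout, validation_net without."""
    def __init__(self, train_net, validation_net):
        self.train_net = train_net
        self.validation_net = validation_net

    def run_epoch(self):
        self.train_net.weights["conv1/w"] = 0.5   # pretend one epoch of training ran
        checkpoint = {}
        self.train_net.save(checkpoint)           # save after the epoch
        self.validation_net.restore(checkpoint)   # restore into the dropout-free net

trainer = Trainer(Net(keep_prob=0.75), Net(keep_prob=1.0))
trainer.run_epoch()
print(trainer.validation_net.weights)  # {'conv1/w': 0.5}
```

Only one model is ever trained; the validation net never updates weights itself, it only restores the latest checkpoint.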

@jakeret
Owner

jakeret commented Sep 19, 2018

I don't see how this could be implemented. The computation graph would be different for the two networks, which makes it hard to transfer the weights from one to the other.

@mpjlu
Author

mpjlu commented Sep 20, 2018

There are no weights in the dropout layer, so it is fine to save the model from the train net and restore it in the validation net.
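That argument can be made concrete with a small sketch: because dropout contributes no trainable variables, the variable sets of the two graphs are identical, so a checkpoint written by one loads cleanly into the other. Variable names below are illustrative, not tf_unet's actual names:

```python
def trainable_variables(with_dropout):
    """Illustrative variable list for the network; dropout adds an op
    to the graph but no variables, so with_dropout changes nothing."""
    return ["conv1/w", "conv1/b", "conv2/w", "conv2/b"]

train_vars = trainable_variables(with_dropout=True)
valid_vars = trainable_variables(with_dropout=False)
print(train_vars == valid_vars)  # True -- checkpoints are interchangeable
```

This is exactly why the restore step in the two-net scheme needs no weight remapping.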
