Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does L_total = 0.5 * L_perceptual + 50.0 * L_style + 25.0 * L_tv only apply to optimize VGG? #20

Open
lovecambi opened this issue Feb 5, 2020 · 2 comments

Comments

@lovecambi
Copy link

lovecambi commented Feb 5, 2020

https://github.com/HCIILAB/Scene-Text-Removal/blob/master/network.py#L78
https://github.com/HCIILAB/Scene-Text-Removal/blob/master/train.py#L167
The trainerV seems to only optimize parameters of VGG. So why this loss is needed?

https://github.com/HCIILAB/Scene-Text-Removal/blob/master/train.py#L157
This line I_comp = where(mask, real, fake) contradicts to the description in the paper where I_comp = where(mask, fake, real).

@zhangshuaitao
Copy link

@lovecambi,The trainerV,trainerD, and trainerG are trained alternately to optimize the results. for
I_comp = where(mask, real, fake), 1 and 0 of the mask represent non-text and text areas, respectively.

@lovecambi
Copy link
Author

@lovecambi,The trainerV,trainerD, and trainerG are trained alternately to optimize the results. for
I_comp = where(mask, real, fake), 1 and 0 of the mask represent non-text and text areas, respectively.

So for this line https://github.com/HCIILAB/Scene-Text-Removal/blob/master/train.py#L119
the mask here also uses 1 to represent non-text region?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants