
GPU requirements? #14

Open
JohnHammell opened this issue Dec 20, 2019 · 12 comments

@JohnHammell

JohnHammell commented Dec 20, 2019

What sort of GPU would be required to run few-shot-vid2vid? Would a GeForce 1050 or 1080 be sufficient?

@mpottinger

mpottinger commented Dec 20, 2019

I am not 100% certain because I have not tried training a model yet, but I am pretty sure the requirements are similar to the original vid2vid, which means they are pretty hefty. A consumer graphics card may not suffice: you need a lot of VRAM (15GB or so) and at least two GPUs. Training on a single GPU is possible but not recommended.

I never trained a model for the original vid2vid because I determined it would be too expensive.

@JohnHammell

Thanks a lot for the information. I looked over the vid2vid details more closely and you're right that it has quite hefty GPU requirements. Since few-shot-vid2vid is also based on pix2pixHD, I quickly looked over that project's page as well; it requires a minimum of 11GB of video RAM, which might rule out the GeForce 1050 through 1080 Ti for this repo (though I'm still not sure whether those requirements apply here exactly).

If anyone reading this has already trained few-shot-vid2vid, please mention how many GPUs you used and which GPU model it was. Thanks in advance for any additional info.

@AaronWong

You can try to:

  1. set --batchSize 1
  2. add --debug and change the debug options in ./options/base_options.py (a rough example command is sketched below)

But if you want a good model result, this requires a long long long training time.
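
For reference, here is a rough sketch of what such a command might look like. This is only an illustration: it assumes the repo's train.py entry point and its pose dataset mode, and the --name/--dataset_mode values (and any other flags your setup needs) may differ.

    # Sketch only: --name and --dataset_mode values are illustrative;
    # other flags required for your dataset/model may be missing here.
    python train.py --name pose_test --dataset_mode fewshot_pose \
        --gpu_ids 0 --batchSize 1 --debug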

@JohnHammell

Hi Aaron, thanks for the info.

By 'long long long' would that be maybe 2 weeks of training with a GeForce 1080? Or more like 2 months?

@k4rth33k

I'm training on a GTX 1060 and it takes 2 hours per epoch, so to get anywhere near those results I guess I need about 400 hours of training (rough arithmetic below). It might be a little less if you are using a GTX 1080.
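
Back-of-the-envelope version of that estimate (the ~200-epoch count is just the number implied by 400 h at 2 h/epoch, not an official figure):

    # ~200 epochs * 2 h/epoch on a GTX 1060 (epoch count inferred, not official)
    echo "$(( 200 * 2 )) hours"   # prints "400 hours"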

@pythagoras000

@k4rth33k how can I find GTX 1060 pricing on AWS? I'd like to get an estimated price for training to achieve results similar to those in the paper.

@k4rth33k

k4rth33k commented Dec 22, 2019

The GTX 1060 is a consumer-grade card, mostly used for content creation and games, so you won't find it on AWS. If you are willing to go with AWS, the options you have (as far as my knowledge goes) are instances with K80, M60 or V100 cards, which are more efficient. You can find the details at https://docs.aws.amazon.com/dlami/latest/devguide/gpu.html. A very vague and rough estimate is that it will cost you around $300-350 if you use a p3.8xlarge instance; I may be wrong about the estimate (a quick sanity check is below).
Edit: The estimate is for training on the pose data that comes with the repo.
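
A quick sanity check on that figure, assuming the p3.8xlarge on-demand rate is roughly $12/hour (the us-east-1 rate at the time was about $12.24/hour; spot instances are cheaper):

    # Assumed rate: ~$12/hour on-demand for p3.8xlarge; check current AWS pricing.
    echo "$(( 300 / 12 ))-$(( 350 / 12 )) instance-hours"   # ≈ 25-29 hours for $300-350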

@pythagoras000

Thanks @k4rth33k. Can you please confirm whether the data that comes with the repo (for pose, but also face and street) is enough to replicate the same results? I thought the data included in the repo was just for demo purposes and was not complete.

For example, here they mention the size of the FaceForensics dataset; can you please confirm which of the sizes we should consider for training (38.5, 10GB, 2TB)?

@AaronWong

Hi JohnHammell & k4rth33k,
The training has 3 parts:

  1. niter_single: # of iter for single frame training
  2. niter: # of iter at starting learning rate
  3. niter_decay: # of iter to linearly decay learning rate to zero

Part 1 (the few-shot single-frame training) is fast, roughly in line with the figure quoted above ("training on a GTX 1060 ... takes 2 hours per epoch"). But parts 2 and 3 (the vid2vid training) take much more time, which depends on your script options (niter_step, n_frames_total, max_dataset_size, ...). Part 2 takes 1.808s per step on two V100s (1.808s * 10000 steps / 3600 ≈ 5 hours per epoch). A sketch of how these options fit into a command line is below.
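
For illustration only: the flag names are the ones mentioned above, the values (and the --name) are placeholders, and the rest of your usual training flags (dataset mode, model options, GPU ids, ...) would still apply.

    # Illustrative values only; adjust to your dataset and GPU budget.
    python train.py --name pose_test \
        --niter_single 50 --niter 100 --niter_decay 100 \
        --niter_step 10 --n_frames_total 30 --max_dataset_size 1000
    # Timing quoted above: 1.808 s/step * 10000 steps / 3600 s/h ≈ 5 hours per epoch on two V100s.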

@ndyashas

ndyashas commented Jan 3, 2020

@AaronWong Thank you for the details! Could you please share the model that you have trained?

@AaronWong

Hi @yashasbharadwaj111,
I'm sorry, I can't share our model, because our dataset is divided into two parts:

  1. Part was collected from YouTube, and we have not obtained the hosts' consent
  2. I don't have the right to share our lab's video data

@danny-wu

Has anyone been able to get the model training with 6GB of VRAM? I understand performance would suffer, but that is the card I have.
