-
Notifications
You must be signed in to change notification settings - Fork 520
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Increase Performance with Vectorized Memory Access #226
base: master
Are you sure you want to change the base?
Conversation
Have you finds anything on 2080? How much have you checked? |
I'm not looking so I can't find anything. It's just for programming challenge... |
could you help out in add these features, addons if --keyspace is 1:3000, new switch --loop --count 2000 --stride 100 |
Hello good morning, I want to know if I put several video cards on the same computer to give you an example 4 video cards, these 4 video cards when running the program would have greater power and speed or not? I await your comments. |
Yes - you will have greater power and speed. |
First of all, thank you very much for your answer and other questions and the video cards can be any model, for example gtx 1080ti 11GB, some 2 video cards and adding rx 580 8GB about 3 video cards and adding rtx 3060TI 8GB, I would have no problems or have to be all the same models and nvidia or AMD ?, I await your answer. |
Yes - they can be any Nvidia cards. I don't know about AMD cards. |
Again thank you very much for responding and I will see to incorporate more video cards then to achieve greater power and speed, I ask you, what video cards do you use? Have you tried the Nvidia GTX, RTX or QUADDRO? Which ones do you recommend using? |
I use 12 Nvidia P106-100 mining cards and 2 Nvidia Tesla K80's. |
were you lucky to use so much power and speed with Bitcarck? |
Not yet - I just started. |
are they on 2 separate PCs or 1? as if it were a mining rig? |
They are on one PC - an old converted mining rig. |
…e changelog file for more informations.
@ByLamacq Is this a patch, which could be also applied to OpenCL? Or is this a CUDA-specific optimization? |
It's not really a patch but yes it's can be apply to Opencl. Amd gpu have also specific microcode for vector data load. So i think this change in cl code can increase performance. |
@BitCrackEvo |
@Uzlopak Actualy, opencl read an array of structure : It's not very good but OpenCl is a high language of programmation so it's depend on the implementation by the compilator... But, you can also try to do that yourself... It's a good training to upgrade your skills. |
Hi @BitCrackEvo I started to dig deeper. Very interesting. Can you Help me with this question on stack overflow? |
This boosted my Jetson Nano about 20% faster. |
Hello,
I changed global memory access from scalar to vector.
Plateform : Ubuntu 16.04, GTX 1050ti, Cuda 10.1 (Up : Original)
Plateform : Ubuntu 18.04, RTX 2080, Cuda 10.2 (Up : Original)
Best regards,
ByLamacq