poor scalability when using multiple gpus #99

Joeyzhouqihui · 2021-11-26T14:20:51Z

When we use multiple GPUs to do sampling with quiver in GPU sampling mode (graph stored in GPU memory), we find that the scalability is poor.

To be specific, we run the example code on reddit, and the sampling cost is about 1.11s when using 1 GPU. We expect the sampling cost with 8 GPUs to be about 8x lower (roughly 0.14s), since all GPUs sample independently. However, when 8 GPUs are used, the sampling cost is 0.79s, which is only about 1.4x faster than 1 GPU and much higher than we expected. In addition, when using 4 GPUs, the sampling cost is 0.66s, which is lower than with 8 GPUs.
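To make the gap concrete, here is a small sketch that turns the three timings above into speedup and parallel efficiency. It uses only the numbers reported in this post; the helper name `scaling_table` is made up for illustration and is not part of quiver.

```python
# Sampling cost in seconds, keyed by number of GPUs (values reported above).
reported = {1: 1.11, 4: 0.66, 8: 0.79}


def scaling_table(times):
    """Return (gpus, measured, ideal, speedup, efficiency) rows.

    ideal      = single-GPU time divided by the GPU count (perfect scaling)
    speedup    = single-GPU time divided by the measured time
    efficiency = speedup divided by the GPU count (1.0 means perfect scaling)
    """
    base = times[1]
    rows = []
    for n in sorted(times):
        speedup = base / times[n]
        rows.append((n, times[n], base / n, speedup, speedup / n))
    return rows


for n, measured, ideal, speedup, efficiency in scaling_table(reported):
    print(
        f"{n} GPU(s): measured {measured:.2f}s, ideal {ideal:.2f}s, "
        f"speedup {speedup:.2f}x, efficiency {efficiency:.0%}"
    )
```

With these inputs the speedup is roughly 1.68x at 4 GPUs and 1.41x at 8 GPUs, so efficiency drops well below half in both cases and adding GPUs beyond 4 makes sampling slower.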

Could you please give us some insight or explanation about this phenomenon? Thank you so much!

ZenoTan · 2021-11-27T00:03:10Z

We will test the sampling scalability on our machine. We suspect it could be contention for some shared resource.

eedalong · 2021-11-27T01:44:54Z

Well, this is strange. It should not happen, because the samplers run on different devices, as you said. We will look into this problem and give you feedback ASAP.

Joeyzhouqihui · 2021-11-28T06:25:18Z

Thank you so much. We look forward to your findings!

Joeyzhouqihui · 2021-12-10T02:01:37Z

Hi, sorry to bother you. I was wondering if there are any profiling results or findings yet?

eedalong assigned eedalong and ZenoTan Nov 27, 2021
