Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

poor scalability when using multiple gpus #99

Open
Joeyzhouqihui opened this issue Nov 26, 2021 · 4 comments
Open

poor scalability when using multiple gpus #99

Joeyzhouqihui opened this issue Nov 26, 2021 · 4 comments
Assignees

Comments

@Joeyzhouqihui
Copy link

When we use multiple gpus to do sampling with quiver in the mode of gpu sampling(graph stored in gpu memory), we found that the scalability is poor.

To be specific, we run the example code on reddit and the sampling cost is about 1.11s when using 1 gpu. We expect the time cost of sampling using 8 gpus to be about 8x lower since all gpus do sampling independently. However, when 8 gpus are used, the sampling cost is 0.79s, which is much higher than we have expected. In addition, when using 4 gpus, the sampling cost is 0.66s which is lower than the case of 8 gpus.

Could you please give us some insight or explanation about this phenomenon? Thank you so much!

@ZenoTan
Copy link
Member

ZenoTan commented Nov 27, 2021

We will test the sampling scalability on our machine. We suspect it could be some contention of shared resource.

@eedalong
Copy link
Member

eedalong commented Nov 27, 2021

Well, this is strange, it should not happen because they are doing on different devices as you said. But we will look into this problem and give you feedback ASAP.

@Joeyzhouqihui
Copy link
Author

Thank you so much and look forward to your findings!

@Joeyzhouqihui
Copy link
Author

Hi, sorry for bothering you. I am a little wondering if there is any profile results or findings?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants