[FEATURE REQUEST] Progress tracking on long-running jobs #217
Comments
Hi @MaybeJustJames, thanks for the suggestion. pySCENIC does actually have progress tracking built in for a number of steps, although it's maybe not always obvious.
But maybe you could tell me your specific use case and what kind of progress tracking would help you?
Hi @cflerin, I ran into this situation when using your standard pipeline with Dask. I had a dataset of 13k cells from 10x human samples. Usually, that kind of data never takes more than 20-30 hours for me. However, this one ran for 5-6 days and still never finished computing the adjacencies. Progress information would have helped especially because I was troubleshooting problems with Dask and trying the solutions discussed in other threads here. Here is what I tried.
Below is what I have on screen and it was there for 5-6 days.
I see some documentation about tracking at https://docs.dask.org/en/latest/diagnostics-distributed.html, but maybe you could give a guideline on how to track this on a local machine?
Thanks a lot for your time.
Ok, thanks for describing your workflow, @saeedfc. If you want to monitor the Dask progress, the first thing I would suggest is to check out the tutorial in Arboreto describing how to connect to the Dask scheduler. Otherwise, I see what you mean about progress reporting to the command prompt, but I don't know if we can really change that. One thing I would suggest if you're having problems with the GRN step is to try the multiprocessing script, which is more stable and gives a more informative progress report.
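For readers wondering what per-task progress reporting looks like in a process-pool setup, here is a minimal, generic sketch using only the Python standard library. It is not the pySCENIC multiprocessing script and does not reflect its internals; `infer_target` and `run_with_progress` are hypothetical names invented for this illustration, and the "work" is a placeholder.

```python
import sys
import time
from concurrent.futures import as_completed


def infer_target(gene):
    """Hypothetical stand-in for one unit of work (e.g. one target gene)."""
    return gene, len(gene)


def run_with_progress(func, items, executor, report_every=1, out=sys.stderr):
    """Submit one task per item and print a progress line as tasks finish.

    `executor` is any concurrent.futures executor, for example a
    ProcessPoolExecutor for process-based parallelism.
    Results are returned in completion order, not submission order.
    """
    items = list(items)
    total = len(items)
    start = time.monotonic()
    futures = [executor.submit(func, item) for item in items]
    results = []
    for done, future in enumerate(as_completed(futures), start=1):
        results.append(future.result())
        # Report every `report_every` completions, and always on the last one.
        if done % report_every == 0 or done == total:
            elapsed = time.monotonic() - start
            print(f"{done}/{total} tasks done, {elapsed:.1f}s elapsed",
                  file=out, flush=True)
    return results
```

To run this across processes, one would create a `concurrent.futures.ProcessPoolExecutor` inside an `if __name__ == "__main__":` guard and pass it as `executor`. Because a line is printed whenever a task completes, a stalled job shows up as a counter that stops advancing rather than a silent terminal.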
Thank you @cflerin. I shall try the multiprocessing script and maybe the Dask scheduler as well. Thanks and kind regards,
Many SCENIC jobs are very long-running, and a user can be left wondering whether any progress is being made. Having a mechanism to track progress would be very useful.