The :doc:`Dask distributed scheduler <scheduling>` provides feedback in two forms:
- A progress bar suitable for interactive use in consoles or notebooks
- An interactive dashboard, containing several plots and tables with live information
.. currentmodule:: dask.distributed
.. autosummary:: progress
The dask.distributed progress bar differs from the ProgressBar
used for
:doc:`local diagnostics <diagnostics-local>`.
The progress
function takes a Dask object that is executing in the background.
# Single machine progress bar
from dask.diagnosics import ProgressBar
with ProgressBar():
x.compute()
# Distributed scheduler ProgressBar
from dask.distributed import Client, progress
client = Client() # use dask.distributed by default
x = x.persist() # start computation in the background
progress(x) # watch progress
x.compute() # convert to final result when done if desired
.. currentmodule:: dask.distributed
.. autosummary:: Client
If Bokeh is installed then the dashboard will start up automatically whenever the scheduler is created. For local use this happens automatically when you create a client with no arguments
from dask.distributed import Client
client = Client() # start distributed scheduler locally. Launch dashboard
It is typically served at http://localhost:8787/status , but may be served elsewhere if this port is taken. The address of the dashboard will be displayed if you are in a Jupyter Notebook.
There are numerous pages with information about task runtimes, communication, statistical profiling, load balancing, memory use, and much more. For more information we recommend the following video guide:
More in-depth technical documentation about Dask's distributed scheduler is available at https://distributed.readthedocs.io/en/latest
.. autofunction:: progress