Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deployment and running of Spark and Dask. #113

Open
bkmgit opened this issue May 25, 2021 · 4 comments
Open

Deployment and running of Spark and Dask. #113

bkmgit opened this issue May 25, 2021 · 4 comments

Comments

@bkmgit
Copy link

bkmgit commented May 25, 2021

Big data software such as Spark and Dask uses ip addresses and fixed ports that are usually not provided by cluster managers such as SLURM. Deployment of these should be made possible on EESSI, and documentation on their use provided. While it is helpful to have these as modules, self installation in user home directory should also be enabled.

@ocaisa
Copy link
Member

ocaisa commented May 25, 2021

This is not related to EESSI (or the software stack in general) but the underlying system and the (network) policies used there. This is not something that EESSI can influence, aside from maybe documenting what tweaking might be required.

@bkmgit
Copy link
Author

bkmgit commented May 25, 2021

NERSC does seem to support this, https://docs.nersc.gov/analytics/dask/ not sure what their setup is though.

@bkmgit
Copy link
Author

bkmgit commented May 25, 2021

A suggested network policy would be helpful to have. Would initially suggest an internal zone for the compute nodes that has many open ports (if using TCP/IP), but which is separated from communication to the outside world.

@ocaisa
Copy link
Member

ocaisa commented May 25, 2021

NERSC does seem to support this, https://docs.nersc.gov/analytics/dask/ not sure what their setup is though.

Sure, many sites support Dask, I'm just saying that it is dependent on network policies (and these can even differ between login and compute nodes).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants